Sitemap Finder & Checker

Find and validate all sitemaps on any website instantly. Discover hidden sitemaps, recursively check sitemap indexes, and extract total URL counts.

How we find sitemaps

1

Check robots.txt

We scan the site's robots.txt file for explicitly declared Sitemap: directives.
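As a rough illustration of this step (a minimal sketch, not the tool's actual implementation), the function below pulls Sitemap: directives out of robots.txt text that has already been downloaded. The name extract_sitemap_urls is hypothetical, and the sketch assumes the field name is matched case-insensitively and that sitemap URLs carry no "#" fragment.

```python
def extract_sitemap_urls(robots_txt: str) -> list[str]:
    """Return the URLs named in Sitemap: lines, in order, without duplicates."""
    urls: list[str] = []
    for raw_line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        # Assumption: sitemap URLs do not contain a '#' fragment.
        line = raw_line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own "://" stays intact.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            url = value.strip()
            if url and url not in urls:
                urls.append(url)
    return urls


if __name__ == "__main__":
    sample = "User-agent: *\nDisallow: /admin\nSitemap: https://example.com/sitemap.xml\n"
    print(extract_sitemap_urls(sample))
```

Python's standard library offers a similar lookup through urllib.robotparser.RobotFileParser.site_maps() (Python 3.8 and later), which may be preferable when the rest of the robots.txt rules are needed too.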

2

Probe Common Paths

We test standard locations like /sitemap.xml and /wp-sitemap.xml in case the sitemap is not declared in robots.txt.
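A small sketch of how the candidate URLs for this step could be built, assuming only the two paths named above. The function name candidate_sitemap_urls and the HTTPS default for bare domains are illustrative assumptions, and the actual HTTP requests are left out.

```python
from urllib.parse import urlsplit

# Only the two paths mentioned in the text; a real checker may probe more.
COMMON_SITEMAP_PATHS = ["/sitemap.xml", "/wp-sitemap.xml"]


def candidate_sitemap_urls(site: str, paths=COMMON_SITEMAP_PATHS) -> list[str]:
    """Build absolute URLs to probe at the root of the given site."""
    # Assumption: a bare domain such as "example.com" means HTTPS.
    if "://" not in site:
        site = "https://" + site
    parts = urlsplit(site)
    # Sitemaps are probed at the site root, so any path in the input is ignored.
    origin = f"{parts.scheme}://{parts.netloc}"
    return [origin + path for path in paths]


if __name__ == "__main__":
    for url in candidate_sitemap_urls("example.com"):
        print(url)
```

Each candidate would then be requested, and any that returns a 404 is discarded.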

3

Resolve Indexes

If a sitemap is an index, we recursively fetch all child sitemaps listed inside it.
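The recursion can be sketched as follows. This is an illustration under stated assumptions rather than the tool's own code: resolve_sitemaps, local_name, and child_locs are hypothetical names, fetch is any caller-supplied function that returns the XML for a URL, and max_depth is an arbitrary guard against runaway nesting.

```python
import xml.etree.ElementTree as ET
from typing import Callable, Optional


def local_name(tag: str) -> str:
    """Strip the '{namespace}' prefix ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def child_locs(root: ET.Element, entry_tag: str) -> list[str]:
    """Collect the <loc> text of every direct child named entry_tag."""
    locs = []
    for entry in root:
        if local_name(entry.tag) != entry_tag:
            continue
        for child in entry:
            if local_name(child.tag) == "loc" and child.text:
                locs.append(child.text.strip())
    return locs


def resolve_sitemaps(url: str, fetch: Callable[[str], str],
                     seen: Optional[set] = None, max_depth: int = 3) -> list[str]:
    """Return the URLs of all non-index sitemaps reachable from url."""
    seen = set() if seen is None else seen
    # Skip URLs already visited (guards against loops) and overly deep nesting.
    if url in seen or max_depth < 0:
        return []
    seen.add(url)
    root = ET.fromstring(fetch(url))
    if local_name(root.tag) != "sitemapindex":
        return [url]  # a plain sitemap: nothing further to resolve
    leaves = []
    for child_url in child_locs(root, "sitemap"):
        leaves.extend(resolve_sitemaps(child_url, fetch, seen, max_depth - 1))
    return leaves
```

Passing fetch in as a parameter keeps the traversal logic independent of the HTTP client, so it can be exercised against an in-memory dictionary of XML strings.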

4

Validate & Count

We check XML validity, parse schemas, and aggregate the total URL count across the entire site.
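A minimal sketch of the counting step, assuming "valid" means only that the document is well-formed XML with a urlset root; the tool's real checks may be stricter. The names count_urls and total_url_count are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Iterable, Optional


def count_urls(xml_text: str) -> Optional[int]:
    """Return the number of <url> entries, or None if the document is
    not well-formed XML or its root element is not <urlset>."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return None
    # ElementTree reports namespaced tags as '{namespace}name'.
    if root.tag.rsplit("}", 1)[-1] != "urlset":
        return None
    return sum(1 for el in root if el.tag.rsplit("}", 1)[-1] == "url")


def total_url_count(documents: Iterable[str]) -> tuple[int, int]:
    """Sum URL counts over sitemap documents.

    Returns (total_urls, invalid_documents)."""
    total, invalid = 0, 0
    for doc in documents:
        n = count_urls(doc)
        if n is None:
            invalid += 1
        else:
            total += n
    return total, invalid
```

Reporting invalid documents separately, rather than silently counting them as zero, keeps a broken child sitemap from hiding behind a plausible-looking total.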

Sitemap Discovery Examples

See what the tool finds for different types of websites. Click any example to load it into the checker above.

WordPress Site

wordpress.org

Since version 5.5, WordPress core generates a wp-sitemap.xml sitemap index that references child sitemaps for posts, pages, categories, and tags.

📄 robots.txt → Sitemap directive found

🗂️ Type → Sitemap Index

📦 Child sitemaps → Multiple (posts, pages, …)

E-Commerce Site

shopify.com

Large commercial sites often have massive sitemap indexes with thousands of product, collection, and blog sitemaps.

📄 robots.txt → Multiple Sitemap directives

🗂️ Type → Sitemap Index (large)

🔢 URL count → Thousands

Small Site

engtools.dev

Small sites typically have a single flat sitemap.xml with all URLs in one file — no index needed.

📄 robots.txt → Sitemap directive found

🗂️ Type → Single urlset

🔢 URL count → Under 100

No Sitemap ⚠️

httpstat.us

Some sites have no sitemap at all: no robots.txt reference and nothing at the common paths. The tool reports this clearly so you know what's missing.

📄 robots.txt → No Sitemap directive

🔍 Common paths → All returned 404

📦 Sitemaps found → 0

Frequently Asked Questions

Why is the tool not finding my sitemap?
If your sitemap is located at a non-standard path (e.g., /custom-name.xml) and is NOT declared in your robots.txt file, our tool cannot discover it. To fix this, always add a Sitemap: https://yourdomain.com/custom-name.xml directive to your robots.txt.
What is a sitemap index?
A sitemap index (<sitemapindex>) is a sitemap of sitemaps. Because the sitemap protocol limits each file to 50,000 URLs (and 50 MB uncompressed), large sites split their URLs across multiple files and list them all in an index. Our tool automatically follows these entries to discover all child sitemaps.
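To make the split concrete, here is a hedged sketch of how a site generator might chunk a URL list at the 50,000-URL limit and write an index that points at the resulting files. The function names and the sitemap-N.xml naming scheme are assumptions for illustration only.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_FILE = 50_000  # per-file limit from the sitemap protocol


def chunk_urls(urls: list[str], size: int = MAX_URLS_PER_FILE) -> list[list[str]]:
    """Split a URL list into consecutive chunks of at most `size` entries."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_sitemap_index(child_urls: list[str]) -> str:
    """Serialize a <sitemapindex> document listing the given child sitemaps."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for child_url in child_urls:
        sitemap = ET.SubElement(root, "sitemap")
        ET.SubElement(sitemap, "loc").text = child_url
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    urls = [f"https://example.com/p/{i}" for i in range(120_001)]
    chunks = chunk_urls(urls)
    # Hypothetical naming scheme for the child files.
    children = [f"https://example.com/sitemap-{n}.xml" for n in range(1, len(chunks) + 1)]
    print(len(chunks), "child sitemaps")
    print(build_sitemap_index(children))
```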
Does this tool crawl my pages?
No. This tool only fetches robots.txt and the XML sitemap files themselves. It does not load your web pages or follow HTML links, which keeps each check fast and lightweight.
What's the difference between this and the Sitemap Validator?
This tool is for discovery — you give it a domain, and it finds all the sitemaps. The Sitemap Validator is for a deep-dive audit on a specific sitemap file, doing rigorous checks on date formats, protocol usage, and duplicate URLs.