Sitemap includes noindex pages: how to audit your sitemap.xml
A page in your sitemap.xml is a request to Google: "please index this." If that same page has a noindex tag, you are simultaneously telling Google to drop it. Google reads the conflict as a quality signal against your whole site. Remove noindex URLs from the sitemap.
A page in your sitemap.xml is a request to Google: "please index this." If that same page has a noindex tag, you are simultaneously telling Google to drop it. Google reads the conflict as a quality signal against your whole site. Remove noindex URLs from the sitemap. sitemap.xml is the list of URLs you want indexed. noindex is a meta tag (or X-Robots-Tag HTTP header) that tells Google to keep a page out of the index. The two must not contradict each other.
Last updated·
What it is
sitemap.xml is the list of URLs you want indexed. noindex is a meta tag (or X-Robots-Tag HTTP header) that tells Google to keep a page out of the index. The two must not contradict each other.
Why it matters
Google explicitly treats conflicting signals as a quality issue. Beyond that, every noindex URL in the sitemap is a wasted crawl request. Sitemap quality also influences how often Google re-crawls — a clean sitemap gets fetched more often, so new content is indexed faster.
How to fix it
- Crawl your site and collect noindex URLs. Most crawlers flag noindex pages explicitly. Save the list.
- Pull your sitemap(s) and diff against the noindex list. Any overlap is a bug. For each overlapping URL, decide: should it be indexed (remove the noindex) or stay out of the index (remove it from the sitemap)?
- Filter your sitemap-generation pipeline. If your CMS generates sitemap.xml automatically (WordPress, Shopify, etc.), check the plugin settings — most have a checkbox to exclude noindex pages. Turn it on.
- Resubmit the cleaned sitemap in Search Console. After fixing, ping Search Console so the new sitemap is re-read. Watch the "Indexed" count over the following 2–4 weeks.
Authoritative sources
- Google Search Central documentation — Google
- Schema.org vocabulary — schema.org
- SEO Starter Guide — Google Search Central
- MDN — HTML meta and link elements — Mozilla MDN