Back to glossary

Noindex

A meta directive that instructs search engines not to include a page in their search index. Noindex prevents a page from appearing in search results while still allowing it to be crawled and its links to be followed.

Noindex is the definitive way to keep a page out of search results. Unlike robots.txt (which blocks crawling but not indexing), a noindex tag ensures the page will not appear in search results even if other sites link to it. Implement noindex via a meta robots tag in the HTML head or an X-Robots-Tag HTTP header.

For engineering teams, noindex should be applied strategically to pages that provide user value but should not appear in search results: internal search results pages, paginated archive pages beyond the first page, user account pages, staging environments, and duplicate content variations. Common mistakes include accidentally noindexing important pages during development and forgetting to remove noindex tags after launching from staging. Build noindex auditing into your CI/CD pipeline to catch unintended noindex tags before they reach production. Monitor the Coverage report in Google Search Console for pages excluded due to noindex and verify they are intentionally excluded. A sudden increase in noindexed pages usually indicates a configuration error.

Related Terms