Glossary · Glossary

Sitemap

A sitemap is a file that lists important URLs and metadata to help search engines discover site content.

Updated Jun 3, 2026 Reviewed Jun 3, 2026 en

A sitemap is a file that lists important URLs and optional metadata to help search engines discover site content. For a static content site, the sitemap is a publication map: it should reflect the canonical, public, indexable pages the site actually wants crawlers to consider.

Most SEO discussions refer to an XML sitemap, often available at /sitemap.xml. HTML sitemaps can also help users navigate a site, but the XML file is the crawler-focused format.

Why it matters

Sitemaps help crawlers find pages that may not be discovered quickly through links alone, especially after launching new glossary terms, reports, comparisons, or localized routes. They can also communicate last-modified metadata and alternate-language relationships when configured correctly.

For GEO and AI search work, the sitemap is not a visibility guarantee, but it is part of making source pages discoverable. A useful page that never appears in crawl paths has less chance of becoming an organic result, supporting link, or cited source.

How it differs

A sitemap helps discovery. It does not guarantee indexing, ranking, traffic, or AI citation. It should support internal linking, not replace it.

A sitemap also differs from a canonical URL. The sitemap should list canonical URLs, but the canonical signal itself comes from consistent URLs, links, redirects, and page-level tags.

Example

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://www.example.com/glossary/canonical-url/</loc>
    <lastmod>2026-06-03</lastmod>
  </url>
  <url>
    <loc>https://www.example.com/guides/geo-vs-seo/</loc>
    <lastmod>2026-06-03</lastmod>
  </url>
</urlset>

The sitemap should list clean public URLs, not draft paths, raw source paths, tracking-parameter variants, or accidental preview routes.

How teams use it

Teams review sitemaps after adding public content, changing routes, adding localized pages, or setting noindex. A practical checklist:

IncludeExclude
Canonical published pagesDraft pages
Stable glossary and guide URLsraw/ source material
Indexable report and comparison pagesParameterized tracking URLs
Reviewed localized pagesNoindex or internal pages

Common misunderstanding

A sitemap is not a search submission queue where every URL gets accepted. It is a discovery hint. Search systems still decide whether a page is crawlable, indexable, canonical, useful, and distinct enough to include in results.

Read next

Use these glossary paths to move from the definition into adjacent concepts, topic clusters, and operator guides.