Orphan Pages

SEO

Pages on a website that have no internal links pointing to them, making them difficult for both users and search engine crawlers to discover.

Definition

An orphan page is a page that exists on a website but has no internal links from any other page. It can still be accessed directly by URL, and it may appear in the sitemap, but no navigation, menu, or content link leads to it from the rest of the site.

Orphan pages are usually accidental: old pages left behind after a site redesign, pages created via CMS workflows that never got added to navigation, landing pages built for a campaign and then forgotten, or dynamically generated pages (faceted search, tags, archives) that are not linked from the main site structure.

Why It Matters

Search engines discover pages primarily by following links. An orphan page may still be crawled if it appears in a sitemap, but search engines such as Google are less likely to treat it as important because no other page on the site links to it or passes authority to it. In practice, orphan pages tend to rank poorly, get crawled less frequently, and accumulate less link equity than well-linked pages.

Orphan pages also signal a broader site health issue. A large number of orphans usually means the internal linking strategy is weak or inconsistent, which affects the entire site's topical authority and crawl efficiency. Regular audits to find and either link, redirect, or remove orphan pages are a standard part of technical SEO maintenance.

Examples

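Orphan pages can be detected by comparing a link-following crawl of the site against an independent list of known URLs, such as the sitemap: any URL in the list that the crawl never reached has no internal link path leading to it. A typical detection workflow using Screaming Frog or a similar crawler looks like this: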

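bash
# 1. Crawl the site starting from the homepage.
#    screamingfrogseospider is the Linux binary name; --headless runs without the UI.
$ screamingfrogseospider --crawl https://example.com \
    --headless \
    --export-tabs "Internal:HTML" \
    --output-folder ./crawl

# 2. Export the list of URLs from your sitemap (grep -P requires GNU grep)
$ curl -s https://example.com/sitemap.xml | \
    grep -oP "(?<=<loc>)[^<]+" > sitemap-urls.txt

# 3. Find URLs in the sitemap but not in the crawl.
#    Take the first CSV column (the URL), strip the quotes, drop header rows,
#    and sort after extracting so comm receives correctly ordered input.
#    Anything left over is a candidate orphan page.
$ comm -23 \
    <(sort -u sitemap-urls.txt) \
    <(cut -d, -f1 crawl/internal_html.csv | tr -d '"' | grep '^http' | sort -u) \
    > orphan-pages.txt

$ wc -l orphan-pages.txt
47 orphan-pages.txt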
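
One caveat with the comparison in step 3: comm matches lines byte for byte, so a sitemap entry such as https://example.com/pricing/ will not match a crawled https://example.com/pricing and will be reported as an orphan. Below is a minimal sketch of a normalization pass that could run on both lists first. It assumes trailing slashes and #fragment suffixes are not significant on your site, which is not true of every server. The function name normalize_urls, the sample file names, and the sample URLs are all hypothetical.

```shell
# Hypothetical helper: read one URL per line on stdin, write a normalized,
# sorted, de-duplicated list to stdout.
# Assumption: trailing slashes and #fragments do not distinguish pages.
normalize_urls() {
    # Drop carriage returns, strip any #fragment, strip trailing slashes,
    # then sort in the C locale so the order matches what comm expects.
    tr -d '\r' |
        sed -e 's/#.*$//' -e 's:/*$::' |
        LC_ALL=C sort -u
}

# Sample data (hypothetical URLs) standing in for the two real lists.
printf '%s\n' \
    'https://example.com/pricing/' \
    'https://example.com/old-campaign' \
    'https://example.com/blog/#top' > sitemap-sample.txt
printf '%s\n' \
    'https://example.com/pricing' \
    'https://example.com/blog' > crawl-sample.txt

normalize_urls < sitemap-sample.txt > sitemap-norm.txt
normalize_urls < crawl-sample.txt > crawl-norm.txt

# Lines that appear only in the sitemap list are the orphan candidates.
LC_ALL=C comm -23 sitemap-norm.txt crawl-norm.txt > orphan-sample.txt
cat orphan-sample.txt
```

With the sample data, only https://example.com/old-campaign is left over. Without normalization, the pricing and blog URLs would also be listed as false positives.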

Each orphan then needs a decision: add internal links to it (if valuable), 301 redirect it (if superseded), or remove it entirely (if obsolete).
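
Before making those decisions by hand, it can help to check what each URL currently returns, since some sitemap entries may already redirect or no longer exist. The following is a rough sketch only: the helper name suggest_action and the mapping from status code to suggestion are assumptions for illustration, not a fixed rule, and whether a live page is valuable still needs human judgment.

```shell
# Hypothetical helper: map an HTTP status code to a suggested next step.
# The mapping below is an illustrative assumption, not an established rule.
suggest_action() {
    case "$1" in
        200)     echo "live: link it, redirect it, or remove it" ;;
        301|308) echo "already redirects: drop it from the sitemap" ;;
        302|307) echo "temporary redirect: confirm the intended target" ;;
        404|410) echo "already gone: drop it from the sitemap" ;;
        *)       echo "unexpected status: check manually" ;;
    esac
}

# Usage sketch (needs network access, so it is left commented out).
# curl's -w '%{http_code}' prints the status of the first response;
# without -L, redirects are reported rather than followed.
#
# while IFS= read -r url; do
#     code=$(curl -s -o /dev/null -w '%{http_code}' "$url")
#     printf '%s\t%s\t%s\n' "$code" "$url" "$(suggest_action "$code")"
# done < orphan-pages.txt
```

The output is a tab-separated triage list that can be sorted by status code, so the pages that still return 200 can be reviewed individually.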
