SEO Fundamentals
How Search Engines Work
Understanding how search engines work helps you make better SEO decisions. Search engines use three main processes: crawling to discover content, indexing to store and organise it, and ranking to determine what appears in search results. Understanding these processes clarifies what you can and cannot control.
Crawling
Crawling is how search engines discover web pages. Search engines use automated programs called crawlers (or spiders) that follow links from page to page, discovering new content and revisiting existing pages to check for updates.
To make your content discoverable, ensure search engines can access it. This means having a clear site structure, using sitemaps, avoiding blocking crawlers unnecessarily, and ensuring important pages are linked from other pages on your site.
Common crawling issues include pages blocked by robots.txt, pages with no internal links, pages behind login walls, or pages that return errors. A clear site structure with logical internal linking helps crawlers discover all your important content.
Indexing
Indexing is how search engines store and organise discovered content. Once a page is crawled, search engines analyse it and decide whether to add it to their index—a massive database of web pages that can be searched.
For content to appear in search results, it must be indexed. Not all crawled content gets indexed—search engines may choose not to index duplicate content, low-quality pages, or pages that don't meet quality guidelines.
You can check if your content is indexed using Google Search Console. If content isn't indexed, common reasons include: the page is blocked, it's marked as noindex, it's duplicate content, or it doesn't meet quality standards. You can use noindex tags to prevent indexing of pages you don't want in search results, such as thank-you pages or internal tools.
Ranking
Ranking is how search engines decide which content to show for specific queries. When someone searches, search engines evaluate indexed pages and rank them based on relevance, quality, and user experience signals.
Ranking algorithms consider many factors: relevance to the query, content quality, user experience signals, technical factors, and trust signals. The exact formula is not public and changes frequently, but the goal is always to show the most useful results for each query.
What businesses can focus on instead of trying to game rankings: creating high-quality, relevant content; ensuring good technical setup; providing excellent user experience; and building trust and authority over time. These factors support ranking, but no one can guarantee specific positions.
Why no one controls rankings
Search engines control rankings, not businesses or SEO agencies. This is important to understand because it sets realistic expectations about what SEO can and cannot achieve.
What businesses can control: content quality, technical setup, user experience, site structure, and how they present information. What they cannot control: exact rankings, algorithm changes, what competitors do, or how search engines interpret their content.
SEO is about optimising what you can control to improve the likelihood of good performance, not about guaranteeing specific rankings. Understanding this helps set realistic expectations and focus on what actually matters: serving customers well.
Examples
Well-structured site for crawling
Example: A coffee shop website with a clear hierarchy: homepage links to main sections (menu, locations, about), each section links to relevant pages, and all pages are accessible via internal links. A sitemap is submitted to Google Search Console, and robots.txt allows crawling of all public pages.
This structure makes it easy for search engines to discover and crawl all important content. Crawlers can follow links logically and understand the site structure.
Poor structure that hinders crawling
Example: A website where important pages have no internal links, the sitemap is outdated, robots.txt blocks important pages unnecessarily, and the site structure is confusing with orphaned pages.
This makes it difficult for search engines to discover content. Important pages may never be crawled, and the site structure is unclear, which can hurt indexing and ranking.
Previous topic
What is SEO?
Introduction to SEO fundamentals
Next topic
Content SEO
Understanding content and SEO
Recommended next steps
Content SEO
Learn how to create content that search engines can understand and that serves customers well.
Technical SEO
Understand the technical foundations that make content discoverable and indexable.
Google Search Console
Learn how to use Google Search Console to monitor how search engines see your site.