If you’ve ever shopped online, you’ve definitely come across faceted navigation — you may just not have realized it!
Ordering by price, filtering by size, and sorting by color are all great examples of faceted navigation.
Simply put, faceted navigation is a way to help visitors navigate and personalize a page to find the exact product they’re looking for. The pages faceted navigations create can also help people searching specific, long-tail queries find the exact products they’re looking for in the search results.
As useful as faceted navigation can be for users, it can cause some serious problems for SEO if you’re not careful. Let’s take a look at what faceted navigation looks like on different types of sites, and what kinds of issues it can cause.
Faceted navigation has become synonymous with e-commerce sites, but retailers aren’t the only ones who use it.
All of these sites are prone to the SEO pitfalls that faceted navigation often causes, so let’s explore what those common issues are.
According to Google, faceted navigation is often not search-friendly.
Why? Because facets create multiple versions of the same URL.
This causes four main SEO problems:
Some SEOs might argue that faceted navigation is a way to easily create pages targeted for long-tail search queries with small but specific search intent. However, you should always first validate whether search intent for that topic actually exists.
Remember, just because you can create a page doesn’t mean you should!
Let’s look at an example of an e-commerce website with less than 200,000 product pages.
When Botify conducted a crawl, following the same rules as the ones this site had set for Google in robots.txt, we found that there were more than 500 million pages accessible.
Remember, this e-commerce site had less than 200,000 products and yet their page count was more than 500 million accessible to search engine bots!
The cause? This site’s faceted navigation was creating an infinite number of combinations for the same URL, and unfortunately, that’s not an uncommon occurrence.
But what do we mean by “multiple versions of the same URL”?
Say, for example, we’re on an e-commerce site that sells computers, and we’re looking at the ‘monitors’ category page.
The main URL for that page would likely be:
Now let’s say there are filters for screen size, resolution, and connectivity, and I select that I want to see 19 inch monitors with 1600×900 resolution that have hdmi connectivity.
After making those selections, I’d end up with a URL that looks something like this:
This is a narrowed-down version of the same monitors page I’ve been on the whole time. This page doesn’t have different content, just less of it.
That page will also be incredibly similar to other narrowed-down versions of the page that are possible for visitors to create, like this:
And, depending on your Content Management System (CMS) and your developers, there could even be different combinations of the same URL — essentially, two pages that both exist and are exactly the same, like:
The only difference between the two pages above is that the facets are listed in different orders, depending on which order the visitor clicked on the filters.
Knowing this, it’s probably no longer surprising that a site with 200,000 product pages could end up with more than 500 million pages.
As good as it is for users to be able to customize their search and find the best product depending on their needs, search engines will certainly have trouble figuring out which pages you want them to index.
Knowing that Google isn’t crawling half the pages on your average enterprise website, you might want to ask the question; which half are they missing?
Are they missing your low-value, duplicate pages? Or are they missing your critical, revenue-driving product pages?
We also know that, with a low ratio of crawled pages, sites have fewer pages generating traffic from organic search, and those numbers only get worse as sites increase in size.
Case in point — we can’t leave it up to bots to guess.
To see if search engine bots like Google are finding your important pages, or wasting time on duplicate facet pages, you can look at your server log files.
To see which of your pages Google has indexed, you can spotcheck URLs in Google Search Console (GSC) or use Botify to see all faceted pages that are indexed.
To see which of these pages are receiving clicks and traffic, you can use Google or Adobe Analytics, GSC for spot checks, or Botify (via our Analytics and GSC integrations).
Ideally, it’s only your primary, valuable pages that Googlebot is crawling, indexing, ranking, and driving traffic to your site. You’ll want Googlebot to stay away from your duplicate pages created by facets and filters so that you don’t risk harming the performance of your high-value pages.
It’s one thing to understand faceted navigation SEO issues in theory. The next hurdle is to understand how faceted navigation is impacting our own sites.
If you want to audit your site for faceted navigation SEO issues, we recommend the following steps:
In order to diagnose faceted navigation issues, you’ll need to have a good understanding of how your site uses faceted navigation.
To figure this out, ask questions such as:
In Botify, you can use Advanced Segmentation or Custom Extracts to quantify and visualize your website’s faceted navigation. In the example below, we’re looking at a 20 million page website with more than 19 million faceted pages — more than 80% of those have 3+ facets added.
Through getting to know your site’s faceted navigation, you’ll gain a better understanding of the scope of the SEO issues it might be causing.
Once you understand how many and what kind of facets are on your website, you can use your analytics to identify if any of those pages are valuable — in other words, which faceted pages are driving organic search traffic to your site?
You can easily see this by looking at active pages by facet in Botify. For example, on this site we can see that most of its pages (URLs with 3 facets) are not driving traffic. That’s a huge crawl budget risk, which leads us to the next step.
Like we just saw, some faceted pages can drive traffic from organic search. However, other faceted pages can waste your crawl budget. How can you know which is which?
A good place to start is comparing “Crawls by Google” (bots) to “Visits from Google” (users).
In the example below, URLs with 3+ facets take up a large portion of the crawl budget (they get more hits from Googlebot than any other URLs on the site). However, those pages generate very few visitors from Google Organic.
If Googlebot is crawling certain pages a ton, but those pages aren’t driving traffic, those pages may be wasting your crawl budget.
Once you understand what’s happening on your site, it’s time to look off-site at search demand. In other words, is there enough search demand for the pages we’re creating?
If a page created by your faceted navigation has low or no demand (e.g. very few people are searching for “leather jacket size medium”), you may want to consider keeping it out of the index. On the other hand, if a page created by your faceted navigation has high demand (e.g. lots of people search for “leather jacket under $100”), it’s a good idea to ensure these pages are indexable.
Finally, use your inventory (product or content) to understand where you could serve more results to users.
In Botify, you can use custom extracts to identify how many products you have on each category page. Then, you could prioritize the URLs with optimal products to drive more traffic and conversions.
Knowing how to treat your faceted navigation pages is complex, so we’ve created our version of a decision chart based on Aleyda’s Crawling Monday episode on Indexing Faceted Pages to help.
Following this chart, you’ll end up with three buckets of faceted pages:
Once you’ve sorted your faceted navigation pages into those groups, you’ll have a much clearer picture of what actions you need to take.
While there’s no one-size-fits-all solution for dealing with faceted navigation (this differs based on many factors like your CMS and the development resources you can get), we’ll walk you through some tips and best practices you can use to index, noindex, or block crawls of your faceted navigation pages in part two of this article, so stay tuned for that!