What Is Faceted Navigation? The Main Factors to Consider for SEO

Posted on

If you’ve ever shopped online, you’ve definitely come across faceted navigation — you may just not have realized it!

Ordering by price, filtering by size, and sorting by color are all great examples of faceted navigation. 

faceted navigation example
An example of filtering options on a product category page.

Simply put, faceted navigation is a way to help visitors navigate and personalize a page to find the exact product they’re looking for. The pages faceted navigations create can also help people searching specific, long-tail queries find the exact products they’re looking for in the search results. 

As useful as faceted navigation can be for users, it can cause some serious problems for SEO if you’re not careful. Let’s take a look at what faceted navigation looks like on different types of sites, and what kinds of issues it can cause.

What types of websites have faceted navigation?

Faceted navigation has become synonymous with e-commerce sites, but retailers aren’t the only ones who use it. 

For example:

  • Publishers often use faceted navigation to help visitors filter by content type (Date, Topic, etc.) 
  • Classifieds sites such as job listing sites use faceted navigation to help visitors filter by things like location. 
  • Large websites of all types often use faceted navigation. Generally, the larger the site, the more likely there’s some sort of faceted navigation to improve user experience.

All of these sites are prone to the SEO pitfalls that faceted navigation often causes, so let’s explore what those common issues are.

What SEO problems can faceted navigation cause? 

According to Google, faceted navigation is often not search-friendly.

Why? Because facets create multiple versions of the same URL. 

why are faceted navigations bad for search engines
Faceted navigation: great for users, not always great for search engines.

This causes four main SEO problems:

  • Duplicate content, because multiple versions of the same page exist on your site. Many facets don’t change the page content much, if at all. For example, an option to sort products by price could create multiple pages with the same products on it, just in a different order. 
  • Diluted link equity, because internal linking will be spread across multiple URLs. Instead of one variation of a page to link to, there may be hundreds. This is bad because, instead of one page getting the benefit of all the links to it, some of those links go to duplicates.
  • Crawl waste, because Google will spend time on duplicate pages and may miss / not have time to crawl your valuable pages. 
  • Crawl traps, because in many cases, faceted navigation can create an almost endless combination of your core URLs. This is called a crawl trap, because bots literally get trapped crawling these URLs.
SEO problems caused by faceted navigation
Common SEO problems caused by faceted navigation.

Some SEOs might argue that faceted navigation is a way to easily create pages targeted for long-tail search queries with small but specific search intent. However, you should always first validate whether search intent for that topic actually exists. 

Remember, just because you can create a page doesn’t mean you should!

Examples of SEO issues created by faceted navigation

Let’s look at an example of an e-commerce website with less than 200,000 product pages.

When Botify conducted a crawl, following the same rules as the ones this site had set for Google in robots.txt, we found that there were more than 500 million pages accessible. 

Remember, this e-commerce site had less than 200,000 products and yet their page count was more than 500 million accessible to search engine bots! 

diagram illustrating page bloat caused by faceted navigation

The cause? This site’s faceted navigation was creating an infinite number of combinations for the same URL, and unfortunately, that’s not an uncommon occurrence. 

But what do we mean by “multiple versions of the same URL”?

Say, for example, we’re on an e-commerce site that sells computers, and we’re looking at the ‘monitors’ category page. 

The main URL for that page would likely be:

https://example.com/monitors/ 

Now let’s say there are filters for screen size, resolution, and connectivity, and I select that I want to see 19 inch monitors with 1600×900 resolution that have hdmi connectivity.  

After making those selections, I’d end up with a URL that looks something like this: 

https://example.com/monitors/filters?size=19inch&resolution=1600x900pixel&connectivity=hdmi

This is a narrowed-down version of the same monitors page I’ve been on the whole time. This page doesn’t have different content, just less of it. 

That page will also be incredibly similar to other narrowed-down versions of the page that are possible for visitors to create, like this: 

https://example.com/monitors/filters?size=19inch,22inch&resolution=1600x900pixel&connectivity=hdmi,wga

And, depending on your Content Management System (CMS) and your developers, there could even be different combinations of the same URL — essentially, two pages that both exist and are exactly the same, like: 

https://example.com/monitors/filters?size=19inch&connectivity=hdmi

&

https://example.com/monitors/filters?connectivity=hdmi&size=19inch

The only difference between the two pages above is that the facets are listed in different orders, depending on which order the visitor clicked on the filters. 

Knowing this, it’s probably no longer surprising that a site with 200,000 product pages could end up with more than 500 million pages. 

As good as it is for users to be able to customize their search and find the best product depending on their needs, search engines will certainly have trouble figuring out which pages you want them to index. 

Knowing that Google isn’t crawling half the pages on your average enterprise website, you might want to ask the question; which half are they missing?

how does Google crawl the web - Botify
Based on Botify’s analysis of 6.2B Googlebot requests across 413M web pages June 2018 “How Does Google Crawl the Web?”

Are they missing your low-value, duplicate pages? Or are they missing your critical, revenue-driving product pages? 

We also know that, with a low ratio of crawled pages, sites have fewer pages generating traffic from organic search, and those numbers only get worse as sites increase in size.

crawl ratio is worse on larger websites

Case in point — we can’t leave it up to bots to guess.

How do I know if Google is finding my important pages?

To see if search engine bots like Google are finding your important pages, or wasting time on duplicate facet pages, you can look at your server log files.

crawls venn diagram in Botify
The crawls venn diagram is one of the many charts in Botify that reveals what pages search engines are finding vs. missing. 

To see which of your pages Google has indexed, you can spotcheck URLs in Google Search Console (GSC) or use Botify to see all faceted pages that are indexed.

URL inspect in Google Search Console
The URL Inspection tool in Google Search Console can help you identify whether a page has been indexed by Google.

To see which of these pages are receiving clicks and traffic, you can use Google or Adobe Analytics, GSC for spot checks, or Botify (via our Analytics and GSC integrations).

A report in Botify that allows you to see what pages on your site are “active” (any page that received at least 1 organic search visit in the last 30 days) vs. pages that are not active.

Ideally, it’s only your primary, valuable pages that Googlebot is crawling, indexing, ranking, and driving traffic to your site. You’ll want Googlebot to stay away from your duplicate pages created by facets and filters so that you don’t risk harming the performance of your high-value pages. 

How to audit your faceted navigation for SEO 

It’s one thing to understand faceted navigation SEO issues in theory. The next hurdle is to understand how faceted navigation is impacting our own sites. 

If you want to audit your site for faceted navigation SEO issues, we recommend the following steps:  

1. Get familiar with how facets work on your site

In order to diagnose faceted navigation issues, you’ll need to have a good understanding of how your site uses faceted navigation.  

To figure this out, ask questions such as:

  • Do facets exist only on my category pages, or do we use facets on other sections of the site like the blog as well? 
  • Is there a hierarchical order how facets are appended to your URLs? (e.g. brand facets are always first and pricing facets are always second) 
  • Is there a limit to how many facets you can add, or are endless combinations possible? (e.g. user-generated facets where anything typed into the site search bar creates a new facet)

In Botify, you can use Advanced Segmentation or Custom Extracts to quantify and visualize your website’s faceted navigation. In the example below, we’re looking at a 20 million page website with more than 19 million faceted pages — more than 80% of those have 3+ facets added. 

faceted navigation visualization in Botify

Through getting to know your site’s faceted navigation, you’ll gain a better understanding of the scope of the SEO issues it might be causing. 

2. Evaluate traffic to your faceted pages 

Once you understand how many and what kind of facets are on your website, you can use your analytics to identify if any of those pages are valuable — in other words, which faceted pages are driving organic search traffic to your site?

You can easily see this by looking at active pages by facet in Botify. For example, on this site we can see that most of its pages (URLs with 3 facets) are not driving traffic. That’s a huge crawl budget risk, which leads us to the next step.

active vs. not active pages in Botify

3. Identify crawl waste on faceted pages

Like we just saw, some faceted pages can drive traffic from organic search. However, other faceted pages can waste your crawl budget. How can you know which is which?

A good place to start is comparing “Crawls by Google” (bots) to “Visits from Google” (users). 

In the example below, URLs with 3+ facets take up a large portion of the crawl budget (they get more hits from Googlebot than any other URLs on the site). However, those pages generate very few visitors from Google Organic. 

crawl waste caused by faceted navigation

If Googlebot is crawling certain pages a ton, but those pages aren’t driving traffic, those pages may be wasting your crawl budget.

4. Identify search demand for your faceted pages

Once you understand what’s happening on your site, it’s time to look off-site at search demand. In other words, is there enough search demand for the pages we’re creating?

If a page created by your faceted navigation has low or no demand (e.g. very few people are searching for “leather jacket size medium”), you may want to consider keeping it out of the index. On the other hand, if a page created by your faceted navigation has high demand (e.g. lots of people search for “leather jacket under $100”), it’s a good idea to ensure these pages are indexable. 

should I index my faceted  navigation pages?

5. Check your inventory to understand where you could serve more results to users 

Finally, use your inventory (product or content) to understand where you could serve more results to users. 

In Botify, you can use custom extracts to identify how many products you have on each category page. Then, you could prioritize the URLs with optimal products to drive more traffic and conversions.

number of products per category page

How should I treat my faceted navigation pages?

Knowing how to treat your faceted navigation pages is complex, so we’ve created our version of a decision chart based on Aleyda’s Crawling Monday episode on Indexing Faceted Pages to help. 

how should I treat my faceted navigation pages flow chart

Following this chart, you’ll end up with three buckets of faceted pages:

  • Index (should always have a self-referencing canonical, shouldn’t be blocked from robots.txt, and should have valuable internal links pointing to it)
  • Don’t index (should contain a noindex tag)
  • Block crawl (should not be accessible to bots, which can be achieved by a combination of robots.txt, parameter handling in GSC, JavaScript, and nofollow links)

Once you’ve sorted your faceted navigation pages into those groups, you’ll have a much clearer picture of what actions you need to take. 

While there’s no one-size-fits-all solution for dealing with faceted navigation (this differs based on many factors like your CMS and the development resources you can get), we’ll walk you through some tips and best practices you can use to index, noindex, or block crawls of your faceted navigation pages in part two of this article, so stay tuned for that!

 

 

Related posts

Get more articles like this in your inbox monthly!