Google Was Wasting 71% of Our Crawl Budget on Useless Filter Pages
A crawl budget optimization that cut indexation time from weeks to days
Faceted navigation is essential for user experience on e-commerce sites. Customers need filters for size, color, price range, and brand. But each filter combination creates a new URL. Our 300-product store had generated 8,000 crawlable URLs through filter combinations, and Google was spending most of its crawl budget on these instead of actual product pages.
The Myth of Unlimited Crawling
Many store owners assume Google will crawl everything eventually. Not true. Google allocates a crawl budget based on site size, authority, and server performance. When you waste that budget on duplicate filter pages, Google might miss new products or important updates. Our new product pages were taking 3-4 weeks to get indexed because Google was busy crawling pages like category-red-size-large-price-50-to-100.
Diagnosing the Problem
We analyzed server logs for 30 days. Googlebot made 47,000 requests. Only 6,200 were to product pages. The rest hit filter combinations, pagination, and sort variations. Meanwhile, Search Console showed 23 new products added in the past month with zero impressions because they hadn't been indexed yet.
The Experiment Parameters
We implemented a two-part solution on half our categories as a test. First, we added noindex tags to all filter URLs except single-filter pages that had actual search volume. Second, we set up URL parameters in Search Console telling Google which parameters to ignore. The control categories kept all filter pages indexable.
Implementation Details
Our developer added conditional noindex tags: if more than one filter was active, add noindex. If URL contained sorting parameters, add noindex. Single filters like category-red or category-size-large remained indexable because people actually searched for those terms. This reduced indexable filter URLs from 8,000 to 340.
Crawl Behavior Changed Fast
Within one week, server logs showed different Googlebot patterns. Requests to multi-filter pages dropped 68%. Requests to product pages increased 89%. By week three, our new products were getting indexed within 4-6 days instead of 3-4 weeks. Organic traffic to the test categories increased 23% as fresher content appeared in search faster.
The control categories showed no change in indexation speed or crawl patterns. Their filter pages still consumed crawl budget with no SEO value.
Long-Term Results
After rolling out the changes site-wide, our average time-to-index for new products dropped to 5 days. Organic traffic increased 34% over three months as Google spent more crawl budget on actual products. We also saw improvements in rankings for product pages, likely because Google crawled them more frequently and recognized updates faster.
What You Should Check
Look at your server logs or Google Search Console crawl stats. If Google is crawling thousands of filter combinations, you're wasting crawl budget. Audit which filter pages actually get organic traffic. Noindex the rest. It's a technical fix that takes a few hours but fundamentally changes how efficiently Google crawls your site.