Fix Crawl Errors in Google Search Console: A Practical Walkthrough
Crawl errors are one of the most overlooked reasons why pages disappear from Google. If Googlebot can’t reach a URL, it can’t index it, and if it can’t index it, you lose rankings, clicks, and revenue. The good news: most crawl errors follow predictable patterns, and you can resolve them quickly once you know how to read the signals inside Google Search Console (GSC).
In this guide, we walk through a decision-tree approach to diagnose and fix crawl errors in Google Search Console. We’ll cover soft 404s, server errors (5xx), redirect chains, blocked resources, and DNS issues, with concrete actions you can take today.

Step 1: Locate the Crawl Errors in Google Search Console
Before fixing anything, you need a clean inventory of the issues. Open Google Search Console and follow this path:
- Go to Indexing > Pages in the left sidebar.
- Scroll to the Why pages aren’t indexed section.
- Click any error type to see the affected URLs.
- For server-level signals, open Settings > Crawl stats to review host status, response codes, and Googlebot request volume.
Export the URL list (CSV or Google Sheets) so you can group errors and track fixes over time.

Step 2: Categorize the Errors
Not all crawl errors are equal. Some hurt SEO immediately; others are normal and expected. Use the table below to prioritize.
| Error Type | Severity | Typical Cause | Action Priority |
|---|---|---|---|
| Soft 404 | High | Thin content, empty category, wrong status code | Fix this week |
| Server error (5xx) | Critical | Hosting, PHP timeout, overload | Fix today |
| Redirect error | High | Redirect loops, long chains, bad target | Fix this week |
| Not found (404) | Medium | Deleted pages, broken internal links | Review monthly |
| Blocked by robots.txt | Variable | Intentional or misconfigured rule | Verify intent |
| DNS / connectivity | Critical | DNS misconfiguration, firewall blocking Googlebot | Fix today |
Step 3: The Decision Tree for Each Error
Fixing Soft 404 Errors
A soft 404 happens when a page returns a 200 OK status but Google believes the content is missing, thin, or unhelpful. Ask yourself:
- Is the page supposed to exist? If no, return a proper 404 or 410 status code.
- Is the page valuable but thin? Add unique copy, internal links, and structured data. Aim for 300+ meaningful words.
- Is it an empty category or out-of-stock product? Add fallback content, related products, or redirect to a parent category with a 301.
- Is it a JavaScript-rendered page? Test with the URL Inspection tool and check the rendered HTML. If content fails to render, enable SSR or prerendering.
After fixing, click Validate Fix inside the soft 404 report in GSC.
Fixing Server Errors (5xx)
Server errors mean Googlebot reached your site but received a failure response. The decision flow:
- Open Settings > Crawl stats > Host status. If you see red bars, identify the date and timeframe of the spike.
- Cross-check with server logs (Apache, Nginx, or your CDN dashboard).
- Common culprits to inspect:
- PHP memory limit too low
- Database connection limits exceeded
- Cloudflare or WAF blocking the Googlebot user-agent
- Rate-limiting plugins (security plugins on WordPress)
- Whitelist Googlebot using the official IP ranges Google publishes.
- Reduce Time to First Byte through caching (object cache, full-page cache, CDN).
Fixing Redirect Errors
Redirect errors usually fall into four buckets:
- Redirect loops: Page A points to B, B points back to A. Map the chain with a crawler like Screaming Frog and break the loop.
- Long chains: Keep redirects to a single hop. Update each intermediate URL to point directly to the final destination.
- Bad target URL: The destination returns 404 or another redirect. Update the rule to a live page.
- HTTPS / WWW conflicts: Ensure one canonical version (https://www. or https:// non-www) and enforce it at server level.
Fixing Blocked by robots.txt
Open your robots.txt file (usually at yourdomain.com/robots.txt) and check the rules. If a URL you want indexed is blocked:
- Remove or adjust the
Disallowdirective. - Test the updated file with the robots.txt Tester in GSC.
- Resubmit the affected URLs through the URL Inspection tool.
Fixing DNS and Connectivity Issues
If Googlebot reports DNS errors, the issue is upstream of your application. Steps to take:
- Run a dig or nslookup from multiple regions.
- Verify your DNS provider’s uptime status.
- Lower your DNS TTL temporarily so changes propagate faster.
- Confirm your firewall and DDoS protection are not blocking Googlebot’s verified IP ranges.

Step 4: Validate Your Fixes
After deploying corrections, return to the specific error report in GSC and click Validate Fix. Google will then recrawl a sample of affected URLs over the next few days or weeks. You’ll receive a notification with the result.
While you wait, use the URL Inspection tool to manually request indexing for your most important URLs (homepage, money pages, fresh content).
Step 5: Monitor and Prevent Future Crawl Errors
Crawl error management is not a one-time job. Build a recurring workflow:
- Weekly: Check the Pages report for new error spikes.
- Monthly: Review Crawl stats for host availability trends.
- Quarterly: Run a full site crawl with an external tool to catch errors before Google does.
- Always: Keep your XML sitemap clean. Only include indexable, canonical, 200-status URLs.

Common Mistakes to Avoid
- Mass redirecting every 404 to the homepage (this creates soft 404s).
- Ignoring 5xx spikes assuming “Google will figure it out.”
- Blocking Googlebot through aggressive security plugins.
- Submitting noindex pages in the sitemap.
- Forgetting to click Validate Fix after corrections.
FAQ: Fixing Crawl Errors in Google Search Console
How long does it take for Google to recrawl after fixing an error?
It varies. For high-authority sites, recrawl can happen within hours. For smaller sites, it may take one to four weeks. Using Validate Fix and URL Inspection speeds the process.
Are 404 errors bad for SEO?
Not inherently. 404s are a normal part of the web. They become problematic only when they affect URLs with traffic, backlinks, or internal links pointing to them. In those cases, redirect to a relevant page with a 301.
What’s the difference between a soft 404 and a real 404?
A real 404 returns the proper HTTP status code, telling Google the page is gone. A soft 404 returns 200 OK but has empty or irrelevant content. Soft 404s waste crawl budget and confuse Google’s index.
Should I worry about “Discovered, not indexed”?
Yes, especially at scale. It often signals quality issues, crawl budget limits, or duplicate content. Improve internal linking, content quality, and reduce low-value URLs.
Can I use the old Crawl Errors report?
No. Google retired the legacy report years ago. All crawl error data now lives inside the Pages report and Crawl stats section of the new Google Search Console.
Final Thoughts
Crawl errors are not just technical noise. They are signals about the health of your site and how Google experiences it. By following a structured decision tree, prioritizing critical errors first, and validating every fix, you keep your site indexable and competitive. Bookmark this guide, share it with your team, and revisit your GSC dashboard on a weekly cadence. Your rankings will thank you.
