An SEO audit is the diagnostic that reveals why a site is or is not ranking — and what to fix to change that. Done well, it produces a prioritized roadmap that drives months of work. Done badly, it produces a 200-page checklist that nobody reads. The difference is process. This guide walks through the audit workflow we use — scoping, data collection, analysis, prioritization, and deliverable — with the specific tools, queries, and templates that turn an audit into action.
Phase 1: Scoping & Goals
Every audit starts with explicit goals. "Audit my site" is not a goal. "Diagnose why organic traffic dropped 35% in March," "Plan the SEO scope for the upcoming replatforming," or "Identify the top 10 changes to recover ranking on our money keywords" are goals — and they shape what you measure.
Scoping questions
- What is the audit's primary objective (recovery, growth, migration risk, competitive)?
- Which keywords or page templates matter most for the business?
- What recent changes have happened (launches, redesigns, content overhauls)?
- What is the timeline and budget?
- Who is the deliverable for (executive, marketing team, dev team) and what level of detail do they need?
Baseline metrics
Document the starting point before doing anything else: total indexed pages, organic sessions (last 90 days vs prior 90), top 50 ranking keywords with current positions, Core Web Vitals pass rate by template, total referring domains. These become the metrics you measure improvements against.
Phase 2: Crawl & Index Audit
Crawl and index issues block everything else. Diagnose them first because the fixes are usually fast and the impact compounds across every other optimization.
Run a full site crawl
Use Screaming Frog (up to 500 URLs free, paid above), Sitebulb, or any modern crawler. Configure to render JavaScript so you see what Google sees post-render. Crawl from a Googlebot user-agent to surface any UA-specific issues.
What to look for in crawl data
- HTTP status distribution (200s, 301s, 404s, 500s) — anything unusual?
- Redirect chains and loops
- Missing or duplicate title tags and meta descriptions
- Missing or duplicate H1s
- Pages with thin content (under 200 words)
- Orphan pages (no internal links pointing to them)
- Pages with broken outbound links
- Canonical issues (missing, cross-domain, conflicting)
- Meta robots noindex tags (intentional or bug?)
Compare crawl to Search Console
Export indexed URLs from GSC's Pages report and compare to your crawl. Pages crawled but not in GSC are not indexed; pages in GSC not in your crawl are orphans or canonicalized elsewhere. The gap reveals indexation problems no other audit step can show.
Log file analysis (for larger sites)
For sites over 50,000 URLs, log file analysis reveals what Googlebot actually fetches versus what you think it should fetch. Filter access logs to Googlebot UAs, group by URL pattern, and look for wasted crawls on facet URLs, search results, calendar pages, or session IDs. Crawl budget waste is invisible in standard crawl tools.
Phase 3: On-Page Audit
On-page covers the visible content and metadata of your pages. The goal is finding pages where small content or metadata changes unlock ranking — usually pages stuck on page 2 of search results.
Title and meta description audit
Export titles and metas for your top 500 pages. Audit for: missing tags, duplicates across pages, length issues (titles over 60 characters often truncate in SERPs), keyword absence, click-worthy phrasing. Pages on page 2 of Google often jump to page 1 with a stronger, more relevant title.
Heading structure
Every page needs one H1 that closely matches the title and primary keyword. H2 and H3 nesting should follow logical hierarchy. Audit for missing H1s, multiple H1s, skipped heading levels, and headings used purely for styling.
Content quality and intent matching
Sample 20–50 of your top traffic pages and the pages targeting your money keywords. For each, ask: does the page match the intent of its target keyword (informational vs commercial vs transactional)? Is the content as comprehensive as the top 3 ranking pages? Are there clear next-step CTAs?
Keyword cannibalization
Cannibalization happens when multiple pages target the same keyword and split the ranking signal. Use a rank tracker to find keywords where multiple of your URLs alternate in the rankings. Consolidate or differentiate them — usually by merging weaker pages into stronger ones with 301 redirects.
Phase 4: Technical Performance Audit
Performance audit uses both field data (CrUX from Search Console and PSI) and lab data (Lighthouse). Field data is what Google uses for ranking; lab data is what you use to debug.
Core Web Vitals review
Pull the CWV report from Search Console. Note which page templates fail and on which metrics. Run individual PSI audits on representative pages from each failing template. Identify the highest-impact fix per template (usually image optimization for LCP, JavaScript reduction for INP, image dimensions and ad slot reservations for CLS).
Mobile-friendliness
Sample 10–20 pages per major template and run the Mobile-Friendly Test. Look for viewport configuration, tap target size, text legibility, and rendering issues. Search Console's Mobile Usability report (where still available) aggregates issues across the site.
HTTPS and security headers
Confirm HTTPS site-wide with HSTS. Check certificate validity at SSL Labs (target A or A+). Audit security headers (CSP, X-Content-Type-Options, Referrer-Policy). Document any mixed content warnings from a crawl.
Phase 5: Backlink Audit
The backlink audit reveals where your link authority comes from, what the toxic risk is, and where the gaps are versus competitors.
Profile health
Pull referring domain data from Ahrefs or Semrush. Look at: total referring domains, growth rate (is it increasing, flat, or declining?), domain rating distribution (mostly low-DR or balanced?), country distribution (matches your target market?), anchor text distribution (mostly branded, or suspiciously over-optimized?).
Spam and toxic links
Filter to links from low-DR sites, suspicious TLDs (.tk, .cf, .ml), or sites in unrelated foreign languages. Google's algorithm typically ignores spam links rather than penalizing for them, so disavow files are rarely needed in 2026. Reserve disavow for cases of demonstrated penalty or negative SEO attack.
Competitor gap analysis
Use Ahrefs' "Link Intersect" or Semrush's "Backlink Gap" to find domains linking to your top 3 competitors but not to you. These are your highest-probability outreach targets. Prioritize by relevance and domain rating.
Phase 6: Content Audit & Gap Analysis
Content inventory
Export every indexed URL with: organic clicks, impressions, average position (from GSC), word count, publish date, and number of backlinks. Sort by performance. The pattern is usually a long tail of zero-traffic pages and a small head of high performers.
Content decisions per URL
- Keep and improve — moderate traffic, ranking on page 2 — small content additions and on-page tweaks can push to page 1.
- Consolidate — multiple thin pages on the same topic — merge into one comprehensive page with 301 redirects.
- Refresh — high-traffic pages where content is now outdated — rewrite, update statistics, add new sections.
- Prune (noindex or delete) — zero-traffic, low-quality pages — remove from the index to improve overall site quality signal.
- Leave alone — performing well, no clear improvement.
Topic gap analysis
Compare your topical coverage to competitors. Use a tool like Ahrefs' Content Gap or Semrush's Keyword Gap to find keywords competitors rank for that you do not. Cluster these into topics and prioritize new content production around them.
Phase 7: Prioritization & Deliverable
An audit that ends with a 200-item checklist is a failure. The deliverable's job is to focus the team on what matters.
Prioritization matrix
Score each finding on two axes: impact (how much organic traffic or revenue could this unlock?) and effort (how many developer days or content hours?). The top-left quadrant (high impact, low effort) is your immediate work. The top-right (high impact, high effort) is your quarterly roadmap. The bottom-left (low impact, low effort) is filler if you have spare cycles. The bottom-right is parked.
Deliverable structure
- Executive summary — one page, three to five headline findings, the recommended roadmap, projected impact.
- Current state baseline — the starting metrics you documented in Phase 1.
- Findings by category — crawl, indexation, on-page, performance, links, content. Each finding includes: what is the issue, where does it appear (URL examples), why does it matter, how to fix it, effort estimate.
- Prioritized roadmap — quarter 1, quarter 2, quarter 3 plan with owners and dependencies.
- Appendix — raw data exports, screenshots, schema validation reports.
Presenting findings
Walk stakeholders through the executive summary, not the appendix. Anchor every recommendation to the business goal. "We can recover the 35% traffic drop by addressing these three issues, in this order, over this timeline." The dev team gets the implementation-detail appendix; the executive team gets the strategy.
Common Audit Mistakes
- Checklist without prioritization. Everything is critical, nothing gets fixed.
- Tool dependence without judgment. SEO tools generate hundreds of "errors" that are not actually problems. Filter for what matters.
- No baseline, no measurement plan. If you cannot measure improvement, you cannot demonstrate value.
- Audit theater. Producing a 100-page report no one reads. Length is not value.
- Ignoring business context. Recommending changes that conflict with launch timelines, brand guidelines, or product roadmap.
Start Your Audit With Your Heading Structure
Run any URL through our heading analyzer — one of the fastest checks in any audit and the source of many easy wins.
Heading Analyzer →