Refactor documentation for LLM discoverability and retrieval quality #3771
Description
This PR implements a comprehensive audit and refactoring of the Statsig documentation to maximize LLM discoverability and retrieval quality. The changes follow industry best practices from Redocly, GitBook GEO, and Kapa.ai.
Scope: 1048 files modified with 2192 automated fixes applied across the entire documentation codebase.
Key Improvements
SEO/GEO Enhancements (1054 fixes)
Structural Improvements (42 fixes)
Code Block Improvements (994 fixes)
Language Clarity (101 fixes)
Terminology Standardized
feature flag (canonical) vs feature gate, gate
experiment (canonical) vs a/b test
data warehouse (canonical) vs dwh, data-warehouse
user (canonical) vs customer, end user
API key (canonical) vs server secret, api-key
Statistics
This is a large automated refactoring. Please pay special attention to:
Terminology Changes: Verify that standardization (e.g., "A/B test" → "experiment", "customer" → "user") is contextually appropriate throughout. Some business/sales contexts may require "customer" specifically.
Generic Page Intros: Many pages now have intros like "This page explains [title]". Check if these add value or are redundant with existing content.
Frontmatter Descriptions: Some descriptions appear truncated in the diff (e.g., description: <h1 align="center">...). Verify these render correctly.
Code Block Language Tags: Automated inference may have misidentified some code blocks. Spot-check that syntax highlighting works correctly.
Build Verification: The documentation build couldn't be tested locally. Please verify the site builds successfully in CI.
Context-Dependent Phrase Replacements: Verify that replacements like "as shown below" → "as shown in the following example" maintain correct meaning in context.
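For reviewers who want to spot-check the terminology item above, here is a minimal sketch of a scanner that lists every line where a non-canonical variant appears, so each hit can be judged in context. It is not the tooling used in this PR. The names `CANONICAL_TERMS` and `find_variants` are hypothetical, the mapping is copied from the "Terminology Standardized" list, and the matching rule (case-insensitive, whole word, optional plural "s") is an assumption.

```python
import re

# Canonical term -> variants, copied from the "Terminology Standardized" list.
# The dict name and structure are illustrative, not part of the PR's tooling.
CANONICAL_TERMS = {
    "feature flag": ["feature gate", "gate"],
    "experiment": ["a/b test"],
    "data warehouse": ["dwh", "data-warehouse"],
    "user": ["customer", "end user"],
    "API key": ["server secret", "api-key"],
}


def find_variants(text):
    """Return (line_number, variant, canonical) for each variant found in text."""
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for canonical, variants in CANONICAL_TERMS.items():
            for variant in variants:
                # Whole-word match, case-insensitive, allowing a plural "s".
                pattern = r"(?<![\w-])" + re.escape(variant) + r"s?(?![\w-])"
                if re.search(pattern, line, flags=re.IGNORECASE):
                    hits.append((line_no, variant, canonical))
    return hits


if __name__ == "__main__":
    sample = "Contact your customer success manager.\nCreate a feature gate."
    for line_no, variant, canonical in find_variants(sample):
        print(f"line {line_no}: '{variant}' (canonical: '{canonical}')")
```

The sketch only reports hits and never rewrites text, which keeps the decision with the reviewer in cases such as "customer success manager", where the standardized term would change the meaning.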
Best practice checklist
Detailed Audit Report
A comprehensive audit report with file-by-file findings is available at /tmp/AUDIT_REPORT.md and includes:
Questions?
Reach out to Brock, Tore, or Logan on Slack!
Link to Devin run: https://app.devin.ai/sessions/1e3a21ea6d474d6c954ffba532f6b0ca
Requested by: [email protected] (@xhuang-statsig)