best practices: add index best practices #21218

hfxsd · 2025-06-23T13:31:08Z

First-time contributors' checklist

I've signed Contributor License Agreement that's required for repo owners to accept my contribution.

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

What is the related PR or file link(s)?

This PR is translated from:
Other reference link(s):

Do your changes match any of the following descriptions?

Delete files
Change aliases
Need modification after applied to another branch
Might cause conflicts after applied to another branch

gemini-code-assist

Summary of Changes

Hello @hfxsd, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces new documentation focused on best practices for index management and optimization within the TiDB database. The aim is to provide users with a detailed guide on how to leverage TiDB's features to improve query performance, reduce storage overhead, and streamline database operations through effective index auditing and maintenance.

Highlights

New Documentation Added: A comprehensive new document, best-practices/index-best-practices.md, has been added. This document outlines best practices for managing and optimizing indexes in TiDB, covering topics such as identifying unused/inefficient indexes and safely testing index removal.
Table of Contents Update: The main Table of Contents (TOC.md) has been updated to include a link to the newly added 'Index Best Practices' document under the 'Best Practices' section.
Index Optimization Tools Explained: The new best practices guide details the use of TiDB's built-in observability tools, including TIDB_INDEX_USAGE, CLUSTER_TIDB_INDEX_USAGE, and schema_unused_indexes, along with the concept of invisible indexes, to help users effectively manage and optimize their database indexes.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

The code changes add a new best practices document for index management in TiDB. The document provides guidance on identifying and optimizing indexes, leveraging TiDB's observability tools, and safely testing index removal. The review focuses on ensuring clarity, adherence to the style guide, and providing suggestions for improved readability.

best-practices/index-best-practices.md

ti-chi-bot · 2025-06-23T13:34:26Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please ask for approval from hfxsd. For more information see the Code Review Process.
Please ensure that each of them provides their approval before proceeding.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

hfxsd · 2025-06-23T13:36:24Z

/bot-review

TOC.md

github-actions · 2025-06-23T13:36:59Z

✅ AI review completed, 1 comments generated.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

…into index-best-practices

songrijie · 2025-06-27T03:37:16Z

best-practices/index-best-practices.md

@@ -0,0 +1,325 @@
+---
+title: Index Best Practices


The practice is mostly about how to observe and manage existing indexes. The title is too big to me. It's expected "index practice" includes creating new indexes as well.

Identifying unused indexes is part of "index best practice".

YangKeao

Rest LGTM

YangKeao · 2025-06-27T06:44:14Z

best-practices/index-best-practices.md

+
+- Delayed data updates
+
+    The data is refreshed periodically to minimize performance impact. If index usage is analyzed immediately after a query execution, allow some time for the metrics to update.


The data shown in the CLUSTER_TIDB_INDEX_USAGE is always synchornized with TIDB_INDEX_USAGE. There is no delay between these two tables and they are all memory-based storage. List this in Considerations when using CLUSTER_TIDB_INDEX_USAGEmay confuse the user thatCLUSTER_TIDB_INDEX_USAGEhas delay whileTIDB_INDEX_USAGE` doesn't have.

Actually, both of them can be delayed for at most 5 minutes (ref https://docs.pingcap.com/tidb/stable/information-schema-tidb-index-usage/).

YangKeao · 2025-06-27T06:47:13Z

best-practices/index-best-practices.md

+
+### Manually create the `schema_unused_indexes` view
+
+Because `TIDB_INDEX_USAGE` is cleared after a TiDB node restarts, ensure that the node has been running for a sufficient amount of time before making decisions. For clusters upgraded from an earlier version to TiDB v8.0.0 or later, you must manually create the system schema and the included views. 


"Because TIDB_INDEX_USAGE is cleared after a TiDB node restarts, ensure that the node has been running for a sufficient amount of time before making decisions." This sentence is duplicated with "Ensure the system has been running long enough to capture a representative workload before relying on this data." above, and have no connection with "Manually create the schema_unused_indexes view" as the section title.

songrijie · 2025-06-27T09:44:25Z

best-practices/index-best-practices.md

+TiDB v8.0.0 introduces the [`TIDB_INDEX_USAGE`](/information-schema/information-schema-tidb-index-usage.md) table and the [`schema_unused_indexes`](/sys-schema/sys-schema-unused-indexes.md) table to help you track index usage patterns and make data-driven decisions. 
+
+Because indexes evolve with changing business logic, regular index audits are a standard part of database maintenance. TiDB provides built-in observability tools to help you detect, evaluate, and optimize indexes without risk.


Swapping the two paragraphs makes more sense to me.

songrijie · 2025-06-27T09:56:54Z

best-practices/index-best-practices.md

+- `TIDB_INDEX_USAGE`: monitors index usage patterns and query frequency.
+- `schema_unused_indexes`: lists indexes that have not been used since the database is last restarted.


Suggested change

- `TIDB_INDEX_USAGE`: monitors index usage patterns and query frequency.

- `schema_unused_indexes`: lists indexes that have not been used since the database is last restarted.

- `INFORMATION_SCHEMA`.`TIDB_INDEX_USAGE`: monitors index usage patterns and query frequency.

- `mysql`.`schema_unused_indexes`: lists indexes that have not been used since the database is last restarted.

initial draft

ee28c19

hfxsd self-assigned this Jun 23, 2025

ti-chi-bot bot added contribution This PR is from a community contributor. missing-translation-status This PR does not have translation status info. labels Jun 23, 2025

hfxsd added translation/doing This PR's assignee is translating this PR. and removed contribution This PR is from a community contributor. missing-translation-status This PR does not have translation status info. labels Jun 23, 2025

ti-chi-bot bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Jun 23, 2025

gemini-code-assist bot reviewed Jun 23, 2025

View reviewed changes

hfxsd added needs-cherry-pick-release-8.1 Should cherry pick this PR to release-8.1 branch. needs-cherry-pick-release-8.5 Should cherry pick this PR to release-8.5 branch. and removed size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. labels Jun 23, 2025

gemini-code-assist bot reviewed Jun 23, 2025

View reviewed changes

Update index-best-practices.md

aa2e98f

ti-chi-bot bot added the size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. label Jun 23, 2025

github-actions bot reviewed Jun 23, 2025

View reviewed changes

TOC.md Show resolved Hide resolved

hfxsd and others added 4 commits June 23, 2025 21:37

Apply suggestions from code review

a9675bf

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Update index-best-practices.md

f48aa04

Merge branch 'index-best-practices' of https://github.com/hfxsd/docs …

a9adaad

…into index-best-practices

refined

90438df

hfxsd requested a review from songrijie June 24, 2025 02:14

hfxsd removed the needs-cherry-pick-release-8.1 Should cherry pick this PR to release-8.1 branch. label Jun 24, 2025

hfxsd added 2 commits June 24, 2025 10:32

Update index-best-practices.md

4392add

Update index-best-practices.md

50f6a6c

hfxsd requested review from time-and-fate and YangKeao June 27, 2025 03:34

songrijie reviewed Jun 27, 2025

View reviewed changes

YangKeao reviewed Jun 27, 2025

View reviewed changes

songrijie reviewed Jun 27, 2025

View reviewed changes


		- Delayed data updates

		The data is refreshed periodically to minimize performance impact. If index usage is analyzed immediately after a query execution, allow some time for the metrics to update.


		### Manually create the `schema_unused_indexes` view

		Because `TIDB_INDEX_USAGE` is cleared after a TiDB node restarts, ensure that the node has been running for a sufficient amount of time before making decisions. For clusters upgraded from an earlier version to TiDB v8.0.0 or later, you must manually create the system schema and the included views.

		TiDB v8.0.0 introduces the [`TIDB_INDEX_USAGE`](/information-schema/information-schema-tidb-index-usage.md) table and the [`schema_unused_indexes`](/sys-schema/sys-schema-unused-indexes.md) table to help you track index usage patterns and make data-driven decisions.

		Because indexes evolve with changing business logic, regular index audits are a standard part of database maintenance. TiDB provides built-in observability tools to help you detect, evaluate, and optimize indexes without risk.

		- `TIDB_INDEX_USAGE`: monitors index usage patterns and query frequency.
		- `schema_unused_indexes`: lists indexes that have not been used since the database is last restarted.

best practices: add index best practices #21218

Are you sure you want to change the base?

best practices: add index best practices #21218

Uh oh!

Conversation

hfxsd commented Jun 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

First-time contributors' checklist

What is changed, added or deleted? (Required)

Which TiDB version(s) do your changes apply to? (Required)

What is the related PR or file link(s)?

Do your changes match any of the following descriptions?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ti-chi-bot bot commented Jun 23, 2025

Uh oh!

hfxsd commented Jun 23, 2025

Uh oh!

Uh oh!

github-actions bot commented Jun 23, 2025

Uh oh!

songrijie Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

songrijie Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

YangKeao left a comment

Choose a reason for hiding this comment

Uh oh!

YangKeao Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

YangKeao Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

songrijie Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

songrijie Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

songrijie Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

hfxsd commented Jun 23, 2025 •

edited

Loading