Zoekt indexer trigram and file size limits#1436
Merged
natehessler merged 5 commits intomainfrom Nov 19, 2025
Merged
Conversation
https://ampcode.com/threads/T-0390a39a-9c04-441e-8982-7e2ef7b9bf76 Co-authored-by: Amp <amp@ampcode.com>
https://ampcode.com/threads/T-0390a39a-9c04-441e-8982-7e2ef7b9bf76 Co-authored-by: Amp <amp@ampcode.com>
|
The latest updates on your projects. Learn more about Vercel for GitHub.
|
s3nu
reviewed
Nov 19, 2025
docs/admin/search.mdx
Outdated
|
|
||
| To view which files are skipped during indexing, visit the repository settings page and click on **Indexing**. | ||
|
|
||
| To force the indexer to include specific files (like `yarn.lock` or other large text files) that are otherwise skipped, add their file path or a glob pattern to the [`search.largeFiles`](/admin/config/site_config#search-largeFiles) setting in your site configuration and reindex the repository. Note that files must still be valid UTF-8 to be indexed, even if added to `search.largeFiles`. |
There was a problem hiding this comment.
This is a dead markdown path: https://github.com/sourcegraph/docs/blob/amp/zoekt-indexer-trigram-and-file-size-limits/admin/config/site_config#search-largeFiles
Maybe we want to point here or docs/admin/config/site_config.mdx
Updated link in search documentation for large files setting.
Updated the documentation to include a link for the search.largeFiles setting.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
The documentation was updated in
search.mdxto specify that the Zoekt indexer skips files exceeding 20,000 unique trigrams or those that are not valid UTF-8. Instructions were added detailing how to override these limits by configuring thesearch.largeFilessetting and reindexing the repository.Thread: https://ampcode.com/threads/T-0390a39a-9c04-441e-8982-7e2ef7b9bf76