Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feat] Remove repetitive content on /crawl #334

Open
calebpeffer opened this issue Jun 28, 2024 · 0 comments
Open

[Feat] Remove repetitive content on /crawl #334

calebpeffer opened this issue Jun 28, 2024 · 0 comments

Comments

@calebpeffer
Copy link
Contributor

Sometimes, customers want to remove any boilerplate on pages.

One potential strategy on the crawl endpoint is to remove any content that is present on all pages. For example, if there is a navigation bar at the top of the page, that would be cleaned from the content. because it would be present on each page.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant