@th3w1zard1

Add a docs crawler that implements the following crawlers:

  • ChromiumCrawler
  • DefaultCrawler
  • A GitHub-specific implementation

The following major packages are used:

  • vectordb
  • Puppeteer

This PR is heavily inspired by Continue.dev's implementation, and the following resources/topics:

- Added 'vectordb' as a dependency in package.json.
- Introduced the DocsCrawler class to handle web crawling with special handling for GitHub documentation.
- Updated existing crawler classes to use a consistent maxRequestsPerCrawl parameter.
- Improved error handling and fallback strategies when crawling fails.
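
To illustrate the last two points, here is a minimal sketch of how a shared `maxRequestsPerCrawl` limit and a fallback chain could fit together. It is not the code in this PR: `CrawlStrategy`, `orderStrategies`, and `crawlWithFallback` are hypothetical names, the strategies are stubbed, and crawling is kept synchronous for brevity, whereas a real crawler would be asynchronous.

```typescript
// Hypothetical sketch; none of these names are taken from the PR itself.
interface CrawlOptions {
  maxRequestsPerCrawl: number; // upper bound on pages returned per crawl
}

// A strategy takes a start URL and returns the page URLs it visited.
// Kept synchronous for brevity; a real crawler would return a Promise.
interface CrawlStrategy {
  name: string;
  crawl: (startUrl: string, options: CrawlOptions) => string[];
}

// Try the GitHub-specific strategy first for github.com URLs,
// and leave it out entirely for every other host.
function orderStrategies(startUrl: string, strategies: CrawlStrategy[]): CrawlStrategy[] {
  const isGitHub = /^https?:\/\/(www\.)?github\.com(\/|$)/.test(startUrl);
  const github = strategies.filter((s) => s.name === "github");
  const others = strategies.filter((s) => s.name !== "github");
  return isGitHub ? github.concat(others) : others;
}

// Run strategies in order and return the first non-empty result,
// capped at maxRequestsPerCrawl. Errors are collected, not swallowed.
function crawlWithFallback(
  startUrl: string,
  strategies: CrawlStrategy[],
  options: CrawlOptions
): { strategy: string; pages: string[] } {
  const errors: string[] = [];
  for (const strategy of orderStrategies(startUrl, strategies)) {
    try {
      const pages = strategy.crawl(startUrl, options).slice(0, options.maxRequestsPerCrawl);
      if (pages.length > 0) {
        return { strategy: strategy.name, pages };
      }
      errors.push(`${strategy.name}: no pages found`);
    } catch (err) {
      errors.push(`${strategy.name}: ${err instanceof Error ? err.message : String(err)}`);
    }
  }
  throw new Error(`All crawlers failed for ${startUrl}: ${errors.join("; ")}`);
}

// Stubbed strategies: the Chromium one always fails, to exercise the fallback.
const strategies: CrawlStrategy[] = [
  { name: "chromium", crawl: () => { throw new Error("Chromium not installed"); } },
  { name: "github", crawl: (url) => [url, url + "/blob/main/README.md"] },
  { name: "default", crawl: (url) => [url, url + "/a", url + "/b"] },
];

const result = crawlWithFallback("https://example.com/docs", strategies, {
  maxRequestsPerCrawl: 2,
});
console.log(result.strategy, result.pages.length);
```

In this sketch the caller gets a single error listing every failed strategy, which makes it easier to see why a crawl produced nothing.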
@th3w1zard1
Author

Currently only the VS Code side is implemented. A brainstorming session is needed to determine the best way to integrate with aider.

@lee88688
Owner

This looks like a good idea, but I think it may not work as well here as it does in Continue.dev. Because this extension uses aider as its backend, it has no control over the system message or the other messages, so the plugin may not have enough ways to optimize the result.
Do you have any ideas about this?

@th3w1zard1
Author

I don't think the system message matters here. What matters is that relevant documentation gets retrieved and appended to the user prompt.

I can't imagine why the system message would be relevant.

The main issue I see is that documentation retrieval should be implemented in aider, with an interface then exposed to this extension.
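
As a rough illustration of appending retrieved documentation to the user prompt, here is a minimal sketch. The `DocSnippet` shape, the `buildAugmentedPrompt` function, and the character budget are all assumptions made for the example; how snippets would be retrieved (for example from the vector store) and how the result would reach aider are left open.

```typescript
// Hypothetical sketch; not an existing API of this extension or of aider.
interface DocSnippet {
  source: string; // where the snippet came from, e.g. a page URL
  text: string;   // the retrieved documentation text
  score: number;  // relevance score, higher is more relevant
}

// Append the most relevant snippets to the user prompt, staying within
// a character budget. The system message is never touched.
function buildAugmentedPrompt(
  userPrompt: string,
  snippets: DocSnippet[],
  maxContextChars: number
): string {
  // Copy before sorting so the caller's array is left unchanged.
  const ranked = snippets.slice().sort((a, b) => b.score - a.score);
  const blocks: string[] = [];
  let used = 0;
  for (const snippet of ranked) {
    const block = `Source: ${snippet.source}\n${snippet.text}`;
    if (used + block.length > maxContextChars) {
      continue; // skip snippets that do not fit in the remaining budget
    }
    blocks.push(block);
    used += block.length;
  }
  if (blocks.length === 0) {
    return userPrompt; // nothing relevant fits, send the prompt unchanged
  }
  return `${userPrompt}\n\nRelevant documentation:\n\n${blocks.join("\n\n")}`;
}

const augmented = buildAugmentedPrompt(
  "How do I limit the crawl size?",
  [
    { source: "docs/a", text: "Unrelated note.", score: 0.1 },
    { source: "docs/b", text: "Set maxRequestsPerCrawl.", score: 0.9 },
  ],
  1000
);
console.log(augmented);
```

Since only the user prompt changes, this approach would not require control over the system message.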

@lee88688
Owner

Aider's core code does not provide a general interface for extensions, so some features may be hard or tricky to implement.

This may be a good idea, but it may not be a good fit for now. If the extension is to move forward with it, we need aider's support as well.

Otherwise this extension would need to implement the entire backend itself.

2 participants