A faster approach for fast crawler! #3

Open
black-fractal opened this issue Jan 17, 2021 · 0 comments
Labels
enhancement New feature or request

black-fractal commented Jan 17, 2021

  • Is your feature request related to a problem? Please describe.
    1. Every time fast crawler runs, it opens all of the JSON files to gather historical information about previously crawled links.
    2. Many JSON files should be merged: only the longest chain should be kept, and the others should be eliminated.

  • Describe the solution you'd like
    1. A new Python script, or a new function in fast crawler, should be written to support multiple runs.
    2. In traverse_link, continue_crawl, or search_in_files_history, a new path should be stored in a new data structure if:

    • the newly fetched link is repetitive (already seen), and
    • the new path has a longer chain.
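A minimal sketch of the idea above, assuming the history files map each link to its chain (a list of links); the function names `merge_histories` and `record_path` and the JSON layout are hypothetical, not the repository's actual API:

```python
import json
from pathlib import Path


def merge_histories(history_dir):
    """Merge all JSON history files in a directory, keeping only the
    longest chain seen for each link (shorter duplicates are dropped)."""
    longest = {}  # link -> longest chain (list of links) seen so far
    for path in Path(history_dir).glob("*.json"):
        with open(path) as fh:
            for link, chain in json.load(fh).items():
                # A repetitive link survives only if its new chain is longer.
                if link not in longest or len(chain) > len(longest[link]):
                    longest[link] = chain
    return longest


def record_path(longest, link, chain):
    """Store a newly fetched path only when the link is new or the
    new path has a longer chain than the stored one."""
    if link not in longest or len(chain) > len(longest[link]):
        longest[link] = chain
```

With this structure the merged dictionary is loaded once per run instead of reopening every JSON file, and `record_path` implements the two conditions from the issue body.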
@black-fractal black-fractal self-assigned this Jan 17, 2021
@black-fractal black-fractal added the enhancement New feature or request label Jan 17, 2021