Skip to content

Downloads pictures of upermarket leaflets and extracts the text for further analysis

Notifications You must be signed in to change notification settings

tutlum/leaflet-downloader

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Leaflet downloader

extracts and analyzes leaflets from www.prospektangebote.de

  1. the html pages are parsed in a non intelligent way
  2. the found images are downloaded and
  3. the images are analyzed via tesseract ocr

TODO: implement search on textfiles

About

Downloads pictures of upermarket leaflets and extracts the text for further analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published