Skip to content

Latest commit

 

History

History
36 lines (21 loc) · 1.26 KB

README.md

File metadata and controls

36 lines (21 loc) · 1.26 KB

Tesseract auto-training

Project homepage: github.com

Download current (devel) source: tar.gz or zip

Licence: Code is released under Apache License, Version 2.0. Other files (e.g. fonts) can have its own licenses.

DESCRIPTION

This is attempt to create python scripts for automatic tesseract traning. Scripts are based on script from project tesseractindic tesseract_trainer

REQUIREMENTS

  • Python (tested on 2.6.6)
  • tesseract (tested on 3.00; at the moment it do not work with 3.01)

USAGE

python generate.py -font <font name> -l <language> -s <size> -a <input alphabet directory>

This generate testing images and run tesseract traning on it.

Example:

$ python generate.py -font mitra -l beng -s 10 -a beng.alphabet/
$ python generate.py -font Courier -l eng -s 10 -a eng.alphabet/
$ python generate.py -font Arial -l slk -s 10 -a slk.alphabet/