Tesseract output improvement #5

halfguru · 2019-10-01T20:49:05Z

Hi,

First of all, thank you for your work. I was looking for OCR projects since it's very difficult to find english subtitles for chinese youtube shows.

I'm wondering if you've attempted to optimize the Tesseract output with different image processing techniques as illustrated here. The use_fullframe argument could be changed to specific rectangular coordinates. Also, the Tesseract wiki indicates a dark text with light background is preferable so adding an option to invert the colors could be helpful. Binarisation could also help further isolate the subtitles. Finally, I believe adding the --psm 6 option to the Tesseract config to indicate a single uniform block of text would be beneficial.

The text was updated successfully, but these errors were encountered:

mongy910 · 2020-09-22T07:33:10Z

@halfguru These are really good insights. In the year since you've posted this, have you found any better solutions? I have the same use case as you (reading chinese soft captions).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tesseract output improvement #5

Tesseract output improvement #5

halfguru commented Oct 1, 2019 •

edited

Loading

mongy910 commented Sep 22, 2020

Tesseract output improvement #5

Tesseract output improvement #5

Comments

halfguru commented Oct 1, 2019 • edited Loading

mongy910 commented Sep 22, 2020

halfguru commented Oct 1, 2019 •

edited

Loading