Missing scripts in tesseract language documentation #94

Open
guigarfr opened this issue Sep 19, 2022 · 2 comments

Comments

@guigarfr

guigarfr commented Sep 19, 2022

The documentation page on data files in different versions is incomplete.

For example, I just got "Katakana" as the script in the output of an image_to_osd call, and it is not documented under the available scripts.

@amitdo
Collaborator

amitdo commented Sep 19, 2022

osd is a special case. Most of the scripts it can recognize are available as separate models.

Katakana is one of the scripts used for writing Japanese.

To see all the scripts that can be detected using the osd model, you can extract the model and then open the unicharset file.
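
As a rough illustration of the step above, the following Python sketch unpacks the osd model and collects the script names from its unicharset file. It assumes the `combine_tessdata` tool from Tesseract's training tools is on the PATH and that each unicharset entry carries the script name either as the third field (older layout) or right after a comma-separated glyph-metrics field (newer layout). The function names and paths are hypothetical, not part of Tesseract or pytesseract.

```python
import subprocess
from pathlib import Path


def unpack_traineddata(traineddata: str, out_prefix: str) -> Path:
    """Unpack a .traineddata file and return the path of its unicharset.

    Assumption: combine_tessdata is installed; out_prefix should end with
    a dot (e.g. "/tmp/osd.") so the file is written as "/tmp/osd.unicharset".
    """
    subprocess.run(
        ["combine_tessdata", "-u", traineddata, out_prefix], check=True
    )
    return Path(out_prefix + "unicharset")


def scripts_in_unicharset(text: str) -> set[str]:
    """Collect the distinct script names found in unicharset text."""
    scripts = set()
    # The first line holds the number of entries, so skip it.
    for line in text.splitlines()[1:]:
        fields = line.split()
        if len(fields) < 3:
            continue
        # Assumed layouts: "char props script ..." or
        # "char props glyph,metrics script ...".
        if "," in fields[2] and len(fields) > 3:
            scripts.add(fields[3])
        else:
            scripts.add(fields[2])
    # "NULL" marks entries without a script, so it is not reported.
    scripts.discard("NULL")
    return scripts
```

A possible use, with placeholder paths: `scripts_in_unicharset(unpack_traineddata("osd.traineddata", "/tmp/osd.").read_text(encoding="utf-8"))`.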

@guigarfr
Author

OK, I was just reporting that the documentation seems incomplete, then. There are more scripts than the ones listed there: Katakana at least, and possibly others I don't know of.
