Python 3 port may have broken ocr.py #5

Open
VickyFrenzy opened this issue Jul 2, 2021 · 1 comment · May be fixed by #6

Comments

@VickyFrenzy

VickyFrenzy commented Jul 2, 2021

#3's cb0530f and/or #4's 356e77f changed tesseract to pytesseract, but I don't think they are the same library (or it was a really old version), since I'm getting this error:

Traceback (most recent call last):
  File "ocr.py", line 224, in <module>
    main()
  File "ocr.py", line 219, in main
    blurbs = ocr_on_bounding_boxes(binary, components)
  File "ocr.py", line 134, in ocr_on_bounding_boxes
    api = pytesseract.TessBaseAPI()
AttributeError: module 'pytesseract' has no attribute 'TessBaseAPI'

The ocr.py code may need to be rewritten to work with (latest) pytesseract.
https://github.com/madmaze/pytesseract

I'm on Python 3.8.
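
For anyone attempting the rewrite, here is a minimal sketch of what a pytesseract-based version of the failing function could look like. It is not the repo's actual code: the `(x, y, w, h)` box format, the `recognize` parameter, the `recognize_with_pytesseract` helper, and the `lang`/`config` defaults are all assumptions made for illustration. The only pytesseract call used is `pytesseract.image_to_string`, which accepts a PIL image or NumPy array and replaces the old `TessBaseAPI` object.

```python
def recognize_with_pytesseract(region, lang="eng", config=""):
    """OCR one cropped region (hypothetical helper, not from the repo).

    pytesseract is imported lazily so this module still loads when the
    package or the tesseract binary is not installed.
    """
    import pytesseract

    # image_to_string accepts a PIL image or a NumPy array.
    return pytesseract.image_to_string(region, lang=lang, config=config)


def ocr_on_bounding_boxes(image, boxes, recognize=None):
    """Run OCR on each bounding box of a 2-D NumPy-style image.

    Assumption: each box is an (x, y, w, h) tuple in pixel coordinates.
    Returns a list of (box, text) pairs, skipping boxes with no text.
    """
    if recognize is None:
        recognize = recognize_with_pytesseract

    blurbs = []
    for (x, y, w, h) in boxes:
        # Rows are indexed first, then columns, as with NumPy arrays.
        region = image[y:y + h, x:x + w]
        text = recognize(region).strip()
        if text:
            blurbs.append(((x, y, w, h), text))
    return blurbs
```

The `recognize` argument is there so the cropping logic can be exercised with a stand-in function when tesseract itself is unavailable. For Japanese text, a different `lang` value would be needed, and the matching tesseract language data would have to be installed.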

@johnoneil
Owner

Thanks for reporting this. I really haven't been using or maintaining this repo in years.
I might take a stab at fixing this over the weekend, but I don't have much interest in maintaining Python code, which is plagued with versioning issues.
Still, I'm a little happy someone somewhere is playing with this code.

@blueaxis linked a pull request on Sep 7, 2021 that will close this issue