Question: using ipysegment for OCR text? #10

joseberlines · 2020-11-22T13:59:53Z

Is it possible to select part of a PDF document in order to OCR it with some other library?
thx.

Use case: Official letters where you want to select and grab a part of the text to pass it to the clipboard.

ianhi · 2020-11-22T18:15:57Z

Unfortunately, that's probably not easy. This library has the assumption that your image can be represented as a numpy array baked in pretty deep. So if you can convert your PDF to a rasterized image then it could work, but it would no longer be a PDF.

joseberlines · 2020-11-23T01:16:16Z

It's not important that it remains a PDF. So, to be clear, the PDF has to be converted into a numpy "rastered image"?

ianhi · 2020-11-23T03:07:40Z

I think I may have been out of my depth when I used words like rastered. What I mean is: get it into a numpy array (i.e. something that you could call plt.imshow on with matplotlib) and it should work.

ianhi · 2020-11-23T03:10:38Z

Maybe you could use pdf2image (https://github.com/Belval/pdf2image) to get the array first.
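A rough sketch of what that could look like, not tested against this library. It assumes pdf2image and its poppler dependency are installed; the helper names `pdf_page_to_array` and `crop_to_selection` are made up for illustration, and the boolean mask stands in for whatever selection the segmentation step gives you.

```python
import numpy as np


def pdf_page_to_array(pdf_path, page_number=1, dpi=200):
    """Rasterize one page of a PDF into an (H, W, 3) uint8 numpy array."""
    # Imported lazily: pdf2image is a third-party package and needs poppler.
    from pdf2image import convert_from_path

    # convert_from_path returns a list of PIL images, one per requested page.
    pages = convert_from_path(
        pdf_path, dpi=dpi, first_page=page_number, last_page=page_number
    )
    return np.asarray(pages[0])


def crop_to_selection(image, mask):
    """Crop `image` to the bounding box of the True pixels in a 2D `mask`."""
    mask = np.asarray(mask, dtype=bool)
    rows = np.flatnonzero(mask.any(axis=1))  # rows containing selected pixels
    cols = np.flatnonzero(mask.any(axis=0))  # columns containing selected pixels
    if rows.size == 0:
        raise ValueError("mask selects no pixels")
    return image[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
```

The array returned by `pdf_page_to_array` is something you can call plt.imshow on. The cropped array from `crop_to_selection` could then be handed to whichever OCR library you prefer.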
