Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Anyone knows a PDF OCR tool? I am using a free online one. I take pictures with opennotescanner which spits out a PDF which I want searchable.

Tesseracts expects PNG and outputs to text. I want the same PDF to be hidden overlaid with OCR text.

This free PDF online service does a decent job but offline would be better.



Free, but also online: https://ocr.space/searchablepdf

It comes with an API, so you can integrate it with your workflow.


I tried that. Their free api is too little of use. 1MB file.

I use PDF24.org which actually does everything great.

Anything offline?


Free & offline: Tesseract + pdfsandwich, see http://www.tobias-elze.de/pdfsandwich/




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: