OCR processing on Linux

Created by: Lester Caine, Last modification: 26 Feb 2025 (19:34 UTC)

My current processing of PDFs, which is purely to make them searchable, hinges on OCRmyPDF, and its manual is another one I need to read. It is essentially just a wrapper around the Tesseract OCR engine, which recognizes more than 100 languages, but I'm happy with just English.
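As a note to self, here is a minimal sketch of driving OCRmyPDF from Python for a single file. It assumes the `ocrmypdf` command is installed and on the PATH, and that the English Tesseract language pack (`eng`) is present. The file names and the helper names `build_ocr_command` and `ocr_pdf` are my own placeholders, not anything OCRmyPDF provides.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(src: Path, dst: Path, language: str = "eng") -> list[str]:
    """Build an ocrmypdf command line for one PDF (hypothetical helper)."""
    return [
        "ocrmypdf",
        "-l", language,   # Tesseract language pack to use
        "--skip-text",    # leave pages that already carry a text layer alone
        str(src),
        str(dst),
    ]


def ocr_pdf(src: Path, dst: Path, language: str = "eng") -> bool:
    """Run ocrmypdf on one file and report whether it exited cleanly."""
    if shutil.which("ocrmypdf") is None:
        print("ocrmypdf not found on PATH")
        return False
    result = subprocess.run(build_ocr_command(src, dst, language))
    return result.returncode == 0


if __name__ == "__main__":
    # Placeholder file names; this only prints the command that would be run.
    print(" ".join(build_ocr_command(Path("scan.pdf"), Path("scan_ocr.pdf"))))
```

Keeping the command building separate from the running means I can check what would be executed before letting it loose on real scans.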

While there are several wrappers listed for Tesseract, I'm currently playing with gImageReader, but I think it's the underlying code I need to understand first. Another offering that is showing some promise is ScribeOCR, and while that is an online service, the code base would let me run a local copy if that turns out to be a better option. Documentation for that is here.

There are several other packages listed on the Tesseract site, many of which can be ignored, but there may be something else useful among them.

In addition to the OCR engine, I've downloaded a shed load of other applications that were supposed to help pre-process the scanned PDFs. I'm sure a lot of them could be wiped, but I need to go through and work out just what is useful in a Swiss army knife of tools. For instance, OCRmyPDF (I think) provided an extension to the Dolphin file manager menu where I can select it directly and start it running on a number of PDFs. Tailoring that function and adding my own preferences to the list would be good to do.
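A rough sketch of what a tailored version of that batch function might look like: a small script that takes the selected PDFs as arguments and runs OCRmyPDF on each with my preferred options. I have not checked how the existing Dolphin menu entry is wired up, so this assumes the file manager can simply pass the selected file paths on the command line. The `_ocr` output suffix and the `plan_batch` helper are my own inventions, and the option list (`-l eng`, `--skip-text`, `--deskew`) is just a starting set of preferences.

```python
import subprocess
import sys
from pathlib import Path


def plan_batch(pdfs: list[Path], suffix: str = "_ocr") -> list[tuple[Path, Path]]:
    """Pair each input PDF with an output path beside it (hypothetical helper).

    Non-PDF files, files that are themselves earlier outputs, and inputs
    whose output already exists are skipped.
    """
    jobs = []
    for src in pdfs:
        if src.suffix.lower() != ".pdf" or src.stem.endswith(suffix):
            continue
        dst = src.with_name(src.stem + suffix + ".pdf")
        if not dst.exists():
            jobs.append((src, dst))
    return jobs


def main(argv: list[str]) -> None:
    for src, dst in plan_batch([Path(a) for a in argv]):
        # Assumed preferences: English only, skip pages with text, straighten scans.
        cmd = ["ocrmypdf", "-l", "eng", "--skip-text", "--deskew", str(src), str(dst)]
        status = subprocess.run(cmd).returncode
        outcome = "ok" if status == 0 else "failed (exit code " + str(status) + ")"
        print(src.name + ": " + outcome)


if __name__ == "__main__":
    main(sys.argv[1:])
```

Writing the output next to the original with a suffix means nothing gets overwritten, and a second run over the same selection only picks up files that have not been done yet.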
