On Fri, Aug 27, 2021 at 05:55:44PM -0700, Al Kossow via cctalk wrote:
> I was also just thinking you would probably have to have a layer (black)
> with all of the stuff to OCR, including the stuff in red and blue, then
> overlay the color on that after the pass through whatever you're using to
> do the OCR.
>
> The one bottleneck I would really like to fix is getting the 24 cores on
> my machine doing OCR on 24 different pages at the same time.
The documentation for ocrmypdf describes how to do that.
https://ocrmypdf.readthedocs.io/en/latest/
and
https://ocrmypdf.readthedocs.io/en/latest/batch.html
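
As far as I know, ocrmypdf already spreads the pages of a single PDF across the available cores, and its --jobs option caps that number. If the scans are instead split into many separate PDFs, something like the rough Python sketch below could keep all 24 cores busy by running several ocrmypdf processes side by side. The function names, directory layout, and worker count are my own placeholders, not anything taken from the ocrmypdf docs, and I have not tried this on your files.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def build_command(src, dst, jobs=1):
    """Build one ocrmypdf command line (hypothetical helper).

    --jobs limits how many pages a single ocrmypdf run processes at once;
    it is set to 1 here because the parallelism comes from running many files.
    """
    return ["ocrmypdf", "--jobs", str(jobs), str(src), str(dst)]


def ocr_directory(in_dir, out_dir, workers=24, run=subprocess.run):
    """Run ocrmypdf on every *.pdf in in_dir, up to `workers` files at a time.

    `run` is injectable so the scheduling can be exercised without ocrmypdf
    being installed. Returns a list of (input_path, result) pairs.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    pdfs = sorted(Path(in_dir).glob("*.pdf"))
    cmds = [build_command(p, out / p.name) for p in pdfs]
    # The threads only wait on child processes; the OCR work itself
    # happens inside the separate ocrmypdf processes.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(lambda cmd: run(cmd, check=False), cmds))
    return list(zip(pdfs, results))
```

Calling ocr_directory("scans", "ocr_out") would then OCR scans/*.pdf into ocr_out/, assuming ocrmypdf is on the PATH. For one large multi-page PDF, plain "ocrmypdf --jobs 24 in.pdf out.pdf" is probably all that is needed.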
Don