I need to convert some graphic text screenshots back into actual text. I originally wanted one capable of batch-processing but instead I stacked a bunch of screenshots together into one image to avoid that. They are screenshots of video metadata because some genius decided to archive video info of a whole series into JPEGs... Yeah...
There must be some lunatic out there who lost all these episodes in a crash so I'll archive a bunch of useful info on my site and make it unsearchable JPEG to make his job that much harder. Seriously...
Anyway, I only tried FreeOCR so far and it... sucks. 'Nuff said. Any other recommendations?
+ Reply to Thread
Results 1 to 5 of 5
Did you try to learn/train tesseract (freeocr use tesseract library).
Without tesseract you are probably forced to use commercial software and seem Abbyy Finereader is frequently mentioned as best.
Gray-scale or black and white text is easier for an OCR.
Abbyy Finereader and Nuance Omnipage are the best commercial.
As for freeware you can try online OCRs which I believe use tesseract, they aren't bad.
I got stuck on this part on that tutorial.
N=662 # set accordingly to the number of files that you have for i in `seq 0 $N`; do tesseract $i.bmp $i batch.nochop makebox done
bash: seq: command not found