I need to convert some screenshots of text back into actual text. I originally wanted an OCR program capable of batch processing, but instead I stacked a bunch of screenshots together into one image to avoid needing that. They are screenshots of video metadata, because some genius decided to archive the video info of a whole series as JPEGs... Yeah...
"There must be some lunatic out there who lost all these episodes in a crash, so I'll archive a bunch of useful info on my site and make it unsearchable JPEGs to make his job that much harder." Seriously...
Anyway, I only tried FreeOCR so far and it... sucks. 'Nuff said. Any other recommendations?
Did you try training Tesseract? (FreeOCR uses the Tesseract library.)
Without Tesseract you are probably forced to use commercial software, and it seems ABBYY FineReader is frequently mentioned as the best.
Gray-scale or black and white text is easier for an OCR.
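In case it helps, here is a rough sketch of that preprocessing step. It assumes ImageMagick's `convert` command is installed (newer versions call it `magick`), and the 50% threshold is only a starting guess that you would tune per image. `to_bw` is just a helper name made up for this example.

```shell
# Sketch: turn each JPEG into a black-and-white PNG before running OCR.
# Assumes ImageMagick's `convert` is on the PATH; 50% is a guess, not a rule.
to_bw() {
  # $1 = input image, $2 = output image
  convert "$1" -colorspace Gray -threshold 50% "$2"
}

for f in *.jpg; do
  [ -f "$f" ] || continue          # nothing matched, or not a regular file
  to_bw "$f" "${f%.jpg}-bw.png"    # e.g. shot.jpg -> shot-bw.png
done
```

Writing to PNG avoids adding a second round of JPEG compression artifacts on top of the ones already in the screenshots.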
ABBYY FineReader and Nuance OmniPage are the best commercial options.
As for freeware, you can try online OCR services, which I believe use Tesseract. They aren't bad.
I got stuck on this part of that tutorial.
N=662 # set accordingly to the number of files that you have
for i in `seq 0 $N`; do
    tesseract $i.bmp $i batch.nochop makebox
done
bash: seq: command not found
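That error means bash could not find the `seq` program on your system. The loop only uses it to count from 0 to N, so the counting can be done by the shell itself. A minimal sketch, assuming your files really are named 0.bmp through 662.bmp as in the tutorial: `count_up` is a made-up helper name, and the file-exists check is an extra that the tutorial does not have.

```shell
# Sketch: the tutorial's loop without the external `seq` command.
N=662   # set to the number of the last file you have

# Hypothetical stand-in for `seq 0 N`: prints 0..$1, one number per line.
# The body runs in a subshell, so its counter does not leak out.
count_up() (
  i=0
  while [ "$i" -le "$1" ]; do
    echo "$i"
    i=$((i + 1))
  done
)

for i in $(count_up "$N"); do
  [ -f "$i.bmp" ] || continue    # added guard: skip numbers with no file
  tesseract "$i.bmp" "$i" batch.nochop makebox
done
```

If your shell is bash, `for ((i = 0; i <= N; i++)); do ... done` should do the same counting without the helper function.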