I play with open-source OCR (Optical Character Recognition) packages periodically. My last foray was a few years ago when I bought a tablet PC and wanted to scan in some of my course books so I could carry just one thing to school. I tried every package I could find, and none of them worked well enough even to consider using.
Read more »Tesseract: an Open-Source Optical Character Recognition Engine
http://www.linuxjournal.com –