Matt Barnes wrote:
Tesseract is an OCR and can convert pdf's and images to text. I haven't gotten around to installing it and trying it out, but it seems like the OCR of choice, located here:http://sourceforge.net/project/showfiles.php?group_id=158586