Today, the Tesseract OCR project, only supports the English language, and does not yet include a page layout analysis module, so it performs poorly on material with multiple columns. "It also doesn't do well on grayscale and color documents, and it's not nearly as accurate as some of the best commercial OCR packages out there," Vincent wrote on the company blog.
Nepaies ne pieci gadi, kā līdz mums nonāks arī OCR latviski. Nekas cits, kā Finereader pagaidām nav atrasts. It sevišķi - opensourcisks.