When comparing Tesseract OCR vs Cuneiform, the Slant community recommends Tesseract OCR for most people. In the question“What are the best Linux OCR programs?” Tesseract OCR is ranked 2nd while Cuneiform is ranked 3rd.
Ranked in these QuestionsQuestion Ranking
Pro Works great with 300 DPI files
Pro Very easy to use (see the manual page, not built-in help)
See the manual page.
Pro Support for 40 languages
In the beginning Tesseract only had support English. Now newer versions can support up to 40 languages.
Pro Can create sandwhich PDF files
Pro Preserves formatting
Besides offering pretty good text recognition, it also preserves the formatting and text differences.
Pro Spell checker
Cuneiform has a spell checker included which helps with correcting errors.
Con Rudimentary image processing
Tesseract's image processing is very rudimentary, in order to get the most out of it you need to use a preprocessor or use an image that's already been processed.
A for humans perfectly readable image 100 dpi results in a huge number of failed characters even if source is free from physical scan artifacts (i.e. print to file). If possible, it's better to use 300 dpi image files.
Con Only reads TIFF files
Tesseract can only read TIFF files. If you want to use a picture with a different format (JPEG, PNG, PDF) you need to convert it first.
Con Does not support PDF
Cuneiform has no support for converting PDF files to text format.