- changed status to resolved
Tesseract Information in Wiki Is Out of Date
Issue #475
resolved
The wiki states, "If you are trying to OCR a non English subtitle track, you can download from https://code.google.com/p/tesseract-ocr/downloads/list the Tesseract 3.02 language files for the language you want, and copy the .tessdata file inside ~/Library/Application Support/Subler/tessdata/..."
However, this download directory no longer exists. The data files for Tesseract 3.02 are at https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-302.
Furthermore, there are no ".tessdata" files. There are ".tar.gz" archives which, when extracted, contain ".cube" and ".traineddata" files.
Please clarify the instructions for copying tessdata files.
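For illustration, here is a rough sketch of what the copy step might look like once an archive has been downloaded. It rests on assumptions the wiki does not confirm: that Subler only needs the ".traineddata" files, and that they belong directly inside the tessdata folder quoted above. Whether the ".cube" files are also required is unknown. The function name `install_traineddata` is hypothetical.

```python
import tarfile
from pathlib import Path

# Assumed destination, taken from the path quoted in the wiki text.
DEFAULT_DEST = Path.home() / "Library" / "Application Support" / "Subler" / "tessdata"


def install_traineddata(archive, dest=DEFAULT_DEST):
    """Copy every .traineddata member of a .tar.gz archive into dest.

    Hypothetical helper: returns the sorted list of file names written.
    """
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    written = []
    with tarfile.open(archive, "r:gz") as tar:
        for member in tar.getmembers():
            name = Path(member.name).name
            # Skip directories and anything that is not a .traineddata file.
            if not (member.isfile() and name.endswith(".traineddata")):
                continue
            # Write under the bare file name so the archive's directory
            # prefix is dropped.
            (dest / name).write_bytes(tar.extractfile(member).read())
            written.append(name)
    return sorted(written)
```

Calling `install_traineddata` with the path to a downloaded archive would place that language's ".traineddata" file in the default folder. If the ".cube" files turn out to be needed as well, the filter would have to be widened.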
Comments (1)
repo owner
Link updated, thanks.