Tesseract Information in Wiki Is Out of Date

Issue #475 resolved
Jon Marathon created an issue

The wiki states, "If you are trying to OCR a non English subtitle track, you can download from https://code.google.com/p/tesseract-ocr/downloads/list the Tesseract 3.02 language files for the language you want, and copy the .tessdata file inside ~/Library/Application Support/Subler/tessdata/..."

However, this download directory no longer exists. The data files for Tesseract 3.02 are at https://github.com/tesseract-ocr/tesseract/wiki/Data-Files#data-files-for-version-302.

Furthermore, there are no ".tessdata" files. There are ".tar.gz" files, which when extracted, contain ".cube" and ".traineddata" files.

Please clarify the instructions for copying tessdata files.

Comments (1)

  1. Log in to comment