Uploaded image for project: 'OCR Application'
  1. OCR Application
  2. OCR-7

Allow Tesseract OCR language files to be downloaded on a running instance

    XMLWordPrintable

Details

    • New Feature
    • Resolution: Fixed
    • Major
    • 1.0
    • 1.0
    • None
    • None
    • Unknown
    • N/A
    • N/A

    Description

      In order to work properly, Tesseract needs to have a language file to rely on.

      This language file will change depending on the language of the recognized text.

      Hopefully, Tesseract already provides training data here : https://github.com/tesseract-ocr/tessdata/tree/3.04.00

      Note that those files are version-specific, therefore, if we plan to upgrade the currently provided version of Tesseract (3.04.01) to a higher version in the future, we will have to change the URL used to download those files.

      Attachments

        Activity

          People

            caubin Clément Aubin
            caubin Clément Aubin
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: