Uploaded image for project: 'AI Platform Core Components'
  1. AI Platform Core Components
  2. AIPCC-623

Investigate what related tesseract data packages to install (Part 1)

      Although installing the language packs for French, German, Italian & Spanish can be done by simply doing what we do for the English language pack, there are several data RPMs that are related to our 5 languages. Should we install them or not?

      "Special" trained data packages:

      • osd (Orientation and script detection)
      • equ (Math equations)
        Note: These are the only 2 available

      Script packages (e.g., Cyrillic is a a script):

      • Latin (all 5 of our languages are written in this)
      • Fraktur (A German script, known for its distinctive fonts, used until the 1940s)

      Variant Language Packs:

      • German in Fraktur (previously misnamed as "Frankish" by upstream)

      Ancient Language Packs:

      • Middle English (1100-1500 AD)
      • Middle French (1400-1600 AD)
      • Old Spanish (used until 1500 roughly)
      • Old Italian (probably 1100-1550 AD)

      UPDATE:
      There's also the question of another language altogether:

      • Portuguese

              mdepaulo@redhat.com Mike DePaulo
              mdepaulo@redhat.com Mike DePaulo
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: