Uploaded image for project: 'ModeShape'
  1. ModeShape
  2. MODE-1639

Replace Aperture MIME type detector with one that uses Tika

    XMLWordPrintable

Details

    • Enhancement
    • Resolution: Done
    • Critical
    • 3.0.0.CR1
    • 3.0.0.Beta4
    • Common, JCR, Sequencers
    • None

    Description

      Tika has a MIME type detector that is simpler and perhaps just as effective as the current Aperture detector. Since we're already dependent on Tika for text extraction, we can remove the Aperture dependency altogether.

      We should also change how multiple MIME type detectors work together. Currently, the first detector to return a MIME type will always win, and Aperture always returns some MIME type (e.g., text/plain for text files or application/octet-stream for binary files). So rather than have the most capable detectors configured first, we should have the most specialized:

      1. Custom detectors (future enhancement, see MODE-xxxx)
      2. Extension-based detector that only knows about special cases (not as it currently defined with a complete 'mime.types' file).
      3. Tika-based detector

      Note that we need to make sure to keep any extension in our existing 'mime.types' file that is not found in Tika.

      Attachments

        Issue Links

          Activity

            People

              rhauch Randall Hauch (Inactive)
              rhauch Randall Hauch (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: