ModeShape / MODE-1639

Replace Aperture MIME type detector with one that uses Tika

      Description

      Tika has a MIME type detector that is simpler and perhaps just as effective as the current Aperture detector. Since we're already dependent on Tika for text extraction, we can remove the Aperture dependency altogether.

      We should also change how multiple MIME type detectors work together. Currently, the first detector to return a MIME type always wins, and Aperture always returns some MIME type (e.g., text/plain for text files or application/octet-stream for binary files), so any detector configured after it is never consulted. Rather than configuring the most capable detectors first, we should configure the most specialized first and fall back to the most general:

      1. Custom detectors (future enhancement, see MODE-xxxx)
      2. Extension-based detector that only knows about special cases (not as it is currently defined, with a complete 'mime.types' file).
      3. Tika-based detector
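
The ordering above can be sketched as a simple chain in which a detector may decline to answer. This is only an illustration under stated assumptions: the `MimeTypeDetector` interface, the `DetectorChain` class, and the `foo` extension mapping are hypothetical and are not ModeShape's actual API, and the general-purpose detector is a stand-in lambda marking where a Tika-backed implementation would go.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

final class DetectorChain {

    /** Hypothetical interface: an empty result means "no opinion, ask the next detector". */
    interface MimeTypeDetector {
        Optional<String> mimeTypeOf(String fileName);
    }

    /** Extension-based detector that only knows the given special cases. */
    static MimeTypeDetector byExtension(Map<String, String> specialCases) {
        return fileName -> {
            int dot = fileName.lastIndexOf('.');
            if (dot < 0 || dot == fileName.length() - 1) {
                return Optional.empty(); // no extension, so no opinion
            }
            String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
            return Optional.ofNullable(specialCases.get(ext));
        };
    }

    /** Asks each detector in order; the first non-empty answer wins. */
    static String detect(List<MimeTypeDetector> detectors, String fileName) {
        for (MimeTypeDetector detector : detectors) {
            Optional<String> type = detector.mimeTypeOf(fileName);
            if (type.isPresent()) {
                return type.get();
            }
        }
        return "application/octet-stream"; // nobody had an opinion
    }

    public static void main(String[] args) {
        // Most specialized first: only the special cases (illustrative mapping).
        MimeTypeDetector special = byExtension(Map.of("foo", "application/x-foo"));
        // Stand-in for the general detector; like Aperture, it always answers,
        // which is why it has to come last. A Tika-backed detector would go here.
        MimeTypeDetector general = name ->
                Optional.of(name.endsWith(".txt") ? "text/plain" : "application/octet-stream");

        List<MimeTypeDetector> chain = List.of(special, general);
        System.out.println(detect(chain, "data.foo"));  // application/x-foo
        System.out.println(detect(chain, "notes.txt")); // text/plain
    }
}
```

Because the specialized detector returns an empty result for anything it does not recognize, placing it first costs nothing for ordinary files and still lets it override the general detector for the special cases.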

      Note that we need to make sure to keep any extension in our existing 'mime.types' file that is not found in Tika.
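
One way to find the extensions worth keeping is a set difference between the existing mappings and the extensions the general detector already knows. The sketch below assumes the 'mime.types' file uses the common `type ext1 ext2 ...` line format with `#` comments, which may not match the actual file; `MimeTypesDiff` and its methods are hypothetical names, and the set of extensions known to Tika is passed in as a plain `Set` rather than queried from Tika itself.

```java
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

final class MimeTypesDiff {

    /** Parses "type ext1 ext2 ..." lines into an extension -> type map; '#' starts a comment. */
    static Map<String, String> parse(List<String> lines) {
        Map<String, String> typeByExtension = new TreeMap<>();
        for (String raw : lines) {
            int hash = raw.indexOf('#');
            String line = (hash >= 0 ? raw.substring(0, hash) : raw).trim();
            if (line.isEmpty()) {
                continue; // blank or comment-only line
            }
            String[] tokens = line.split("\\s+");
            // tokens[0] is the MIME type; the rest are extensions (first mapping wins).
            for (int i = 1; i < tokens.length; i++) {
                typeByExtension.putIfAbsent(tokens[i].toLowerCase(Locale.ROOT), tokens[0]);
            }
        }
        return typeByExtension;
    }

    /** Returns only those of our mappings whose extension is not already known elsewhere. */
    static Map<String, String> missingFrom(Map<String, String> ours, Set<String> knownElsewhere) {
        Map<String, String> keep = new TreeMap<>(ours);
        keep.keySet().removeAll(knownElsewhere);
        return keep;
    }

    public static void main(String[] args) {
        Map<String, String> ours = parse(List.of(
                "# illustrative entries only",
                "text/plain txt text",
                "application/x-foo foo"));
        // Pretend the general detector already handles these extensions.
        Set<String> known = Set.of("txt", "text");
        System.out.println(missingFrom(ours, known)); // {foo=application/x-foo}
    }
}
```

The surviving entries are the candidates for the trimmed-down, special-cases-only extension detector.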

      People

      Assignee: Randall Hauch (rhauch)
      Reporter: Randall Hauch (rhauch)