Uploaded image for project: 'ModeShape'
  1. ModeShape
  2. MODE-2746

Problem with MSOffice Sequencer

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • 5.4.1.Final
    • None
    • None
    • Undefined

      I am trying to use the sequencer for msoffice files. I started from the example on github at the following link https://github.com/ModeShape/modeshape-examples/tree/master/modeshape-sequencing-example. I added the msoffice sequencer dependency in the pom and added the sequencer configuration in the repository config file. I have several problems:

      1) if I try to load an xls, the sequencer correctly generates the node with the metadata. But if I add a second xls, the sequencer doesn't work (not even for the first file anymore)

      2) when I try to load a .doc or .docx file, the sequencer does not work. In the various tests I have done, I have understood that it could be a problem of versions of Tika and/or Apache POI: could you tell me the correct versions for version 5.4.1 or 5.4.0 of ModeShape? By adding print messages I realized that the sequencer is correctly initialized (it enters the initialize () function) but does not enter the execute ().

              Unassigned Unassigned
              francescagea Francesca Mainini (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated: