Uploaded image for project: 'XWiki Platform'
  1. XWiki Platform
  2. XWIKI-19447

Additional Thumbnails/thumbnail.png returned by tika when parsing odt

    XMLWordPrintable

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Major
    • Resolution: Solved By
    • Affects Version/s: 14.1-rc-1
    • Fix Version/s: None
    • Component/s: Search - Solr, Tika
    • Labels:
      None
    • Tests:
      Unit
    • Difficulty:
      Unknown
    • Documentation:
      N/A
    • Documentation in Release Notes:
      N/A
    • Similar issues:

      Description

      Since the upgrade to Tika 2.3.0 (XWIKI-19402), and more precisely https://issues.apache.org/jira/browse/TIKA-3644 and additional "Thumbnails/thumbnail.png" is returned when calling Tika#parseToString.

      We need to decide if:

      • we accept Thumbnails/thumbnail.png => then close the issue as invalid
      • we find a way to remove it without downgrading Tika => close the issue as fixed
      • downgrade Tika, but this can only be temporary because on the long run we need to have Tika up to date

      Corresponding failing test: org.xwiki.search.solr.internal.metadata.DocumentSolrMetadataExtractorTest#testAttachmentExtractFromOpenDocument

        Attachments

          Issue Links

            Activity

              People

              Assignee:
              MichaelHamann Michael Hamann
              Reporter:
              mleduc Manuel Leduc
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Dates

                Created:
                Updated:
                Resolved:
                Date of First Response: