Details

    • Type: Task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: 10.8
    • Fix Version/s: 10.9
    • Component/s: Dependency Upgrades
    • Labels:
      None
    • Difficulty:
      Unknown
    • Documentation:
      N/A
    • Similar issues:

      Description

      See http://www.apache.org/dist/tika/CHANGES-1.19.txt

         * Require Java 8 (TIKA-2679).
      
         * Enable building with Java 11 (TIKA-2668)
      
         * Add an option to make tika-server robust against infinite loops,
           OOMs, and memory leaks (TIKA-2725).
      
         * Allow configuration of the Tesseract parser via the standard
           tika-config.xml options (TIKA-2705).
      
         * Improve handling of empty cells across table-based
           formats (TIKA-2479).
      
         * Add a Standards compliant HTML encoding detector
           via Gerard Bouchar (TIKA-2673).
      
         * Improved XML parsing -- limited default entity expansions to 20.
           To raise this limit, add -Djdk.xml.entityExpansionLimit=XXX to
           your commandline.
      
         * Mime magic improvements for Olympus RAW (TIKA-2658), interpreted
           server-side languages via HTTP (TIKA-2648), MHTML (TIKA-2723)
      
         * Add absolute timeout to ForkParser rather than testing
           for active (TIKA-2656).
      
         * Make the RecursiveParserWrapper work with the ForkParser (TIKA-2655).
      
         * Allow the ForkParser to specify a directory containing tika-app.jar
           for use by the ForkServer.  This allows users to keep most of the
           parser dependencies out of their code; and it allows for an easy
           addition of optional jars for Parser dependencies,
           such as the xerial sqlite jar (TIKA-2653).
      
         * Use a pool for SAXParsers and DOMBuilders rather than creating
           a new parser/builder for every parse.
           For better performance, set XMLReaderUtils.setPoolSize() to the
           number of threads you're using with Tika (TIKA-2645).
      
         * Add the RecursiveParserWrapperHandler to improve the RecursiveParserWrapper
           API slightly (TIKA-2644).
      
         * Upgraded to Commons-Compress 1.18 (TIKA-2707).
      
         * Upgraded to Apache POI 4.0.0 (TIKA-2552).
      
         * Upgraded to Apache PDFBox 2.0.11 (TIKA-2681).
      
         * Upgraded to deeplearning4j 1.0.0-beta2 (TIKA-2672).
      
         * Upgraded jmatio to 1.4 (TIKA-2667)
      
         * Upgraded Apache Lucene to 7.4.0 in tika-eval and tika-examples (TIKA-2695).
      
         * Upgraded junrar to 1.0.1 (TIKA-2664).
      
         * Numerous other upgrades (TIKA-2692).
      
         * Excluded Spring as a transitive dependency (TIKA-2721).
      

        Attachments

          Issue Links

            Activity

              People

              • Assignee:
                tmortagne Thomas Mortagne
                Reporter:
                tmortagne Thomas Mortagne
              • Votes:
                0 Vote for this issue
                Watchers:
                2 Start watching this issue

                Dates

                • Created:
                  Updated:
                  Resolved:
                  Date of First Response: