clear query| facets| time Search criteria: .   Results from 1 to 10 from 18 (0.0s).
Loading phrases to help you
refine your search...
[TIKA-3055] Add an optional PreflightPDFParser - Tika - [issue]
...PDFBox's PreflightParser offers quite a bit of information about the validity of a PDF...I think this would be useful for for many.We'd leave the current PDFParser as the default, and users ...
http://issues.apache.org/jira/browse/TIKA-3055    Author: Tim Allison , 2020-02-29, 00:14
[TIKA-3057] Improve detection of zip-based formats - Tika - [issue]
...In crawling open office and libre office's bug trackers, I found a bunch of staroffice/libreoffice zip-based formats that we aren't currently detecting.  I also found that Apple changed...
http://issues.apache.org/jira/browse/TIKA-3057    Author: Tim Allison , 2020-02-28, 18:16
[TIKA-3036] broken build: "group id is too large" on a Mac - Tika - [issue]
...I recently got a failed build on a mac with this problem: https://issues.redhat.com/browse/KEYCLOAK-4563 The fix looks straightforward...add {tarLongFileMode} configuration:    <...
http://issues.apache.org/jira/browse/TIKA-3036    Author: Tim Allison , 2020-02-25, 21:11
pushing branch_1x to Apache snapshots? - Tika - [mail # dev]
...Hi All,  What do we have to do to push the 1.x branch to Apache snapshots?  Thankyou!            Cheers,            &nbs...
   Author: Tim Allison , 2020-02-25, 12:23
[TIKA-3056] General upgrades for 1.24 - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-3056    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3033] Upgrade to PDFBox 2.0.19 when available - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-3033    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3047] Upgrade to POI 4.1.2 - Tika - [issue]
...Now available at a maven repo near you!  Thank you Andreas Beeker for running the release!...
http://issues.apache.org/jira/browse/TIKA-3047    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3050] Add xmp extraction to psd files - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-3050    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3026] Consider extracting structure/tags where possible in PDFs with the PDFMarkedContentExtractor - Tika - [issue]
...Some PDFs contain tags that may be useful in understanding the structure of the elements within a PDF, e.g. table markup, paragraph breaks, headers, etc.    The quality of the tags depends e...
http://issues.apache.org/jira/browse/TIKA-3026    Author: Tim Allison , 2020-02-24, 19:02
[TIKA-3045] Allow users to run custom parsing of xfa and xmp - Tika - [issue]
...We currently do some processing of xfa and xmp, but some users may want more control over parsing these embedded file types....
http://issues.apache.org/jira/browse/TIKA-3045    Author: Tim Allison , 2020-02-24, 18:53