[TIKA-3055] Add an optional PreflightPDFParser - Tika - [issue]
...PDFBox's PreflightParser offers quite a bit of information about the validity of a PDF...I think this would be useful for many. We'd leave the current PDFParser as the default, and users ...    Author: Tim Allison , 2020-02-29, 00:14
[TIKA-3057] Improve detection of zip-based formats - Tika - [issue]
...In crawling OpenOffice's and LibreOffice's bug trackers, I found a bunch of StarOffice/LibreOffice zip-based formats that we aren't currently detecting.  I also found that Apple changed...    Author: Tim Allison , 2020-02-28, 18:16
[TIKA-3036] broken build: "group id is too large" on a Mac - Tika - [issue]
...I recently got a failed build on a Mac with this problem: The fix looks straightforward...add {tarLongFileMode} configuration:    <...    Author: Tim Allison , 2020-02-25, 21:11
pushing branch_1x to Apache snapshots? - Tika - [mail # dev]
...Hi All,  What do we have to do to push the 1.x branch to Apache snapshots?  Thank you!  Cheers,...    Author: Tim Allison , 2020-02-25, 12:23
[TIKA-3056] General upgrades for 1.24 - Tika - [issue]    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3033] Upgrade to PDFBox 2.0.19 when available - Tika - [issue]    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3047] Upgrade to POI 4.1.2 - Tika - [issue]
...Now available at a maven repo near you!  Thank you Andreas Beeker for running the release!...    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3050] Add xmp extraction to psd files - Tika - [issue]    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3026] Consider extracting structure/tags where possible in PDFs with the PDFMarkedContentExtractor - Tika - [issue]
...Some PDFs contain tags that may be useful in understanding the structure of the elements within a PDF, e.g. table markup, paragraph breaks, headers, etc.    The quality of the tags depends e...    Author: Tim Allison , 2020-02-24, 19:02
[TIKA-3045] Allow users to run custom parsing of xfa and xmp - Tika - [issue]
...We currently do some processing of XFA and XMP, but some users may want more control over parsing these embedded file types....    Author: Tim Allison , 2020-02-24, 18:53