pushing branch_1x to Apache snapshots? - Tika - [mail # dev]
...Hi All,  What do we have to do to push the 1.x branch to Apache snapshots?  Thank you!  Cheers, ...
   Author: Tim Allison , 2020-02-25, 12:23
[TIKA-3033] Upgrade to PDFBox 2.0.19 when available - Tika - [issue]    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3047] Upgrade to POI 4.1.2 - Tika - [issue]
...Now available at a maven repo near you!  Thank you Andreas Beeker for running the release!...    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3050] Add xmp extraction to psd files - Tika - [issue]    Author: Tim Allison , 2020-02-25, 08:22
[TIKA-3026] Consider extracting structure/tags where possible in PDFs with the PDFMarkedContentExtractor - Tika - [issue]
...Some PDFs contain tags that may be useful in understanding the structure of the elements within a PDF, e.g. table markup, paragraph breaks, headers, etc.    The quality of the tags depends e...    Author: Tim Allison , 2020-02-24, 19:02
Apache Tika Server Warning - Tika - [mail # user]
...1) If you want to extract inline images from PDFs or render PDFs and you are OK with the licenses, grab the jars specified in PDFBox's pom:
   Author: Tim Allison , 2020-02-21, 18:50
[TIKA-3049] Improve file detection...varia - Tika - [issue]
...I recently crawled a few bugzilla issue trackers to add files to our regression corpus.  I noticed that bugzilla is able to identify the mime types of a few file types that we're not, a...    Author: Tim Allison , 2020-02-20, 21:32
[COMPRESS and Tika/PDFBox/POI] files from bug trackers - Tika - [mail # dev]
...All,  I recently downloaded attachments from the following bug trackers: COMPRESS, TIKA, PDFBox, POI, Open Office, Libre Office and ghostscript:
   Author: Tim Allison , 2020-02-14, 21:49
[TIKA-3046] Add detection of some open office related formats - Tika - [issue]
...Add format detection for .cdr, .bau, .sob, .oxt, .odp, .odb. In unpacking attachments to Libre Office's bug tracker, I found that our zip package detector didn't recognize these formats....    Author: Tim Allison , 2020-02-14, 16:55
[TIKA-3041] ExtractInlineImages missing images from PDFBOX-52 - Tika - [issue]
...Tilman Hausherr noted on TIKA-3040 that Tika is likely missing the inline images on the file attached to PDFBOX-52.  He's right.  Let's fix this....    Author: Tim Allison , 2020-02-12, 18:23