clear query| facets| time Search criteria: .   Results from 1 to 10 from 853 (0.0s).
Loading phrases to help you
refine your search...
[expand - 2 more] - 2.0.19? - PDFBox - [mail # dev]
...Will do shortly.On Sun, Feb 16, 2020 at 10:33 AM Andreas Lehmkuehler wrote:> @Tim did you have the time to run the tests?>> Andreas>>> Am 07.02.20 um 01:22 schrieb Tim Alli...
   Author: Tim Allison , 2020-02-17, 11:46
[TIKA-3047] Upgrade to POI 4.1.2 - Tika - [issue]
...Now available at a maven repo near you!  Thank you Andreas Beeker for running the release!...
http://issues.apache.org/jira/browse/TIKA-3047    Author: Tim Allison , 2020-02-14, 21:59
[COMPRESS and Tika/PDFBox/POI] files from bug trackers - Tika - [mail # dev]
...All,  I recently downloaded attachments from the following bug trackers:COMPRESS, TIKA, PDFBox, POI, Open Office, Libre Office and ghostscript:http://162.242.228.174/docs/bugtrackers/&n...
   Author: Tim Allison , 2020-02-14, 21:49
[expand - 1 more] - [COMPRESS and Tika/PDFBox/POI] files from bug trackers - POI - [mail # dev]
...All,  I recently downloaded attachments from the following bug trackers:COMPRESS, TIKA, PDFBox, POI, Open Office, Libre Office and ghostscript:http://162.242.228.174/docs/bugtrackers/&n...
   Author: Tim Allison , 2020-02-14, 21:49
[TIKA-3046] Add detection of some open office related formats - Tika - [issue]
...Add format detection for .cdr, .bau, .sob, .oxt, .odp, .odb. In unpacking attachments to Libre Office's bug tracker, I found that our zip package detector didn't recognize these formats....
http://issues.apache.org/jira/browse/TIKA-3046    Author: Tim Allison , 2020-02-14, 16:55
[TIKA-3045] Allow users to run custom parsing of xfa and xmp - Tika - [issue]
...We currently do some processing of xfa and xmp, but some users may want more control over parsing these embedded file types....
http://issues.apache.org/jira/browse/TIKA-3045    Author: Tim Allison , 2020-02-14, 12:38
[TIKA-3026] Consider extracting structure/tags where possible in PDFs with the PDFMarkedContentExtractor - Tika - [issue]
...Some PDFs contain tags that may be useful in understanding the structure of the elements within a PDF, e.g. table markup, paragraph breaks, headers, etc.    The quality of the tags depends e...
http://issues.apache.org/jira/browse/TIKA-3026    Author: Tim Allison , 2020-02-13, 18:12
[TIKA-3041] ExtractInlineImages missing images from PDFBOX-52 - Tika - [issue]
...Tilman Hausherr noted on TIKA-3040 that Tika is likely missing the inline images on the file attached to PDFBOX-52.  He's right.  Let's fix this....
http://issues.apache.org/jira/browse/TIKA-3041    Author: Tim Allison , 2020-02-12, 18:23
[jira] [Commented] (TIKA-3040) PDF inline OCR: Exception while processing certain image (others in same PDF work) - Tika - [mail # dev]
...Eric,  Are you talking about the different OCR strategies for PDFs?  Thechallenge that it really isn't simple.I've tried to explain it:https://cwiki.apache.org/confluence/display/T...
   Author: Tim Allison , 2020-02-12, 18:23
[VOTE] Apache POI 4.1.2 release (RC3) - POI - [mail # dev]
...+1Thank you, Andi (and team)!http://162.242.228.174/reports/reports_poi_4.1.2-rc3.tgzOn Mon, Feb 10, 2020 at 3:38 PM Andreas Beeker  wrote:> Hi *,>> I've prepared artifacts for...
   Author: Tim Allison , 2020-02-11, 17:41