[TIKA-2675] OpenDocumentParser should fail on invalid zip files - Tika - [issue]
...The OpenDocumentParser assumes a zip file as container. However, if it is called on an invalid zip stream from a remote URL (see NUTCH-2603), the parser signals success and returns a documen...
   Author: Sebastian Nagel, 2018-07-06, 15:25
Thread-safety and locking of methods Tika.detect(...) and MimeType.detect(...) - Tika - [mail # user]
...Hi, two questions regarding thread-safety and locking in Tika's MIME type detectors while investigating global locks in NUTCH-2578 (multi-threaded fetcher) [1]. First, are the methods Tika.dete...
   Author: Sebastian Nagel, 2018-05-17, 15:51
Tika content detection and crawled "remote" content - Tika - [mail # user]
...Hi, a follow-up based on Tika 1.16 for the July crawl: # Tika-1.16 HTTP-Content-Ty...
   Author: Sebastian Nagel, 2017-08-10, 09:24
Adding a WARC parser to Tika - Tika - [mail # user]
...FYI, for a similar task - testing crawler-commons parser - I've started a small testtools which reads the sitemaps from WARC files:
   Author: Sebastian Nagel, 2017-07-11, 18:12
[TIKA-2422] Improve detection of Graphviz *.dot format - Tika - [issue]
...Detection of Graphviz document formats could be improved by adding either *.dot as glob pattern (conflicts with the more frequent MSWord templates) or a magic pattern which catches the .dot lan...
   Author: Sebastian Nagel, 2017-07-06, 13:59
[TIKA-1503] TestGDALParser fails if gdalinfo does not support FITS - Tika - [issue]
...gdalinfo is used as external parser (see TIKA-605). The test testParseFITS fails if gdalinfo is compiled without support of FITS: testParseFITS(org.apache.tika.parser.gdal.TestGDALParser) ...
   Author: Sebastian Nagel, 2014-12-24, 12:06
TestGDALParser fails if gdalinfo does not support FITS - Tika - [mail # dev]
...Hi, if gdalinfo is compiled without support of FITS the test testParseFITS fails: testParseFITS(org.apache.tika.parser.gdal.TestGDALParser) Time elapsed: 0.206 sec <<< FAIL...
   Author: Sebastian Nagel, 2014-12-23, 13:19
[TIKA-1263] Atom feed failed to detect - Tika - [issue]
...Atom feeds with namespace are not detected as application/atom+xml. Trivial patch attached, sample feed taken from Wikipedia....
   Author: Sebastian Nagel, 2014-03-21, 13:38
encrypted PDF created with PDFMaker failed to parse - Tika - [mail # user]
...Hi, I have a bunch of PDF files - encrypted to prohibit changes and annotations (this matters because documents are forms) - created by Acrobat PDFMaker. Tika (1.3/trunk) fails t...
   Author: Sebastian Nagel, 2013-05-23, 10:55