clear query| facets| time Search criteria: .   Results from 1 to 10 from 614 (0.0s).
Loading phrases to help you
refine your search...
[TIKA-2827] Improve tika-eval comparison reports to include mime types in A and B for diffs - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-2827    Author: Tim Allison , 2019-02-14, 15:11
[TIKA-2826] Add a csv/tsv parser - Tika - [issue]
...I'm not sure I want to wade into csv/tsv detection on this ticket, but it would be great to apply a csv parser at least for filenames ending in ".csv" or if a user "hints" that the file is a...
http://issues.apache.org/jira/browse/TIKA-2826    Author: Tim Allison , 2019-02-13, 17:58
[TIKA-2824] General dependency/plugin upgrades for next release - Tika - [issue]
...At least initially:jackcessopennlphttpcomponentszstd-jnicxf...
http://issues.apache.org/jira/browse/TIKA-2824    Author: Tim Allison , 2019-02-12, 16:09
[VOTE] Release Lucene/Solr 7.7.0 RC1 - Lucene - [mail # dev]
...+1 (non-binding)Thank you!On Wed, Feb 6, 2019 at 4:19 PM Nicholas Knize  wrote:>> +1 SUCCESS! [1:00:01.156649]>> On Wed, Feb 6, 2019 at 10:10 AM Uwe Schindler  wrote:&g...
   Author: Tim Allison , 2019-02-06, 21:55
by: java.util.zip.DataFormatException: invalid distance too far back reported by Solr API - Solr - [mail # user]
...>At the end of the day it would be a much better architecture to parse the> PDFs using plain standalone TikaServer+1Also, note that we added a -spawnChild switch to tika-server that wi...
   Author: Tim Allison , 2019-02-05, 15:29
[TIKA-2825] Make interrupter in tika-batch's child process actually optional - Tika - [issue]
...tika-eval uses tika-batch, but it only uses the child batch process because if there's a failure there, something went seriously wrong, and there shouldn't be a restart.The problem is that t...
http://issues.apache.org/jira/browse/TIKA-2825    Author: Tim Allison , 2019-02-01, 15:53
[TIKA-2822] Update common tokens files for tika-eval - Tika - [issue]
...We initially created the common tokens files (top 20k tokens by document frequency) in Wikipedia with Lucene 6.x.  We should rerun that code with an updated Lucene on the off chance tha...
http://issues.apache.org/jira/browse/TIKA-2822    Author: Tim Allison , 2019-01-30, 19:17
[expand - 1 more] - Fwd: Memory Errors with PDFBOX - Tika - [mail # user]
...forwarding to the correct pdfbox address... sorry for the noise...---------- Forwarded message ---------From: Tim Allison Date: Wed, Jan 30, 2019 at 10:29 AMSubject: Re: Memory Errors with P...
   Author: Tim Allison , 2019-01-30, 16:13
[TIKA-2823] Remove printstacktrace in XMLReaderUtils - Tika - [issue]
...Many apologies......
http://issues.apache.org/jira/browse/TIKA-2823    Author: Tim Allison , 2019-01-29, 18:39
TokenizerChain.getMultiTermAnalyzer().normalize() no longer normalizes multiterms in 8.x?! - Solr - [mail # user]
...All,  I don't know if this change was intended, but it feels like a bug to me...TokenFilterFactory[] filters = new TokenFilterFactory[2];filters[0] = new LowerCaseFilterFactory(Collecti...
   Author: Tim Allison , 2019-01-25, 13:32