Results 1 to 10 of 84.
[TIKA-2573] Map Data Extraction/OCR - Tika - [issue]
...NY Public Library has a rather interesting project that might unlock lots of map data for Tika: https://github.com/nypl-spacetime/map-vectorizer ...
http://issues.apache.org/jira/browse/TIKA-2573    Author: Grant Ingersoll, 2018-02-09, 19:44
[TIKA-568] Language Detection isReasonablyCertain() hides valuable information - Tika - [issue]
...LanguageIdentifier.isReasonablyCertain() hardcodes a threshold for language detection, which is fine, except applications should be allowed to decide what threshold suits them.  For ins...
http://issues.apache.org/jira/browse/TIKA-568    Author: Grant Ingersoll, 2016-02-04, 20:30
Multiple parsers for the same MIME type - Tika - [mail # dev]
...Thanks, will check it out. On Fri, Jan 2, 2015 at 5:07 PM, Jukka Zitting wrote: > Hi, > > 2015-01-02 16:37 GMT-05:00 Grant Ingersoll: > > I think the problem is that the file type...
   Author: Grant Ingersoll, 2015-01-03, 23:46
Parsers, DefaultConfig and such - Tika - [mail # user]
...On Mar 14, 2014, at 9:31 PM, Jukka Zitting wrote: > Hi, > > On Fri, Mar 14, 2014 at 5:13 PM, Grant Ingersoll wrote: >> On Mar 13, 2014, at 3:53 PM, Jukka Zitting...
   Author: Grant Ingersoll, 2014-03-15, 10:11
[TIKA-554] ParseUtils.getStringContent needs an option to set the write limit that can be passed into the BodyContentHandler - Tika - [issue]
...It would be helpful if one could specify the writeLimit to be used when using ParseUtils.getStringContent.  Patch shortly....
http://issues.apache.org/jira/browse/TIKA-554    Author: Grant Ingersoll, 2011-10-07, 09:02
[TIKA-433] Tika + Hadoop - Tika - [issue]
...Would be great to have a Tika contrib that took in an HDFS location with "rich" documents on it and an output format (or output processor) and converted the docs to XHTML or Solr or whatever...
http://issues.apache.org/jira/browse/TIKA-433    Author: Grant Ingersoll, 2011-10-07, 08:59
Mailing List Moderation - Tika - [mail # dev]
...Hi Tika,  I'm currently a moderator for Tika left over from the Lucene days, would someone more involved in Tika please step up and take on that responsibility and remove me from the du...
   Author: Grant Ingersoll, 2011-05-17, 12:15
PDF text extracted without spaces - Tika - [mail # user]
...Can you share more about how you are using it.  Also, can you show a test case?  -Grant  On Dec 3, 2010, at 12:26 AM, Ganesh wrote:  > Hello all, >  > I new...
   Author: Grant Ingersoll, 2010-12-03, 14:10
Upgrading Solr to Tika 0.8 - Tika - [mail # user]
...On Nov 30, 2010, at 11:58 AM, Grant Ingersoll wrote:  > I'm trying to upgrade Solr's version of Tika to 0.8 (https://issues.apache.org/jira/browse/SOLR-2241), but am getting some new...
   Author: Grant Ingersoll, 2010-11-30, 20:35
MimeType detection and fall back - Tika - [mail # user]
...Shall I open a bug for this?  On Nov 18, 2010, at 11:34 AM, Grant Ingersoll wrote:  > Hi, >  > I'm using MimeTypes::public MimeType getMimeType(String name, byte[] da...
   Author: Grant Ingersoll, 2010-11-30, 17:09