Results 1 to 10 of 30 (0.0s).
[TIKA-2338] Change Scope of Jai-ImageIO-Core dependency - Tika - [issue]
...Looks like jai-imageio-core from GitHub (https://github.com/jai-imageio/jai-imageio-core), which we depend on with test scope, is Apache compatible. Note that it is a fork from the original Jai pr...
http://issues.apache.org/jira/browse/TIKA-2338    Author: Luis Filipe Nassif , 2018-03-09, 12:25
[TIKA-2568] Full encrypted 7Z file not detected as such - Tika - [issue]
...Fully encrypted 7zip containers that hide their subitem names are not detected as encrypted. The fix is to catch PasswordRequiredException when creating SevenZFile in PackageParser and rethrow it...
http://issues.apache.org/jira/browse/TIKA-2568    Author: Luis Filipe Nassif , 2018-03-08, 12:52
[TIKA-1466] Enable overriding of mimetype glob pattern definitions - Tika - [issue]
...I think it is important to enable overriding of the default tika-mimetypes.xml glob pattern definitions within a custom-mimetypes.xml. Currently, you cannot define in a custom mimetype a...
http://issues.apache.org/jira/browse/TIKA-1466    Author: Luis Filipe Nassif , 2018-03-07, 18:32
[TIKA-2390] Extract images embedded in Html - Tika - [issue]
...We should handle images embedded in HTML as attachments, like we do for other formats. Are there encodings other than base64 used out there to embed images in HTML?...
http://issues.apache.org/jira/browse/TIKA-2390    Author: Luis Filipe Nassif , 2018-02-23, 14:11
[TIKA-2469] False positives with x-ms-owner detection - Tika - [issue]
...The attached Windows system files are incorrectly detected as application/x-ms-owner. Tim Allison, did you add the magic for x-ms-owner? Is it possible to make the magic regex stricter?...
http://issues.apache.org/jira/browse/TIKA-2469    Author: Luis Filipe Nassif , 2017-10-13, 14:25
[TIKA-2428] EMFParser loops forever with corrupted files - Tika - [issue]
...EMFParser hangs with the attached corrupted EMF files. Sorry Tim Allison! Just now having time to test against our forensic test corpus......
http://issues.apache.org/jira/browse/TIKA-2428    Author: Luis Filipe Nassif , 2017-09-18, 13:02
[TIKA-2456] Emails extracted from MBOX not detected as rfc822 - Tika - [issue]
...Similar to TIKA-2454, because of recurrent detection issues with message/rfc822 (TIKA-2042, TIKA-1602, TIKA-879), children of mbox files could not be detected as rfc822, but they will always...
http://issues.apache.org/jira/browse/TIKA-2456    Author: Luis Filipe Nassif , 2017-08-31, 17:07
[TIKA-1865] Save sender email address in Outlook MSG metadata - Tika - [issue]
...The sender email address is lost when extracting metadata from Outlook msg files. Currently only the sender name is extracted. That is important information to extract for search engines....
http://issues.apache.org/jira/browse/TIKA-1865    Author: Luis Filipe Nassif , 2017-03-01, 20:39
[TIKA-2082] Upgrade to PDFBox 2.0.3 - Tika - [issue]
...PDFBox 2.0.3 was released with a number of fixes. Tika should upgrade....
http://issues.apache.org/jira/browse/TIKA-2082    Author: Luis Filipe Nassif , 2016-12-16, 16:06
[TIKA-1267] Improve Mbox file detection - Tika - [issue]
...Could we add the code below to the application/mbox mime-type definition: <magic priority="70"><match value="From " type="string" offset="0"/></magic> Or is it too common out there?...
http://issues.apache.org/jira/browse/TIKA-1267    Author: Luis Filipe Nassif , 2016-07-28, 11:34
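
Note on TIKA-1267: the proposed magic rule matches the five bytes "From " at offset 0. The sketch below is not Tika code. It is a minimal Python illustration of what such a rule tests and why the issue asks whether the pattern is too common. The function name and the sample inputs are hypothetical.

```python
# Illustrative only: mimics a string magic match at a fixed offset.
# MBOX_MAGIC is the value from the <match> element quoted in TIKA-1267;
# everything else here (names, samples) is an assumption for the sketch.
MBOX_MAGIC = b"From "


def matches_mbox_magic(data: bytes, offset: int = 0) -> bool:
    """Return True if `data` contains the magic bytes starting at `offset`."""
    return data[offset:offset + len(MBOX_MAGIC)] == MBOX_MAGIC


# A typical mbox file starts with a "From " separator line.
mbox_sample = b"From alice@example.org Thu Jul 28 11:34:00 2016\nSubject: hi\n\nbody\n"

# Ordinary prose can start with the same five bytes.
prose_sample = b"From the beginning, the plan was simple.\n"

if __name__ == "__main__":
    print(matches_mbox_magic(mbox_sample))
    print(matches_mbox_magic(prose_sample))
```

Both samples satisfy the rule, which is the false-positive risk the issue raises: a plain text file that happens to begin with the word "From" followed by a space is indistinguishable from an mbox file by this check alone.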