clear query| facets| time Search criteria: author:"Michael McCandless".   Results from 21 to 30 from 54 (0.0s).
Loading phrases to help you
refine your search...
[TIKA-981] Text isn't extracted from PDF pop-up annotations - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-981    Author: Michael McCandless , 2012-09-02, 12:59
[TIKA-982] RTF document embedded into Word (.doc) document is extracted as .unknown - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-982    Author: Michael McCandless , 2012-09-02, 14:10
[TIKA-986] NullPointerException trying to parse detached .pk7s signature - Tika - [issue]
...Our Pkcs7Parser tries to pull the signed content out and then parsesthat, but if the signature is detached then there is no content (weget null return from CMSSignedDataParser.getSignedConte...
http://issues.apache.org/jira/browse/TIKA-986    Author: Michael McCandless , 2012-09-02, 14:10
[TIKA-987] Embedded drawing (SHAPE MERGEFORMAT) sometimes not extracted - Tika - [issue]
...I have two Word docs, both containing the same drawing, but one hastext added.In one case (picture.doc) the extraction is correct: it contains onlyan embedded image.wmf; when I view the imag...
http://issues.apache.org/jira/browse/TIKA-987    Author: Michael McCandless , 2017-12-08, 18:49
[TIKA-988] We don't extract a placeholder for a Word document embedded in an Excel document - Tika - [issue]
...In TIKA-956 we fixed the Word parser so that at the point where an embedded document appears, we output a <div class="embedded" id="_XXX"/> tag.It would be nice to do this for document...
http://issues.apache.org/jira/browse/TIKA-988    Author: Michael McCandless , 2017-12-08, 18:49
[TIKA-989] We don't extract a placeholder for documents embedded in a Word OOXML (.docx) document - Tika - [issue]
...In TIKA-956 we fixed the Word parser so that at the point where an embedded document appears, we output a <div class="embedded" id="_XXX"/> tag.It would be nice to do this for document...
http://issues.apache.org/jira/browse/TIKA-989    Author: Michael McCandless , 2012-09-11, 15:19
[TIKA-997] Leave a placeholder when documents are embedded in .pptx documents - Tika - [issue]
...Just like TIKA-956, we should leave a <div class="embedded" id="XXX"> to record where a given sub-document appeared....
http://issues.apache.org/jira/browse/TIKA-997    Author: Michael McCandless , 2012-09-28, 12:47
[TIKA-999] RTF Parser doesn't extract page/word/character count metadata - Tika - [issue]
http://issues.apache.org/jira/browse/TIKA-999    Author: Michael McCandless , 2012-09-26, 14:49
[TIKA-702] Cannot compile Tika with Java 7 (ImageMetadataExtractor.java) - Tika - [issue]
...Spinoff from user thread "Closing streams (Was: Tika leaves files open)" started by Jukka on 8/30/2011 (http://markmail.org/message/6iq4tapiwzpmanhw).ImageMetadatExtractor is (indirectly) us...
http://issues.apache.org/jira/browse/TIKA-702    Author: Michael McCandless , 2011-10-20, 12:34
[TIKA-705] Valid OOXML PPT file hits InvalidFormatException thrown in POI - Tika - [issue]
...I took the "testRTFVarious.rtf" test case from TIKA-683, and saved it as various other doc types, to generate more test cases.But when I did this for PPTX, the resulting file hits this excep...
http://issues.apache.org/jira/browse/TIKA-705    Author: Michael McCandless , 2011-12-20, 06:22