Hello Karl,

 

I have to do a last test, I do it between noon and two o'clock and I inform you of the good functioning of this patch if it works

 

 

 

De : Karl Wright [mailto:[EMAIL PROTECTED]]
Envoyé : jeudi 11 janvier 2018 18:09
À : [EMAIL PROTECTED]
Objet : Re: Document connector excluding mime type and size - Tika Parser error

 

No Tika error is good, but have a look at Simple History to be sure documents were actually processed.  If you can confirm that, I'll kick off the patch process.

 

Karl

 

 

On Thu, Jan 11, 2018 at 11:26 AM, msaunier <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

Ok. So.

 

With the same configuration but Tika 1.17 :

 

·        No Tika error

·        But, no documents send to Solr. I don’t understand why. I research.

 

 

 

 

De : msaunier [mailto:[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> ]
Envoyé : jeudi 11 janvier 2018 15:32
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : RE: Document connector excluding mime type and size - Tika Parser error

 

I crawl for the moment. I think, I would have finished in 30 minutes.

 

 

 

De : Karl Wright [mailto:[EMAIL PROTECTED]]
Envoyé : jeudi 11 janvier 2018 15:05
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : Re: Document connector excluding mime type and size - Tika Parser error

 

Did this work for you?

Karl

 

On Thu, Jan 11, 2018 at 6:36 AM, Karl Wright <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

If you need the jcifs connector, run "ant make-deps" too.  Then run "ant build" again.

 

Karl

 

On Thu, Jan 11, 2018 at 4:30 AM, msaunier <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

Hello Karl,

 

I have build and configured but WindowsShare connector do not appear in the list of repository connectors.

 

·        I have add jcifs.jar into the connectors/jcifs/lib-proprietary directory

·        I have ant make-core-deps

·        Ant build

·        Uncomment windows share into the connectors-proprietary.xml file in the dist folder

·        I have add jcifs.jar in connector-lib-proprietary

 

But not have the proposition on the manifold interface.

 

Any idea ?

Thanks.

 

 

De : msaunier [mailto:[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> ]
Envoyé : mercredi 10 janvier 2018 18:15
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : RE: Document connector excluding mime type and size - Tika Parser error

 

Good !

 

I configure and test that.

I give you a return as soon as the reading is finished.

400k documents.

 

If it works, I test on few million of documents.

 

Thank.

 

 

De : Karl Wright [mailto:[EMAIL PROTECTED]]
Envoyé : mercredi 10 janvier 2018 17:45
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : Re: Document connector excluding mime type and size - Tika Parser error

 

The build you should be using is the ant build.  Do not use the maven build for this purpose.

 

- Check out trunk:

 

svn co https://svn.apache.org/repos/asf/manifoldcf/trunk

 

- Download dependencies:

 

ant make-core-deps

 

- Build:

 

ant build

 

- Your deliverable is in the "dist" directory

 

Karl

 

 

On Wed, Jan 10, 2018 at 11:37 AM, msaunier <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

I have an error with the maven build, so I have test with an external 1.17 Tika Server but, POI not included. If you success a mvn package with 1.17 Tika, I am interested.

 

Today, I have not had much time to deal with it.

 

I found some bugs that I would declare tomorrow if they are not already. They concern log4j2, local_fr and a bug with the web interface and the keyboard input key.

 

I continu my investigation.

 

De : Karl Wright [mailto:[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> ]
Envoyé : mercredi 10 janvier 2018 17:15
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : Re: Document connector excluding mime type and size - Tika Parser error

 

Any news?

Karl

 

On Tue, Jan 9, 2018 at 1:10 PM, Karl Wright <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

Let me know what happens.
If it works for you, I'll see if we can put together a patch release of 2.9 with the fix.

 

Karl

 

 

On Tue, Jan 9, 2018 at 11:07 AM, msaunier <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

Test check out and building with POI 3.17 and Tika 1.17?

 

It’s possible.

 

I finish a project and I test that.

 

De : Karl Wright [mailto:[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> ]
Envoyé : mardi 9 janvier 2018 16:57
À : [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
Objet : Re: Document connector excluding mime type and size - Tika Parser error

 

So here's the problem; we used POI 3.17 with Tika 3.16 in 2.9, in order to deal with the classloader issue present in POI 3.15, and because POI 3.16 has a severe security issue that made it impossible to ship with.

 

Unfortunately that doesn't quite work; POI 3.17 is not backwards compatible with 3.16 completely and therefore problems occur with this combination.

 

The probable solution is to check out and build trunk and see if that works for you.  It very well might.  The question then is what to do next, because we are not scheduled to release again until April.  We might have to do a point release to deal with this.

 

Please give it a try and let me know what happens.

 

Thanks,

Karl

 

 

On Tue, Jan 9, 2018 at 10:29 AM, Karl Wright <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

Ok, never mind that last email.  We patched it in part in 2.9 by including the latest POI.  So clearly it's still an existing problem in POI.  I'll have to open a ticket there and await a patch from them.

 

Karl

 

On Tue, Jan 9, 2018 at 10:27 AM, Karl Wright <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> > wrote:

This screenshot cannot be MCF 2.9 since the version of poi was not 3.17 for the 2.9