clear query| facets| time Search criteria: .   Results from 1 to 10 from 89 (0.0s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Nutch 1.14 issues - Nutch - [mail # dev]
...Hi Sebastian,Sorry, clarifying my objectives:I am not frustrated, just trying to help. I did not write this message to request fixes for Arch. All these issues have been fixed in Arch, excep...
   Author: Arkadi.Kosmynin@... , 2018-06-12, 22:20
[expand - 1 more] - Arch 1.9.2 is available - Nutch - [mail # user]
...You are welcome.> -----Original Message-----> From: lewis john mcgibbney [mailto:[EMAIL PROTECTED]]> Sent: Friday, 30 September 2016 2:22 AM> To: [EMAIL PROTECTED]> Subject: R...
   Author: Arkadi.Kosmynin@... , 2016-09-29, 23:45
[expand - 2 more] - Bug: redirected URLs lost on indexing stage? - Nutch - [mail # user]
...Hi Sebastian,I meant #1 and used if http.redirect.max == 3.Thanks,Arkadi> -----Original Message-----> From: Sebastian Nagel [mailto:[EMAIL PROTECTED]]> Sent: Tuesday, 3 November 201...
   Author: Arkadi.Kosmynin@... , 2015-11-06, 04:09
[expand - 2 more] - A parser failure on a single document may fail crawling job - Nutch - [mail # user]
...Hi Sebastian,> -----Original Message-----> From: Sebastian Nagel [mailto:[EMAIL PROTECTED]]> Sent: Friday, 24 July 2015 6:39 AM> To: [EMAIL PROTECTED]> Cc: Kosmynin, Arkadi (C...
   Author: Arkadi.Kosmynin@... , 2015-07-30, 06:52
[expand - 1 more] - A bug in org.apache.nutch.parse.ParseUtil? - Nutch - [mail # user]
...Hi Sebastian,Yes, I considered parseResult.isSuccess(), but the problem is, it returns success only if all parses were successful. So, if the first parser succeeds, it will break the loop, e...
   Author: Arkadi.Kosmynin@... , 2015-04-21, 04:21
Nutch crawl commands and efficiency - Nutch - [mail # user]
...Hi,  I can't see from your description what exactly is slow, but I'd suggest to make sure that Nutch is using Hadoop native libraries. They make a huge difference for some operations. &...
   Author: Arkadi.Kosmynin@... , 2012-09-03, 23:44
focused crawl extended with user generated content - Nutch - [mail # user]
...Hi Magnus  > -----Original Message----- > From: Magnús Skúlason [mailto:[EMAIL PROTECTED]] > Sent: Wednesday, 13 June 2012 1:57 AM > To: [EMAIL PROTECTED] > Subject: focu...
   Author: Arkadi.Kosmynin@... , 2012-06-13, 01:00
[expand - 1 more] - Deletion of duplicates fails with org.apache.lucene.search.BooleanQuery$TooManyClauses - Nutch - [mail # user]
...>  > hi >  >  > > Hi, > > > > I started having this problem recently. For some reason, I did not > have it > > before, when working with...
   Author: Arkadi.Kosmynin@... , 2012-01-17, 03:20
Start crawl from Java without bin/nutch script - Nutch - [mail # user]
...The path should be C:/server/nutch/urls. I know this is not what you would expect from Cygwin, but it works.  Regards,  Arkadi  > -----Original Message----- > From: Lewi...
   Author: Arkadi.Kosmynin@... , 2012-01-16, 06:41
Drupal Integration with Nutch via CSIRO's Arch ? - Nutch - [mail # user]
...Hi Nicholas,  Thank you very much for your interest. I have a good news for you: we are moving our web sites to Drupal and thus will _have_ to integrate Arch with Drupal pretty soon. Pr...
   Author: Arkadi.Kosmynin@... , 2011-12-30, 03:10