clear query| facets| time Search criteria: .   Results from 1 to 10 from 343 (0.0s).
Loading phrases to help you
refine your search...
[NUTCH-1514] Phase out the deprecated configuration properties (if possible) - Nutch - [issue]
...In reference to [0], the deprecated configuration properties can be removed (only if possible without affecting the functionality).[0] : http://mail-archives.apache.org/mod_m...
http://issues.apache.org/jira/browse/NUTCH-1514    Author: Tejas Patil , 2018-07-10, 14:11
[NUTCH-1712] Use MultipleInputs in Injector to make it a single mapreduce job - Nutch - [issue]
...Currently Injector creates two mapreduce jobs:1. sort job: get the urls from seeds file, emit CrawlDatum objects.2. merge job: read CrawlDatum objects from both crawldb and output of sort jo...
http://issues.apache.org/jira/browse/NUTCH-1712    Author: Tejas Patil , 2016-02-25, 22:02
[NUTCH-1715] RobotRulesParser adds additional '*' to the robots name - Nutch - [issue]
...In RobotRulesParser, when Nutch creates a agent string from multiple agents, it combines agents from both 'http.agent.name' and 'http.robots.agents'. Along with that it appends a wildcard (i...
http://issues.apache.org/jira/browse/NUTCH-1715    Author: Tejas Patil , 2014-05-01, 06:22
[NUTCH-1721] Upgrade to Crawler commons 0.3 - Nutch - [issue]
http://issues.apache.org/jira/browse/NUTCH-1721    Author: Tejas Patil , 2014-05-01, 06:22
[NUTCH-1716] RobotRulesParser adds extra '*' to the robots name - Nutch - [issue]
...In RobotRulesParser, when Nutch creates a agent string from multiple agents, it combines agents from both 'http.agent.name' and 'http.robots.agents'. Along with that it appends a wildcard (i...
http://issues.apache.org/jira/browse/NUTCH-1716    Author: Tejas Patil , 2014-05-01, 06:22
Nutch didn't (fail) to create new segment dir - Nutch - [mail # user]
...The logs say this:>> Generator: 0 records selected for fetching, exiting ...This is because there are no urls that generator could pass to form asegment.>> Injector: total number...
   Author: Tejas Patil , 2014-02-15, 05:18
[expand - 1 more] - sizing guide - Nutch - [mail # user]
...On Wed, Feb 12, 2014 at 11:08 PM, Deepa Jayaveer wrote:> Thanks for your reply.>   I started off PoC with Nutch-MySQL. Planned to move to Nutch 2.1 with> Hbase> once I get a...
   Author: Tejas Patil , 2014-02-13, 08:58
[expand - 1 more] - [DISCUSS] Release Trunk - Nutch - [mail # dev]
...Thanks Lewis. G+ hangout sounds cool. Is this wiki page complete andupdated to start off ?http://wiki.apache.org/nutch/Release_HOWTOThanks,TejasOn Thu, Feb 13, 2014 at 12:23 AM, Lewis John M...
   Author: Tejas Patil , 2014-02-13, 08:53
[expand - 1 more] - HTML tag filtering - Nutch - [mail # user]
...That means that there were changes to the source files since the patch wascreated. You need to manually add the changes from patch to the sourcefiles.Thanks,TejasOn Thu, Feb 13, 2014 at 12:0...
   Author: Tejas Patil , 2014-02-13, 08:28
how cam I download the source code of Nutch's dependence jars - Nutch - [mail # user]
...Have you tried this ?http://java.dzone.com/articles/ivy-how-retrieve-source-codesThanks,TejasOn Wed, Feb 12, 2014 at 12:43 AM, Gavin  wrote:> Maven can do this.> How can i do this...
   Author: Tejas Patil , 2014-02-12, 16:02