clear query| facets| time Search criteria: .   Results from 21 to 30 from 49901 (0.0s).
Loading phrases to help you
refine your search...
[NUTCH-1377] Add option to index via CloudSolrServer instead - Nutch - [issue]
...Nutch indexes to a specific Solr server. With SolrCloud on its way we can still use the current indexer and point to any server. However, the SolrCloudServer can connect to ZooKeeper instead...
http://issues.apache.org/jira/browse/NUTCH-1377    Author: Markus Jelsma , 2018-10-15, 12:41
[NUTCH-2653] ProtocolFactory.getProtocol(url) creates separate plugin instances for http/https - Nutch - [issue]
...Fetcher creates two instances of the protocol-okhttp plugin, one to handle http requests, another for https. The plugin properties are logged during plugin instantiation when calling setConf...
http://issues.apache.org/jira/browse/NUTCH-2653    Author: Sebastian Nagel , 2018-10-15, 12:16
[NUTCH-2375] Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce - Nutch - [issue]
...Nutch is still using the deprecated org.apache.hadoop.mapred dependency which has been deprecated. It need to be updated to org.apache.hadoop.mapreduce dependency....
http://issues.apache.org/jira/browse/NUTCH-2375    Author: Omkar Reddy , 2018-10-15, 11:49
[NUTCH-1842] crawl.gen.delay has a wrong default value in nutch-default.xml or is being parsed incorrectly - Nutch - [issue]
...this is from nutch-default.xml:<property>  <name>crawl.gen.delay</name>  <value>604800000</value>  <description>   This value, ex...
http://issues.apache.org/jira/browse/NUTCH-1842    Author: kaveh minooie , 2018-10-15, 10:21
[NUTCH-1121] JUnit test for parse-js - Nutch - [issue]
...This issue is part of the larger attempt to provide a Junit test case for every Nutch plugin....
http://issues.apache.org/jira/browse/NUTCH-1121    Author: Lewis John McGibbney , 2018-10-13, 16:53
[NUTCH-1021] Migrate OutlinkExtractor from Apache ORO to java.util.regex - Nutch - [issue]
...Migrate from deprecated ORO to Java util regex....
http://issues.apache.org/jira/browse/NUTCH-1021    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-1014] Migrate from Apache ORO to java.util.regex - Nutch - [issue]
...A separate issue tracking migration of all components from Apache ORO to java.util.regex. Components involved are: RegexURLNormalzier OutlinkExtractor JSParseFilter MoreIndexingFilter BasicU...
http://issues.apache.org/jira/browse/NUTCH-1014    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-1678] Remove dependency on org.apache.oro - Nutch - [issue]
...org.apache.oro has been archived for three years and it may be good to remove the dependency as Java has had built in regexes for quite some time now. There don't seem to have been any speci...
http://issues.apache.org/jira/browse/NUTCH-1678    Author: James Sullivan , 2018-10-13, 16:53
[NUTCH-2192] Get rid of oro - Nutch - [issue]
...Couple of classes still rely on oro, we should get rid of it....
http://issues.apache.org/jira/browse/NUTCH-2192    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-2606] MIME detection is wrong for plain-text documents send as Content-Type "application/msword" - Nutch - [issue]
...Plain-text documents send as Content-Type "application/msword" are tried to parse as Word documents. The MIME detection should be fixed, so that these are correctly identified as plain-text ...
http://issues.apache.org/jira/browse/NUTCH-2606    Author: Sebastian Nagel , 2018-10-13, 12:00