Results 21 to 30 of 49901 (0.0s).
[NUTCH-1377] Add option to index via CloudSolrServer instead - Nutch - [issue]
...Nutch indexes to a specific Solr server. With SolrCloud on its way we can still use the current indexer and point to any server. However, the SolrCloudServer can connect to ZooKeeper instead...    Author: Markus Jelsma , 2018-10-15, 12:41
[NUTCH-2653] ProtocolFactory.getProtocol(url) creates separate plugin instances for http/https - Nutch - [issue]
...Fetcher creates two instances of the protocol-okhttp plugin, one to handle http requests, another for https. The plugin properties are logged during plugin instantiation when calling setConf...    Author: Sebastian Nagel , 2018-10-15, 12:16
[NUTCH-2375] Upgrade the code base from org.apache.hadoop.mapred to org.apache.hadoop.mapreduce - Nutch - [issue]
...Nutch is still using the deprecated org.apache.hadoop.mapred dependency. It needs to be updated to the org.apache.hadoop.mapreduce dependency....    Author: Omkar Reddy , 2018-10-15, 11:49
[NUTCH-1842] crawl.gen.delay has a wrong default value in nutch-default.xml or is being parsed incorrectly - Nutch - [issue]
...this is from nutch-default.xml:<property>  <name>crawl.gen.delay</name>  <value>604800000</value>  <description>   This value, ex...    Author: kaveh minooie , 2018-10-15, 10:21
[NUTCH-1121] JUnit test for parse-js - Nutch - [issue]
...This issue is part of the larger attempt to provide a JUnit test case for every Nutch plugin....    Author: Lewis John McGibbney , 2018-10-13, 16:53
[NUTCH-1021] Migrate OutlinkExtractor from Apache ORO to java.util.regex - Nutch - [issue]
...Migrate from deprecated ORO to Java util regex....    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-1014] Migrate from Apache ORO to java.util.regex - Nutch - [issue]
...A separate issue tracking migration of all components from Apache ORO to java.util.regex. Components involved are: RegexURLNormalizer OutlinkExtractor JSParseFilter MoreIndexingFilter BasicU...    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-1678] Remove dependency on org.apache.oro - Nutch - [issue]
...has been archived for three years and it may be good to remove the dependency as Java has had built in regexes for quite some time now. There don't seem to have been any speci...    Author: James Sullivan , 2018-10-13, 16:53
[NUTCH-2192] Get rid of oro - Nutch - [issue]
...Couple of classes still rely on oro, we should get rid of it....    Author: Markus Jelsma , 2018-10-13, 16:53
[NUTCH-2606] MIME detection is wrong for plain-text documents send as Content-Type "application/msword" - Nutch - [issue]
...Nutch tries to parse plain-text documents sent as Content-Type "application/msword" as Word documents. The MIME detection should be fixed, so that these are correctly identified as plain-text ...    Author: Sebastian Nagel , 2018-10-13, 12:00