Results 1 to 10 of 206 (0.0s).
[SPARK-30545] Impl Extremely Randomized Trees - Spark - [issue]
...1, Extremely Randomized Trees or ExtraTrees is widely used and implemented in Scikit-Learn and OpenCV; 2, ExtraTrees is quite similar to RandomForest, and the main difference lies in that, on each l...
http://issues.apache.org/jira/browse/SPARK-30545    Author: zhengruifeng , 2020-01-26, 15:58
[SPARK-30642] LinearSVC blockify input vectors - Spark - [issue]
http://issues.apache.org/jira/browse/SPARK-30642    Author: zhengruifeng , 2020-01-26, 12:05
[SPARK-29212] Add common classes without using JVM backend - Spark - [issue]
...Copied from https://github.com/apache/spark/pull/25776. Maciej's concern: Use cases for public ML type hierarchy. Add Python-only Transformer implementations: I am a Python user and wan...
http://issues.apache.org/jira/browse/SPARK-29212    Author: zhengruifeng , 2020-01-26, 04:44
[SPARK-30641] ML algs blockify input vectors - Spark - [issue]
...stacking input vectors into blocks will benefit ML algs: 1, less RAM to persist datasets, since the overhead of object headers is reduced; 2, optimization potential for impl, since high-level B...
http://issues.apache.org/jira/browse/SPARK-30641    Author: zhengruifeng , 2020-01-25, 13:12
[SPARK-30543] RandomForest add Param bootstrap to control sampling method - Spark - [issue]
...Current RF with numTrees=1 will directly build a tree using the original dataset, while with numTrees>1 it will use bootstrap samples to build trees. This design is to train a DecisionTreeM...
http://issues.apache.org/jira/browse/SPARK-30543    Author: zhengruifeng , 2020-01-23, 08:48
[SPARK-30503] OnlineLDAOptimizer does not handle persistance correctly - Spark - [issue]
...It seems that in OnlineLDAOptimizer, PeriodicGraphCheckpointer cannot unpersist edges correctly. scala> import org.apache.spark.ml.clustering.LDA import org.apache.spark.ml.clustering.LDA sc...
http://issues.apache.org/jira/browse/SPARK-30503    Author: zhengruifeng , 2020-01-23, 08:40
[SPARK-30202] impl QuantileTransform - Spark - [issue]
...Recently, I encountered some practical scenarios that required mapping the data to another distribution. Then I found that QuantileTransformer in sklearn is what I needed. I locally fitted a model on sampled...
http://issues.apache.org/jira/browse/SPARK-30202    Author: zhengruifeng , 2020-01-17, 15:37
[SPARK-29565] OneHotEncoder should support single-column input/ouput - Spark - [issue]
...Current feature algs (QuantileDiscretizer/Binarizer/Bucketizer/StringIndexer) are designed to support both single-col & multi-col. And there are already some internal utils (like checkSing...
http://issues.apache.org/jira/browse/SPARK-29565    Author: zhengruifeng , 2020-01-16, 15:14
[SPARK-29566] Imputer should support single-column input/ouput - Spark - [issue]
...Imputer should support single-column input/output; refer to https://issues.apache.org/jira/browse/SPARK-29565...
http://issues.apache.org/jira/browse/SPARK-29566    Author: zhengruifeng , 2020-01-16, 11:54
[SPARK-30502] PeriodicRDDCheckpointer supports storageLevel - Spark - [issue]
...Intermediate RDDs in ML are cached with storageLevel=StorageLevel.MEMORY_AND_DISK. PeriodicRDDCheckpointer will store RDDs with storageLevel=StorageLevel.MEMORY_ONLY; it may be nice to set the ...
http://issues.apache.org/jira/browse/SPARK-30502    Author: zhengruifeng , 2020-01-16, 03:02