Results 31 to 40 of 763 (0.0s).
[SPARK-14604] Modify design of ML model summaries - Spark - [issue]
...Several models now have summaries containing evaluation metrics and training info: LinearRegressionModel, LogisticRegressionModel, GeneralizedLinearRegressionModel. These summaries have...    Author: Joseph K. Bradley , 2019-10-08, 05:41
[SPARK-15573] Backwards-compatible persistence for - Spark - [issue]
...This JIRA is for imposing backwards-compatible persistence for the DataFrames-based API for MLlib.  I.e., we want to be able to load models saved in previous versions of Spark.  We...    Author: Joseph K. Bradley , 2019-10-08, 05:41
[SPARK-13951] PySpark ml.pipeline support export/import - nested Pipelines - Spark - [issue]    Author: Joseph K. Bradley , 2019-09-24, 08:03
[SPARK-24467] VectorAssemblerEstimator - Spark - [issue]
...In SPARK-22346, I believe I made a wrong API decision: I recommended adding `VectorSizeHint` instead of making `VectorAssembler` into an Estimator since I thought the latter option would brea...    Author: Joseph K. Bradley , 2019-09-16, 18:26
[SPARK-23758] MLlib 2.4 Roadmap - Spark - [issue]
...Roadmap process: This roadmap is a master list for MLlib improvements we are working on during this release. This includes ML-related changes in PySpark and SparkR. What is planned for th...    Author: Joseph K. Bradley , 2019-07-20, 05:06
[SPARK-19039] UDF ClosureCleaner bug when UDF, col applied in paste mode in REPL - Spark - [issue]
...When I try this: (1) define UDF, (2) apply UDF to get Column, (3) use Column in a DataFrame, I can find weird behavior in the spark-shell when using paste mode. To reproduce this, paste this into the spark-s...    Author: Joseph K. Bradley , 2019-05-31, 07:14
[SPARK-19053] Supporting multiple evaluation metrics in DataFrame-based API: discussion - Spark - [issue]
...This JIRA is to discuss supporting the computation of multiple evaluation metrics efficiently in the DataFrame-based API for MLlib. In the RDD-based API, RegressionMetrics and other *Metrics ...    Author: Joseph K. Bradley , 2019-05-29, 06:03
[SPARK-5272] Refactor NaiveBayes to support discrete and continuous labels, features - Spark - [issue]
...This JIRA is to discuss refactoring NaiveBayes in order to support both discrete and continuous labels and features. Currently, NaiveBayes supports only discrete labels and features. Proposal:...    Author: Joseph K. Bradley , 2019-05-21, 05:37
[SPARK-4500] Improve exact stratified sampling implementation - Spark - [issue]
...The current implementation for exact stratified sampling (sampleByKeyExact) could be more efficient.  Proposed algorithm sketch: Sampling is done separately for each stratum.  Here...    Author: Joseph K. Bradley , 2019-05-21, 05:37
[SPARK-3717] DecisionTree, RandomForest: Partition by feature - Spark - [issue]
...Summary: Currently, data are partitioned by row/instance for DecisionTree and RandomForest.  This JIRA argues for partitioning by feature for training deep trees.  This is especially...    Author: Joseph K. Bradley , 2019-05-21, 05:37