clear query| facets| time Search criteria: .   Results from 1 to 10 from 25 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-22666] Spark datasource for image format - Spark - [issue]
...The current API for the new image format is implemented as a standalone feature, in order to make it reside within the mllib package. As discussed in SPARK-21866, users should be able to loa...
http://issues.apache.org/jira/browse/SPARK-22666    Author: Timothy Hunter , 2018-09-25, 10:11
[SPARK-25124] VectorSizeHint.size is buggy, breaking streaming pipeline - Spark - [issue]
...Currently, when using VectorSizeHint().setSize(3) in an ML pipeline, transforming a stream will return a nondescript exception about the stream not started. At core are the following bugs th...
http://issues.apache.org/jira/browse/SPARK-25124    Author: Timothy Hunter , 2018-08-24, 22:41
[SPARK-21866] SPIP: Image support in Spark - Spark - [issue]
...Background and motivationAs Apache Spark is being used more and more in the industry, some new use cases are emerging for different data formats beyond the traditional SQL types or the numer...
http://issues.apache.org/jira/browse/SPARK-21866    Author: Timothy Hunter , 2018-06-08, 19:20
[SPARK-23996] Implement the optimal KLL algorithms for quantiles in streams - Spark - [issue]
...The current implementation for approximate quantiles - a variant of Grunwald-Khanna, which I implemented - is not the best in light of recent papers: - it is not exactly the one from the pap...
http://issues.apache.org/jira/browse/SPARK-23996    Author: Timothy Hunter , 2018-04-23, 18:25
[SPARK-19635] Feature parity for Chi-square hypothesis testing in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.Statistics.chiSqTest over to spark.ml.Here is a design doc:https://docs.google.com/document/d/1ELVpGV3EBjc2KQPLN9_9_Ge9gWchPZ6SGtD...
http://issues.apache.org/jira/browse/SPARK-19635    Author: Timothy Hunter , 2018-03-15, 01:55
[SPARK-19634] Feature parity for descriptive statistics in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.MultivariateOnlineSummarizer over to spark.ml.A design has been discussed in SPARK-19208 . Here is a design doc:https://docs.googl...
http://issues.apache.org/jira/browse/SPARK-19634    Author: Timothy Hunter , 2018-01-29, 11:50
[SPARK-20077] Documentation for ml.stats.Correlation - Spark - [issue]
...Now that (Pearson) correlations are available in spark.ml, we need to write some documentation to go along with this feature. It can simply be looking at the unit tests for example right now...
http://issues.apache.org/jira/browse/SPARK-20077    Author: Timothy Hunter , 2017-11-06, 08:25
[SPARK-12210] Small example that shows how to integrate spark.mllib with spark.ml - Spark - [issue]
...Since we are missing a number of algorithms in spark.ml such as clustering or LDA, we should have a small example that shows the recommended way to go back and forth between spark.ml and spa...
http://issues.apache.org/jira/browse/SPARK-12210    Author: Timothy Hunter , 2017-04-08, 09:57
[SPARK-20076] Python interface for ml.stats.Correlation - Spark - [issue]
...The (Pearson) statistics have been exposed with a Dataframe interface as part of SPARK-19636 in the Scala interface. We should now make these available in Python....
http://issues.apache.org/jira/browse/SPARK-20076    Author: Timothy Hunter , 2017-04-07, 09:00
[SPARK-19636] Feature parity for correlation statistics in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.Statistics.corr() over to spark.ml.Here is a design doc:https://docs.google.com/document/d/1ELVpGV3EBjc2KQPLN9_9_Ge9gWchPZ6SGtDW5t...
http://issues.apache.org/jira/browse/SPARK-19636    Author: Timothy Hunter , 2017-03-24, 01:43