Results 1 to 10 of 25.
[SPARK-22666] Spark datasource for image format - Spark - [issue]
...The current API for the new image format is implemented as a standalone feature, in order to make it reside within the mllib package. As discussed in SPARK-21866, users should be able to loa...    Author: Timothy Hunter , 2018-09-25, 10:11
[SPARK-25124] VectorSizeHint.size is buggy, breaking streaming pipeline - Spark - [issue]
...Currently, when using VectorSizeHint().setSize(3) in an ML pipeline, transforming a stream will return a nondescript exception about the stream not started. At core are the following bugs th...    Author: Timothy Hunter , 2018-08-24, 22:41
[SPARK-21866] SPIP: Image support in Spark - Spark - [issue]
...Background and motivation: As Apache Spark is being used more and more in the industry, some new use cases are emerging for different data formats beyond the traditional SQL types or the numer...    Author: Timothy Hunter , 2018-06-08, 19:20
[SPARK-23996] Implement the optimal KLL algorithms for quantiles in streams - Spark - [issue]
...The current implementation for approximate quantiles - a variant of Greenwald-Khanna, which I implemented - is not the best in light of recent papers: - it is not exactly the one from the pap...    Author: Timothy Hunter , 2018-04-23, 18:25
[SPARK-19635] Feature parity for Chi-square hypothesis testing in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.Statistics.chiSqTest over to is a design doc:    Author: Timothy Hunter , 2018-03-15, 01:55
[SPARK-19634] Feature parity for descriptive statistics in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.MultivariateOnlineSummarizer over to design has been discussed in SPARK-19208. Here is a design doc: https://docs.googl...    Author: Timothy Hunter , 2018-01-29, 11:50
[SPARK-20077] Documentation for ml.stats.Correlation - Spark - [issue]
...Now that (Pearson) correlations are available in, we need to write some documentation to go along with this feature. It can simply be looking at the unit tests for example right now...    Author: Timothy Hunter , 2017-11-06, 08:25
[SPARK-12210] Small example that shows how to integrate spark.mllib with - Spark - [issue]
...Since we are missing a number of algorithms in such as clustering or LDA, we should have a small example that shows the recommended way to go back and forth between and spa...    Author: Timothy Hunter , 2017-04-08, 09:57
[SPARK-20076] Python interface for ml.stats.Correlation - Spark - [issue]
...The (Pearson) statistics have been exposed with a Dataframe interface as part of SPARK-19636 in the Scala interface. We should now make these available in Python....    Author: Timothy Hunter , 2017-04-07, 09:00
[SPARK-19636] Feature parity for correlation statistics in MLlib - Spark - [issue]
...This ticket tracks porting the functionality of spark.mllib.Statistics.corr() over to is a design doc:    Author: Timothy Hunter , 2017-03-24, 01:43