clear query| facets| time Search criteria: author:"Joseph K. Bradley".   Results from 21 to 30 from 763 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-10931] PySpark ML Models should contain Param values - Spark - [issue]
...PySpark spark.ml Models are generally wrappers around Java objects and do not even contain Param values.  This JIRA is for copying the Param values from the Estimator to the model.This ...
http://issues.apache.org/jira/browse/SPARK-10931    Author: Joseph K. Bradley , 2017-08-23, 00:49
[SPARK-10779] Set initialModel for KMeans model in PySpark (spark.mllib) - Spark - [issue]
...Provide initialModel param for pyspark.mllib.clustering.KMeans...
http://issues.apache.org/jira/browse/SPARK-10779    Author: Joseph K. Bradley , 2015-10-07, 22:05
[SPARK-10785] Scale QuantileDiscretizer using distributed binning - Spark - [issue]
...SPARK-10064 improves binning in decision trees by distributing the computation.  QuantileDiscretizer should do the same....
http://issues.apache.org/jira/browse/SPARK-10785    Author: Joseph K. Bradley , 2016-04-06, 07:55
[SPARK-10788] Decision Tree duplicates bins for unordered categorical features - Spark - [issue]
...Decision trees in spark.ml (RandomForest.scala) communicate twice as much data as needed for unordered categorical features.  Here's an example.Say there are 3 categories A, B, C.  ...
http://issues.apache.org/jira/browse/SPARK-10788    Author: Joseph K. Bradley , 2017-04-12, 01:35
[SPARK-10809] Single-document topicDistributions method for LocalLDAModel - Spark - [issue]
...We could provide a single-document topicDistributions method for LocalLDAModel to allow for quick queries which avoid RDD operations.  Currently, the user must use an RDD of documents....
http://issues.apache.org/jira/browse/SPARK-10809    Author: Joseph K. Bradley , 2016-01-14, 20:47
[SPARK-10595] Various ML programming guide cleanups post 1.5 - Spark - [issue]
...Various ML guide cleanups. ml-guide.md: Make it easier to access the algorithm-specific guides. LDA user guide: EM often begins with useless topics, but running longer generally improves the...
http://issues.apache.org/jira/browse/SPARK-10595    Author: Joseph K. Bradley , 2015-09-16, 02:43
[SPARK-10602] Univariate statistics as UDAFs: single-pass continuous stats - Spark - [issue]
...See parent JIRA for more details.  This subtask covers statistics for continuous values requiring a single pass over the data, such as min and max.This JIRA is an umbrella.  For in...
http://issues.apache.org/jira/browse/SPARK-10602    Author: Joseph K. Bradley , 2015-10-05, 22:33
[SPARK-10603] Univariate statistics as UDAFs: multi-pass continuous stats - Spark - [issue]
...See parent JIRA for more details. This subtask covers statistics for continuous values requiring multiple passes over the data, such as median and quantiles.This JIRA is an umbrella. For ind...
http://issues.apache.org/jira/browse/SPARK-10603    Author: Joseph K. Bradley , 2015-10-05, 22:32
[SPARK-10604] Univariate statistics as UDAFs: categorical stats - Spark - [issue]
...See parent JIRA for more details. This subtask covers statistics for categorical values, such as number of categories or mode.This JIRA is an umbrella. For individual stats, please create an...
http://issues.apache.org/jira/browse/SPARK-10604    Author: Joseph K. Bradley , 2015-10-05, 22:30
[SPARK-5972] Cache residuals for GradientBoostedTrees during training - Spark - [issue]
...In gradient boosting, the current model's prediction is re-computed for each training instance on every iteration.  The current residual (cumulative prediction of previously trained tre...
http://issues.apache.org/jira/browse/SPARK-5972    Author: Joseph K. Bradley , 2015-04-28, 01:56