clear query| facets| time Search criteria: author:"Xiangrui Meng".   Results from 1 to 10 from 60 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-27968] ArrowEvalPythonExec.evaluate shouldn't eagerly read the first batch - Spark - [issue]
...An issue mentioned here: https://github.com/apache/spark/pull/24734/files#r288377915, could be decoupled from that PR....
http://issues.apache.org/jira/browse/SPARK-27968    Author: Xiangrui Meng , 2019-06-07, 01:57
[SPARK-27364] User-facing APIs for GPU-aware scheduling - Spark - [issue]
...Design and implement: General guidelines for cluster managers to understand resource requests at application start. The concrete conf/param will be under the design of each cluster manager. ...
http://issues.apache.org/jira/browse/SPARK-27364    Author: Xiangrui Meng , 2019-06-05, 17:36
[SPARK-1486] Support multi-model training in MLlib - Spark - [issue]
...It is rare in practice to train just one model with a given set of parameters. Usually, this is done by training multiple models with different sets of parameters and then select the best ba...
http://issues.apache.org/jira/browse/SPARK-1486    Author: Xiangrui Meng , 2019-06-06, 13:57
[SPARK-1655] In naive Bayes, store conditional probabilities distributively. - Spark - [issue]
...In the current implementation, we collect all conditional probabilities to the driver node. When there are many labels and many features, this puts heavy load on the driver. For scalability,...
http://issues.apache.org/jira/browse/SPARK-1655    Author: Xiangrui Meng , 2019-06-06, 13:57
[SPARK-6617] Word2Vec is nondeterministic - Spark - [issue]
...Word2Vec uses repartition: https://github.com/apache/spark/blob/v1.3.0/mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala#L291, which doesn't provide deterministic ordering. ...
http://issues.apache.org/jira/browse/SPARK-6617    Author: Xiangrui Meng , 2019-06-06, 13:57
[SPARK-27887] Check python version and print deprecation warning if version < 3 - Spark - [issue]
...In Spark 3.0, users should see a deprecation warning if they use PySpark with Python < 3....
http://issues.apache.org/jira/browse/SPARK-27887    Author: Xiangrui Meng , 2019-06-04, 07:02
[SPARK-27372] Standalone executor process-level isolation to support GPU scheduling - Spark - [issue]
...As an admin, I can configure standalone to have multiple executor processes on the same worker node and processes are configured via cgroups so they only have access to assigned GPUs. So I d...
http://issues.apache.org/jira/browse/SPARK-27372    Author: Xiangrui Meng , 2019-06-11, 04:18
[SPARK-28056] Document SCALAR_ITER Pandas UDF - Spark - [issue]
...After SPARK-26412, we should document the new SCALAR_ITER Pandas UDF so user can discover the feature and learn how to use it....
http://issues.apache.org/jira/browse/SPARK-28056    Author: Xiangrui Meng , 2019-06-18, 03:52
[SPARK-27309] CypherSession implementation in spark-cypher - Spark - [issue]
...Implement CypherSession and RelationalCypherSession in spark-cypher module....
http://issues.apache.org/jira/browse/SPARK-27309    Author: Xiangrui Meng , 2019-06-12, 12:43
[SPARK-28030] Binary file data source doesn't support space in file names - Spark - [issue]
...echo 123 > "/tmp/test space.txt"spark.read.format("binaryFile").load("/tmp/test space.txt").count()...
http://issues.apache.org/jira/browse/SPARK-28030    Author: Xiangrui Meng , 2019-06-12, 20:24