clear query| facets| time Search criteria: .   Results from 1 to 10 from 648 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-26858] Vectorized gapplyCollect, Arrow optimization in native R function execution - Spark - [issue]
...Unlike gapply, gapplyCollect requires additional ser/de steps because it can omit the schema, and Spark SQL doesn't know the return type before actually execution happens.In original code pa...
http://issues.apache.org/jira/browse/SPARK-26858    Author: Hyukjin Kwon , 2019-02-20, 20:51
[SPARK-26901] Vectorized gapply should not prune columns - Spark - [issue]
...Currently, if some columns can be pushed, it's being pushed through FlatMapGroupsInRWithArrow.explain(count(gapply(df,                    &n...
http://issues.apache.org/jira/browse/SPARK-26901    Author: Hyukjin Kwon , 2019-02-20, 10:12
[SPARK-26762] Arrow optimization for conversion from Spark DataFrame to R DataFrame - Spark - [issue]
...Like SPARK-25981, collect(rdf) can be optimized via Arrow....
http://issues.apache.org/jira/browse/SPARK-26762    Author: Hyukjin Kwon , 2019-02-20, 03:35
[SPARK-26759] Arrow optimization in SparkR's interoperability - Spark - [issue]
...Arrow 0.12.0 is release and it contains R API. We could optimize Spark DaraFrame <> R DataFrame interoperability.For instance see the examples below: dapply    df <- creat...
http://issues.apache.org/jira/browse/SPARK-26759    Author: Hyukjin Kwon , 2019-02-20, 03:24
[expand - 1 more] - [build system] Jenkins stopped working - Spark - [mail # dev]
...Thanks Shane!! <32019년 2월 20일 (수) 오전 10:13, Wenchen Fan 님이 작성:> Thanks Shane!>> On Wed, Feb 20, 2019 at 6:48 AM shane knapp  wrote:>>> alright, i increased the http...
   Author: Hyukjin Kwon , 2019-02-20, 02:29
[SPARK-26922] Set socket timeout consistently in Arrow optimization - Spark - [issue]
...For instance, see https://github.com/apache/spark/blob/e8982ca7ad94e98d907babf2d6f1068b7cd064c6/R/pkg/R/context.R#L184it should set the timeout from SPARKR_BACKEND_CONNECTION_TIMEOUT. Or may...
http://issues.apache.org/jira/browse/SPARK-26922    Author: Hyukjin Kwon , 2019-02-19, 06:52
[SPARK-26924] Document Arrow optimization and vectorized R APIs - Spark - [issue]
...We should update SparkR guide documentation, and some related documents, comments like in SQLConf.scala when most of tasks are finished....
http://issues.apache.org/jira/browse/SPARK-26924    Author: Hyukjin Kwon , 2019-02-19, 04:11
[SPARK-26923] Refactor ArrowRRunner and RRunner to deduplicate codes - Spark - [issue]
...ArrowRRunner and RRunner has already duplicated codes. We should refactor and deduplicate them. Also, ArrowRRunner happened to have a rather hacky code (see https://github.com/apache/spark/p...
http://issues.apache.org/jira/browse/SPARK-26923    Author: Hyukjin Kwon , 2019-02-19, 04:09
[SPARK-26921] Fix CRAN hack as soon as Arrow is available on CRAN - Spark - [issue]
...Arrow optimization was added but Arrow is not available on CRAN.So, it had to add some hacks to avoid CRAN check in SparkR side. For example, see https://github.com/apache/spark/search?q=req...
http://issues.apache.org/jira/browse/SPARK-26921    Author: Hyukjin Kwon , 2019-02-19, 03:29
[SPARK-26920] Deduplicate type checking across Arrow optimization and vectorized APIs in SparkR - Spark - [issue]
...There are duplication about type checking in Arrow <> SparkR code paths. For instance,https://github.com/apache/spark/blob/8126d09fb5b969c1e293f1f8c41bec35357f74b5/R/pkg/R/group.R#L229...
http://issues.apache.org/jira/browse/SPARK-26920    Author: Hyukjin Kwon , 2019-02-19, 03:23