clear query| facets| time Search criteria: author:"Josh Rosen".   Results from 31 to 40 from 589 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-986] Add job cancellation to PySpark - Spark - [issue]
...We should add support for job cancellation to PySpark.  It would also be nice to be able to cancel jobs via ctrl-c in the PySpark shell....    Author: Josh Rosen , 2014-04-25, 03:21
[SPARK-988] Write PySpark profiling guide - Spark - [issue]
...Write a guide on profiling PySpark applications.  I've done this in the past by modifying the workers to make cProfile dumps, then using various tools to collect and merge those dumps i...    Author: Josh Rosen , 2015-01-03, 22:45
[SPARK-1004] PySpark on YARN - Spark - [issue]
...This is for tracking progress on supporting YARN in PySpark.We might be able to use yarn-client mode (    Author: Josh Rosen , 2016-06-15, 07:46
[SPARK-721] Fix remaining deprecation warnings - Spark - [issue]
...The recent patch to re-enable deprecation warnings fixed many of them, but there's still a few left; it would be nice to fix them.For example, here's one in RDDSuite:[warn] /Users/joshrosen/...    Author: Josh Rosen , 2014-06-22, 07:04
[SPARK-725] Ran out of disk space on EC2 master due to Ganglia logs - Spark - [issue]
...This morning, I started a Spark Standalone cluster on EC2 using 50 m1.medium instances.  When I tried to rebuild Spark ~5.5 hours later, the build failed because the master ran out of d...    Author: Josh Rosen , 2015-02-26, 00:05
[SPARK-726] Possible bugs in zip() transformation - Spark - [issue]
...A couple of bugs in the zip() transformation were reported on the mailing list, so I thought I'd link them here so they aren't forgotten:    Author: Josh Rosen , 2013-04-07, 18:16
[SPARK-732] Recomputation of RDDs may result in duplicated accumulator updates - Spark - [issue]
...Currently, Spark doesn't guard against duplicated updates to the same accumulator due to recomputations of an RDD.  For example:    val acc = sc.accumulator(0)    da...    Author: Josh Rosen , 2016-02-24, 22:23
[SPARK-733] Add documentation on use of accumulators in lazy transformation - Spark - [issue]
...Accumulators updates are side-effects of RDD computations.  Unlike RDDs, accumulators do not carry lineage that would allow them to be computed when their values are accessed on the mas...    Author: Josh Rosen , 2015-01-16, 21:33
[SPARK-745] Document Scala environment configuration when Scala is installed from RPM - Spark - [issue]
...As points out, the Typesafe Scala RPM installs the scala executable in /usr/bin/ and places the Scala library JARs in /usr/share/java/.That pull reque...    Author: Josh Rosen , 2013-06-30, 17:14
[SPARK-748] Add documentation page describing interoperability with other software (e.g. HBase, JDBC, Kafka, etc.) - Spark - [issue]
...Spark seems to be gaining a lot of data input / output features for integrating with systems like HBase, Kafka, JDBC, Hadoop, etc.It might be a good idea to create a single documentation pag...    Author: Josh Rosen , 2016-01-18, 10:24