Results 1 to 10 of 753 (0.0s).
[SPARK-18105] LZ4 failed to decompress a stream of shuffled data - Spark - [issue]
...When lz4 is used to compress the shuffle files, decompression may fail with "stream is corrupt". Caused by: org.apache.spark.SparkException: Job aborted due to stage failure: Task 92 in st...
http://issues.apache.org/jira/browse/SPARK-18105    Author: Davies Liu, 2019-11-25, 11:27
[SPARK-5091] Hooks for PySpark tasks - Spark - [issue]
...Currently, it's not convenient to add a package on the executor to PYTHONPATH (we do not assume the environments of the driver and executor are identical). It would be nice to have a hook to be called befo...
http://issues.apache.org/jira/browse/SPARK-5091    Author: Davies Liu, 2019-05-21, 05:37
[SPARK-3916] recognize appended data in textFileStream() - Spark - [issue]
...Right now, we only find new data in new files; data appended to old files (processed in the last batch) will not be processed. In order to support this, we need partialRDD(), which is an RDD...
http://issues.apache.org/jira/browse/SPARK-3916    Author: Davies Liu, 2019-05-21, 05:37
[SPARK-3153] shuffle will run out of space when disks have different free space - Spark - [issue]
...If we have several disks in SPARK_LOCAL_DIRS, and one of them is much smaller than the others (maybe added by mistake, or a special disk such as an SSD), then the shuffle will meet the problem of run out...
http://issues.apache.org/jira/browse/SPARK-3153    Author: Davies Liu, 2019-05-21, 05:37
[SPARK-14333] Duration of task should be the total time (not just computation time) - Spark - [issue]
...Right now, the duration of a task on the Stage page is the task computation time; it should be the total time (including serialization and fetching results). We should also have a separate column for co...
http://issues.apache.org/jira/browse/SPARK-14333    Author: Davies Liu, 2019-05-21, 04:37
[SPARK-13409] Log the stacktrace when stopping a SparkContext - Spark - [issue]
...Sometimes we see a stopped SparkContext and have no idea what stopped it; we should log that for troubleshooting....
http://issues.apache.org/jira/browse/SPARK-13409    Author: Davies Liu, 2019-05-21, 04:37
[SPARK-18032] Spark test failed as OOM in jenkins - Spark - [issue]
...I saw some tests fail with OOM recently, for example, https://amplab.cs.berkeley.edu/jenkins/job/spark-master-test-maven-hadoop-2.6/1998/console#l10n-footer Maybe we should increase the heaps...
http://issues.apache.org/jira/browse/SPARK-18032    Author: Davies Liu, 2019-05-21, 04:37
[SPARK-12472] OOM when sort a table and save as parquet - Spark - [issue]
...t = sqlContext.table('store_sales')
t.unionAll(t).coalesce(2).sortWithinPartitions(t[0]).write.partitionBy('ss_sold_date_sk').parquet("/tmp/ttt")
15/12/21 14:35:52 WARN TaskSetManager: Lost ta...
http://issues.apache.org/jira/browse/SPARK-12472    Author: Davies Liu, 2019-05-21, 04:36
[SPARK-10572] Investigate the contentions between tasks in the same executor - Spark - [issue]
...According to the benchmark results from Jesse F Chen, it's surprising to see so much difference (4X) in terms of the number of executors; we should investigate the reason. > Just be curi...
http://issues.apache.org/jira/browse/SPARK-10572    Author: Davies Liu, 2019-05-21, 04:36
[SPARK-14784] Build SQL for EXISTS/IN subquery - Spark - [issue]
http://issues.apache.org/jira/browse/SPARK-14784    Author: Davies Liu, 2019-05-21, 04:36