clear query| facets| time Search criteria: .   Results from 1 to 10 from 61 (0.0s).
Loading phrases to help you
refine your search...
[KUDU-2483] Scan tablets with bloom filter - Kudu - [issue]
...Join is really common/popular in Spark SQL, in this JIRA I take broadcast join as an example and describe how Kudu's bloom filter can help accelerate distributed computing.Spark runs broadca...
http://issues.apache.org/jira/browse/KUDU-2483    Author: jin xing , 2018-09-20, 06:18
[KUDU-2521] Java Implementation for BloomFilter - Kudu - [issue]
...We need a Java version BloomFilter and have exactly same behavior with C++ version.Thus Spark can generate and submit BloomFilter when scan.We mean to keep the implementation independent  an...
http://issues.apache.org/jira/browse/KUDU-2521    Author: jin xing , 2018-09-14, 18:04
[KUDU-2520] Java Implementation for BloomFilter - Kudu - [issue]
http://issues.apache.org/jira/browse/KUDU-2520    Author: jin xing , 2018-07-30, 08:53
[SPARK-22384] Refine partition pruning when attribute is wrapped in Cast - Spark - [issue]
...Sql below will get all partitions from metastore, which put much burden on metastore;CREATE TABLE test (value INT) PARTITIONED BY (dt STRING)SELECT * from test where dt=2017The reason is tha...
http://issues.apache.org/jira/browse/SPARK-22384    Author: jin xing , 2018-07-04, 07:53
[SPARK-24379] BroadcastExchangeExec should catch SparkOutOfMemory and re-throw SparkFatalException, which wraps SparkOutOfMemory inside. - Spark - [issue]
...After SPARK-22827, Spark won't fails the entire executor but only fails the task suffering SparkOutOfMemoryError. In current BroadcastExchangeExec, it try-catch OutOfMemoryError. Think about...
http://issues.apache.org/jira/browse/SPARK-24379    Author: jin xing , 2018-06-24, 07:24
[SPARK-24294] Throw SparkException when OOM in BroadcastExchangeExec - Spark - [issue]
...When OutOfMemoryError thrown from BroadcastExchangeExec, scala.concurrent.Future will hit scala bug – https://github.com/scala/bug/issues/9554, and hang until future timeout:We could w...
http://issues.apache.org/jira/browse/SPARK-24294    Author: jin xing , 2018-05-23, 20:12
[SPARK-19659] Fetch big blocks to disk when shuffle-read - Spark - [issue]
...Currently the whole block is fetched into memory(offheap by default) when shuffle-read. A block is defined by (shuffleId, mapId, reduceId). Thus it can be large when skew situations. If OOM ...
http://issues.apache.org/jira/browse/SPARK-19659    Author: jin xing , 2018-05-17, 21:19
[SPARK-24193] Sort by disk when number of limit is big in TakeOrderedAndProjectExec - Spark - [issue]
...Physical plan of  "select colA from t order by colB limit M" is TakeOrderedAndProject;Currently TakeOrderedAndProject sorts data in memory, see https://github.com/apache/spark/blob/mast...
http://issues.apache.org/jira/browse/SPARK-24193    Author: jin xing , 2018-05-17, 14:30
[SPARK-24240] Add a config to control whether InMemoryFileIndex should update cache when refresh. - Spark - [issue]
...In current code(https://github.com/apache/spark/blob/master/sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/InsertIntoHadoopFsRelationCommand.scala#L172), after data is in...
http://issues.apache.org/jira/browse/SPARK-24240    Author: jin xing , 2018-05-10, 08:03
[SPARK-24143] filter empty blocks when convert mapstatus to (blockId, size) pair - Spark - [issue]
...In current code(MapOutputTracker.convertMapStatuses), mapstatus are converted to (blockId, size) pair for all blocks – no matter the block is empty or not, which result in OOM when the...
http://issues.apache.org/jira/browse/SPARK-24143    Author: jin xing , 2018-05-07, 06:19