clear query| facets| time Search criteria: .   Results from 1 to 10 from 330 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-24797] Analyzer should respect spark.sql.hive.convertMetastoreOrc/Parquet when build the data source table - Spark - [issue]
...the current code path ignore the value of spark.sql.hive.convertMetastoreParquet when building data source table  https://github.com/apache/spark/blob/e0559f238009e02c40f65678fec691c07904e8c...
http://issues.apache.org/jira/browse/SPARK-24797    Author: Nan Zhu , 2018-07-13, 17:14
[expand - 1 more] - Integrating ML/DL frameworks with Spark - Spark - [mail # dev]
........how I skipped the last part........On Tue, May 8, 2018 at 11:16 AM, Reynold Xin  wrote:> Yes, Nan, totally agree. To be on the same page, that's exactly what I> wrote wasn't ...
   Author: Nan Zhu , 2018-05-08, 18:17
[expand - 3 more] - broken UI in 2.3? - Spark - [mail # user]
...Found more clues534482 [SparkUI-85] WARN org.spark_project.jetty.servlet.ServletHandler -Error for/api/v1/applications/application_1517256006008_2695197/allexecutorsjava.lang.NoSuchMethodErr...
   Author: Nan Zhu , 2018-03-05, 23:32
[VOTE] Spark 2.3.0 (RC5) - Spark - [mail # dev]
...+1  (non-binding), tested with internal workloads and benchmarksOn Mon, Feb 26, 2018 at 12:09 PM, Michael Armbrust wrote:> +1 all our pipelines have been running the RC for several d...
   Author: Nan Zhu , 2018-02-27, 00:03
[SPARK-22673] InMemoryRelation should utilize on-disk table stats whenever possible - Spark - [issue]
...The current implementation of InMemoryRelation always uses the most expensive execution plan when writing cacheWith CBO enabled, we can actually have a more exact estimation of the underlyin...
http://issues.apache.org/jira/browse/SPARK-22673    Author: Nan Zhu , 2018-01-27, 22:41
[SPARK-19280] Failed Recovery from checkpoint caused by the multi-threads issue in Spark Streaming scheduler - Spark - [issue]
...In one of our applications, we found the following issue, the application recovering from a checkpoint file named "checkpoint-***166700000" but with the timestamp ***166500000 will recover f...
http://issues.apache.org/jira/browse/SPARK-19280    Author: Nan Zhu , 2018-01-23, 13:08
[SPARK-22790] add a configurable factor to describe HadoopFsRelation's size - Spark - [issue]
...as per discussion in https://github.com/apache/spark/pull/19864#discussion_r156847927the current HadoopFsRelation is purely based on the underlying file size which is not accurate and makes ...
http://issues.apache.org/jira/browse/SPARK-22790    Author: Nan Zhu , 2018-01-13, 18:37
[expand - 1 more] - Palantir replease under org.apache.spark? - Spark - [mail # user]
...nvmOn Tue, Jan 9, 2018 at 9:42 AM, Nan Zhu  wrote:> Hi, all>> Out of curious, I just found a bunch of Palantir release under> org.apache.spark in maven central (https://mvnr...
   Author: Nan Zhu , 2018-01-09, 17:58
[SPARK-22599] Avoid extra reading for cached table - Spark - [issue]
...In the current implementation of Spark, InMemoryTableExec read all data in a cached table, filter CachedBatch according to stats and pass data to the downstream operators. This implementatio...
http://issues.apache.org/jira/browse/SPARK-22599    Author: Nan Zhu , 2017-12-25, 02:38
Request for review of SPARK-22599 - Spark - [mail # dev]
...Hi, allWhen we do perf test for Spark, we found that enabling table cache does notbring the expected speedup comparing to cloud-storage + parquet in manyscenarios. We identified that the per...
   Author: Nan Zhu , 2017-11-29, 19:04