Search criteria: author:"Wenchen Fan". Results 1 to 10 of 120 (0.0s).
RDD object Out of scope. - Spark - [mail # dev]
...RDD is kind of a pointer to the actual data. Unless it's cached, we don't need to clean up the RDD. On Tue, May 21, 2019 at 1:48 PM Nasrulla Khan Haris wrote: > HI Spark developers, >...
   Author: Wenchen Fan, 2019-05-21, 13:28
Access to live data of cached dataFrame - Spark - [mail # user]
...When you cache a dataframe, you actually cache a logical plan. That's why re-creating the dataframe doesn't work: Spark finds out the logical plan is cached and picks the cached data. You need ...
   Author: Wenchen Fan, 2019-05-21, 13:36
[VOTE] Release Apache Spark 2.4.3 - Spark - [mail # dev]
...+1. The Scala version problem has been resolved, which is the main motivation of 2.4.3. On Mon, May 6, 2019 at 12:38 AM Felix Cheung wrote: > I ran basic tests on R, r-hub etc. LGTM. >> ...
   Author: Wenchen Fan, 2019-05-06, 08:14
[DISCUSS] Spark Columnar Processing - Spark - [mail # dev]
...Do you have some initial perf numbers? It seems fine to me to remain row-based inside Spark with whole-stage-codegen, and convert rows to columnar batches when communicating with external syst...
   Author: Wenchen Fan, 2019-03-26, 04:53
[VOTE] Release Apache Spark 2.4.1 (RC9) - Spark - [mail # dev]
...+1, all the known blockers are resolved. Thanks for driving this! On Wed, Mar 27, 2019 at 11:31 AM DB Tsai wrote: > Please vote on releasing the following candidate as Apache Spark ve...
   Author: Wenchen Fan, 2019-03-27, 19:37
[VOTE] Release Apache Spark 2.4.2 - Spark - [mail # dev]
...ah I didn't know that branch 2.4 still use Scala 2.11 as default. I thought we've switched to Scala 2.12 when we deprecate Scala 2.11 in 2.4.1. If people complain we can do 2.4.3 quickly. Thank...
   Author: Wenchen Fan, 2019-04-26, 06:22
Spark 2.4.2 - Spark - [mail # dev]
...I volunteer to be the release manager for 2.4.2, as I was also going to propose 2.4.2 because of the reverting of SPARK-25250. Is there any other ongoing bug fixes we want to include in 2.4.2?...
   Author: Wenchen Fan, 2019-04-18, 00:50
DataFrameWriter does not adjust spark.sql.session.timeZone offset while writing orc files - Spark - [mail # user]
...How did you read/write the timestamp value from/to ORC file? On Wed, Apr 24, 2019 at 6:30 PM Shubham Chaurasia wrote: > Hi All, >> Consider the following (spark v2.4.0): >> Basical...
   Author: Wenchen Fan, 2019-04-24, 12:54
Moving forward with the timestamp proposal - Spark - [mail # dev]
...I think this is the right direction to go, but I'm wondering how can Spark support these new types if the underlying data sources (like parquet files) do not support them yet. I took a quick loo...
   Author: Wenchen Fan, 2019-02-21, 07:32
[DISCUSS] Spark 3.0 and DataSourceV2 - Spark - [mail # dev]
...I'm good with the list from Ryan, thanks! On Thu, Feb 28, 2019 at 1:00 AM Ryan Blue wrote: > I think that's a good plan. Let's get the functionality done, but mark it > experimental...
   Author: Wenchen Fan, 2019-02-28, 01:32