[VOTE] SPIP: Standardize SQL logical plans - Spark - [mail # dev]
...+1 (non-binding) On 18 July 2018 at 17:32, Xiao Li wrote: > +1 (binding) > Like what Ryan and I discussed offline, the contents of implementation sketch is not part of this ...
   Author: Alessandro Solimando, 2018-07-18, 15:47
[Spark SQL] error in performing dataset union with complex data type (struct, list) - Spark - [mail # user]
...Hi Pranav, I don't have an answer to your issue, but what I generally do in these cases is to first try to simplify it to a point where it is easier to check what's going on, and then adding bac...
   Author: Alessandro Solimando, 2018-06-03, 14:36
spark sql StackOverflow - Spark - [mail # user]
...From the information you provided I would tackle this as a batch problem, because this way you have access to more sophisticated techniques and you have more flexibility (maybe HDFS and a Spar...
   Author: Alessandro Solimando, 2018-05-15, 09:48
A bug triggered by a particular sequence of "select", "groupby" and "join" in Spark 2.3.0 - Spark - [mail # user]
...Hi Shiyuan, can you show us the output of "explain" over df (as a last step)? On 11 April 2018 at 19:47, Shiyuan wrote: > Variable name binding is a Python thing, and Spark should not ...
   Author: Alessandro Solimando, 2018-04-11, 17:56
Union of multiple data frames - Spark - [mail # user]
...Hello Cesar, can you add some details like: number of columns, avg number of rows in the DFs, time spent to compute the plan with all the unions, and the time needed to perform the action? Thank...
   Author: Alessandro Solimando, 2018-04-06, 07:31
K Means Clustering Explanation - Spark - [mail # user]
...Hi Matt, unfortunately I have no code pointer at hand. I will sketch how to accomplish this via the API, it will for sure at least help you getting started. 1) ETL + vectorization (I assume your...
   Author: Alessandro Solimando, 2018-03-04, 13:09
redundant decision tree model - Spark - [mail # dev]
...Hello, a small recap for who is interested. There was already a ticket covering the case that I failed to find when I checked. As a result the other one has been correctly marked as duplicate: ht...
   Author: Alessandro Solimando, 2018-02-17, 05:44
[SPARK-23409] RandomForest/DecisionTree (syntactic) pruning of redundant subtrees - Spark - [issue]
...Improvement: redundancy elimination from decision trees where all the leaves of a given subtree share the same prediction. Benefits: model interpretability; faster unitary model invocation (re...
http://issues.apache.org/jira/browse/SPARK-23409    Author: Alessandro Solimando, 2018-02-16, 21:30
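The idea summarized in the SPARK-23409 snippet (collapsing a subtree whose leaves all share the same prediction) can be illustrated with a minimal sketch. This is not Spark's implementation: the `Leaf`, `Split`, and `prune` names below are hypothetical, chosen only for illustration, and the tree is a plain binary tree rather than Spark's internal node classes.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Leaf:
    prediction: float


@dataclass
class Split:
    feature: int        # hypothetical: index of the feature tested at this node
    threshold: float    # hypothetical: go left if value <= threshold
    left: "Node"
    right: "Node"


Node = Union[Leaf, Split]


def prune(node: Node) -> Node:
    """Collapse every subtree whose leaves all carry the same prediction."""
    if isinstance(node, Leaf):
        return node
    # Prune bottom-up: once both children are pruned, a subtree with
    # uniform leaves appears as two leaves with equal predictions.
    left = prune(node.left)
    right = prune(node.right)
    if (
        isinstance(left, Leaf)
        and isinstance(right, Leaf)
        and left.prediction == right.prediction
    ):
        return Leaf(left.prediction)
    return Split(node.feature, node.threshold, left, right)
```

The pruned tree returns the same prediction for every input as the original, since only splits that cannot change the outcome are removed.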
transformSchema method policy for "duplicated" column names - Spark - [mail # dev]
...Hello everyone, after one month without any reply on stackoverflow (https://stackoverflow.com/questions/47789265/inconsistency-in-handling-duplicate-names-in-dataframe-schema) I try to pose th...
   Author: Alessandro Solimando, 2018-01-13, 14:03