Seemingly wasteful memory duplication in LDAModel getTopicDistributionMethod() - Spark - [mail # user]
...In my usage of MLLib's LDA, I have noticed that repeated invocations of LDAModel.transform() result in the duplication of a matrix derived from the model's topic matrix. Because this derived m...
   Author: Andrew Mathis , 2019-02-22, 22:39
[SPARK-26774] Document threading concerns in TaskSchedulerImpl - Spark - [issue]
...TaskSchedulerImpl has a couple of places where threading concerns aren't clearly documented, which could be improved a little.  There is also a race in killTaskAttempt on taskIdToExecutorId (tho...
http://issues.apache.org/jira/browse/SPARK-26774    Author: Imran Rashid , 2019-02-22, 22:30
How can I parse an "unnamed" json array present in a column? - Spark - [mail # user]
...I have an "unnamed" json array stored in a *column*.  The format is the following : column name : newsData : [  {    "source": "source1",    "name": "News site1...
   Author: Yeikel , 2019-02-22, 22:15
[DISCUSS] Spark 3.0 and DataSourceV2 - Spark - [mail # dev]
...>> To your other message: I already see a number of PMC members here. Who's the other entity? I'll answer indirectly since pointing fingers isn't really my intent. In the absence...
   Author: Mark Hamstra , Sean Owen , ... , 2019-02-22, 21:56
[SPARK-26950] Make RandomDataGenerator use Float.NaN or Double.NaN for all NaN values - Spark - [issue]
...Apache Spark uses the predefined `Float.NaN` and `Double.NaN` for NaN values, but there exist more NaN values with different binary representations. scala> java.nio.ByteBuffer.allocate(4).p...
http://issues.apache.org/jira/browse/SPARK-26950    Author: Dongjoon Hyun , 2019-02-22, 21:55
[SPARK-22860] Spark workers log ssl passwords passed to the executors - Spark - [issue]
...The workers log the spark.ssl.keyStorePassword and spark.ssl.trustStorePassword passed by cli to the executor processes. The ExecutorRunner should escape passwords to not appear in the worke...
http://issues.apache.org/jira/browse/SPARK-22860    Author: Felix K. , 2019-02-22, 21:38
[SPARK-26975] Support nested-column pruning over limit/sample/repartition - Spark - [issue]
...As the benchmark in SPARK-26958 shows, nested-column pruning has limitations. This issue aims to remove the limitations on `limit/repartition/sample`. In this issue, repartition means `Repartit...
http://issues.apache.org/jira/browse/SPARK-26975    Author: Dongjoon Hyun , 2019-02-22, 21:37
[SPARK-24238] HadoopFsRelation can't append the same table with multi job at the same time. - Spark - [issue]
...When multiple tasks append to a HadoopFsRelation at the same time, one of the following two errors occurs: 1. A task will succeed, but the data will be wrong and mor...
http://issues.apache.org/jira/browse/SPARK-24238    Author: yangz , 2019-02-22, 21:13
[SPARK-26895] When running spark 2.3 as a proxy user (--proxy-user), SparkSubmit fails to resolve globs owned by target user - Spark - [issue]
...We are resolving globs in SparkSubmit here (by way of prepareSubmitEnvironment) without first going into a doAs: https://github.com/apache/spark/blob/6c18d8d8079ac4d2d6dc7539601ab83fc5b51760/...
http://issues.apache.org/jira/browse/SPARK-26895    Author: Alessandro Bellina , 2019-02-22, 19:15
Detect data from textFile RDD - Spark - [mail # user]
...Hey, I am working with spark source code. I am printing logs within the code to understand how hadoopRDD works. I want to print a timestamp when the executor first reads the textFile RDD (input s...
   Author: swastik mittal , 2019-02-22, 19:08