Search criteria: author:"Tathagata Das".
Results 41 to 50 of 1165 (0.0s).
Nullpointerexception error when in repartition - Spark - [mail # user]
...It's not very surprising that doing this sort of RDD to DF conversion inside DStream.foreachRDD has weird corner cases like this. In fact, you are going to have additional problems with partia...

Nullpointerexception error when in repartition - Spark - [mail # user]
...Have you read through the documentation of Structured Streaming? https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html One of the basic mistakes you are making is de...

Structured Streaming on Kubernetes - Spark - [mail # user]
...Structured streaming is stable in production! At Databricks, we and our customers collectively process almost 100s of billions of records per day using SS. However, we are not using kubernetes...

Does partition by and order by works only in stateful case? - Spark - [mail # user]
...The traditional SQL windows with `over` is not supported in streaming. Only time-based windows, that is, `window("timestamp", "10 minutes")` is supported in streaming. On Thu, Apr 12, 2018 at 7...

can we use mapGroupsWithState in raw sql? - Spark - [mail # user]
...Unfortunately no. Honestly it does not make sense as for type-aware operations like map, mapGroups, etc., you have to provide an actual JVM function. That does not fit in with the SQL language...

How to read json data from kafka and store to hdfs with spark structued streaming? - Spark - [mail # user]
...Are you writing multiple streaming query output to the same location? If so, I can see this error occurring. Multiple streaming queries writing to the same directory is not supported. On Tue, J...

Exceptions with simplest Structured Streaming example - Spark - [mail # user]
...Unfortunately, your output is not visible in the email that we see. Was it an image that some got removed? Maybe best to copy the output text (i.e. the error message) into the email. On Thu, Ju...

How to execute non-timestamp-based aggregations in spark structured streaming? - Spark - [mail # user]
...SQL windows with the 'over' syntax does not work in Structured Streaming. It is very hard to incrementalize that in the general case. Hence non-time windows are not supported. On Sat, Apr 20, 2...

Spark streaming join on yarn - Spark - [mail # user]
...How many tasks in the stage 2? How long do they take? If there are 200 tasks taking 1 second each (so many "rounds" of tasks on available cores taking 13 seconds), then you can reduce the numb...

Spark Streaming ZeroMQ Java Example - Spark - [mail # user]
...Unfortunately there isn't one right now. But it is probably too hard to start with the JavaNetworkWordCount, and use the ZeroMQUtils in the same way as the Scala ZeroMQWordCount example. Basi...

[bug?] streaming window unexpected behaviour - Spark - [mail # user]
...You can probably do it in a simpler but sort of hacky way! If your window size is W and sliding interval S, you can do some math to figure out how many of the first windows are actuall...

[bug?] streaming window unexpected behaviour - Spark - [mail # user]
...Yes, I believe that is current behavior. Essentially, the first few RDDs will be partial windows (assuming window duration > sliding interval). TD On Mon, Mar 24, 2014 at 1:1...