clear query| facets| time Search criteria: .   Results from 1 to 10 from 58 (0.0s).
Loading phrases to help you
refine your search...
[SPARK-20144] no long maintains ordering of the data - Spark - [issue]
...Hi, We are trying to upgrade Spark from 1.6.3 to 2.0.2. One issue we found is when we read parquet files in 2.0.2, the ordering of rows in the resulting dataframe is not the same as the orde...    Author: Li Jin , 2018-10-15, 21:27
[SPARK-25640] Clarify/Improve EvalType for grouped aggregate and window aggregate - Spark - [issue]
...Currently, grouped aggregate and window aggregate uses different EvalType, however, they map to the same user facing type PandasUDFType.GROUPED_MAP.It makes sense to have one user facing typ...    Author: Li Jin , 2018-10-10, 05:50
[expand - 2 more] - [DISCUSS] PySpark Window UDF - Spark - [mail # dev]
...Thanks Wes and Felix!I have finished the initial development work and the PR is in a good statefor review (have pinged a couple of people to review this too). I amexcited to work with the co...
   Author: Li Jin , 2018-09-20, 17:49
[SPARK-24561] User-defined window functions with pandas udf (bounded window) - Spark - [issue]    Author: Li Jin , 2018-08-31, 15:24
[DISCUSS] move away from python doctests - Spark - [mail # dev]
...Hi Imran,My understanding is that doctests and unittests are orthogonal - doctestsare used to make sure docstring examples are correct and are not meant toreplace unittests.Functionalities a...
   Author: Li Jin , 2018-08-29, 19:02
[SPARK-25213] DataSourceV2 doesn't seem to produce unsafe rows - Spark - [issue]
...Reproduce (Need to compile test-classes):bin/pyspark --driver-class-path sql/core/target/scala-2.11/test-classesdatasource_v2_df = \             ...    Author: Li Jin , 2018-08-28, 13:32
[SPARK-25216] Provide better error message when a column contains dot and needs backticks quote - Spark - [issue]
...The current error message is  often confusing to a new Spark user that a column containing "." needs backticks quote. For example, consider the following code:spark.range(0, 1).toDF('a.b')['...    Author: Li Jin , 2018-08-23, 18:46
[discuss][minor] impending python 3.x jenkins upgrade... 3.5.x? 3.6.x? - Spark - [mail # dev]
...Thanks for looking into this Shane. If we can only have a single python 3version, I agree 3.6 would be better than 3.5. Otherwise, ideally I thinkit would be nice to test all supported 3.x v...
   Author: Li Jin , 2018-08-20, 20:30
code freeze and branch cut for Apache Spark 2.4 - Spark - [mail # dev]
...I agree with Byran. If it's acceptable to have another job to test withPython 3.5 and pyarrow 0.10.0, I am leaning towards upgrading arrow.Arrow 0.10.0 has tons of bug fixes and improves fro...
   Author: Li Jin , 2018-08-10, 17:59
[SPARK-23633]  Update Pandas UDFs section in sql-programming-guide - Spark - [issue]
...Let's make sure sql-programming-guide is up-to-date before 2.4 release.    Author: Li Jin , 2018-07-31, 02:12