Results 1 to 10 of 408 (0.0s).
[SPARK-23258] Should not split Arrow record batches based on row count - Spark - [issue]
...Currently when executing scalar pandas_udf or using toPandas() the Arrow record batches are split up once the record count reaches a max value, which is configured with "spark.sql.execution....
http://issues.apache.org/jira/browse/SPARK-23258    Author: Bryan Cutler , 2020-07-11, 00:31
[ARROW-9357] [Java] Document how to set netty/unsafe allocators - Arrow - [issue]
...There are now 2 allocators available, one based on netty and one using unsafe apis. We should provide end-user documentation on which one is default and how to set and use each one....
http://issues.apache.org/jira/browse/ARROW-9357    Author: Bryan Cutler , 2020-07-07, 18:20
[ARROW-9356] [Java] Remove Netty dependency from arrow-vector - Arrow - [issue]
...Cleanup remaining usage of Netty from arrow-vector and remove as a dependency after ARROW-9300....
http://issues.apache.org/jira/browse/ARROW-9356    Author: Bryan Cutler , 2020-07-07, 18:15
[SPARK-32162] Improve Pandas Grouped Map with Window test output - Spark - [issue]
...The output of GroupedMapInPandasTests.test_grouped_over_window_with_key is not helpful, only gives ====================================================================== FAIL: test_grouped_ov...
http://issues.apache.org/jira/browse/SPARK-32162    Author: Bryan Cutler , 2020-07-06, 12:41
[SPARK-24554] Add MapType Support for Arrow in PySpark - Spark - [issue]
...Add support for MapType in Arrow related classes in Scala/Java and pyarrow functionality in Python....
http://issues.apache.org/jira/browse/SPARK-24554    Author: Bryan Cutler , 2020-06-26, 16:51
[VOTE] Add Decimal::bitWidth field to Schema.fbs for forward compatibility - Arrow - [mail # dev]
...+1 On Wed, Jun 24, 2020, 10:38 AM Francois Saint-Jacques <[EMAIL PROTECTED]> wrote: > +1 (binding) >...
   Author: Bryan Cutler , 2020-06-25, 16:03
[SPARK-32080] Simplify ArrowColumnVector ListArray accessor - Spark - [issue]
...The ArrowColumnVector ListArray accessor calculates start and end offset indices manually. There were APIs added in Arrow 0.15.0 that do this and using them will simplify this code and make ...
http://issues.apache.org/jira/browse/SPARK-32080    Author: Bryan Cutler , 2020-06-24, 13:16
[ANNOUNCE] New Arrow committers: Ji Liu and Liya Fan - Arrow - [mail # dev]
...Congratulations! On Thu, Jun 11, 2020, 9:29 PM Fan Liya wrote: > Dear all, >> I want to thank you all for all your kind help. > It is a great honor to work with you in this gre...
   Author: Bryan Cutler , 2020-06-12, 14:34
[SPARK-31964] Avoid Pandas import for CategoricalDtype with Arrow conversion - Spark - [issue]
...The import for CategoricalDtype changed in Pandas from 0.23 to 1.0 and currently pyspark checks 2 places to import. It would be better to check the type as a string and avoid any imports....
http://issues.apache.org/jira/browse/SPARK-31964    Author: Bryan Cutler , 2020-06-11, 04:29
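The idea in the SPARK-31964 snippet above, checking the type as a string instead of importing it, can be sketched roughly as follows. This is a minimal illustration and not the change made in Spark: the helper `is_categorical` and the two stand-in classes are hypothetical names, and the stand-ins exist only so the sketch runs without pandas installed.

```python
def is_categorical(dtype) -> bool:
    """Return True if dtype's class is named CategoricalDtype.

    Comparing the class name as a string avoids importing pandas,
    so the check does not depend on where pandas exposes the class.
    """
    return type(dtype).__name__ == "CategoricalDtype"


# Hypothetical stand-ins for pandas dtype classes (assumption: a real
# pandas categorical dtype object has a class named CategoricalDtype).
class CategoricalDtype:
    pass


class Int64Dtype:
    pass


if __name__ == "__main__":
    print(is_categorical(CategoricalDtype()))
    print(is_categorical(Int64Dtype()))
```

A name-based check like this trades strictness for portability: any class with the same name would match, which may be acceptable when the only goal is to avoid version-dependent imports.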
[SPARK-25351] Handle Pandas category type when converting from Python with Arrow - Spark - [issue]
...There needs to be some handling of category types done when calling createDataFrame with Arrow or the return value of pandas_udf.  Without Arrow, Spark casts each element to the categor...
http://issues.apache.org/jira/browse/SPARK-25351    Author: Bryan Cutler , 2020-06-11, 00:04