I recommend trying different values using the parquet-cli. It's an easy
way to see how different row group and page sizes perform, and it's what
I do to tune all of our tables.
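
As a rough sketch of that kind of sweep, the script below builds one
parquet-cli `convert` command per row group / page size combination and,
if the tool is available, runs them and prints the resulting file sizes.
Everything specific here is an assumption for illustration: the file name
`sample.parquet`, the `tuning_out` directory, the candidate sizes, and
the `--row-group-size` / `--page-size` / `-o` options, which should be
checked against `parquet help convert` for your version. File size is
only one signal; scan time on real queries matters too.

```python
import itertools
import shutil
import subprocess
from pathlib import Path

MB = 1024 * 1024
KB = 1024


def build_commands(source, out_dir, row_group_sizes, page_sizes):
    """Return (output_path, argv) pairs, one per size combination.

    The option names are assumed from parquet-cli's `convert` command;
    confirm them with `parquet help convert` before relying on this.
    """
    commands = []
    for rg, pg in itertools.product(row_group_sizes, page_sizes):
        out = Path(out_dir) / f"rg{rg}_pg{pg}.parquet"
        argv = [
            "parquet", "convert", str(source),
            "-o", str(out),
            "--row-group-size", str(rg),
            "--page-size", str(pg),
        ]
        commands.append((out, argv))
    return commands


def run_all(commands):
    """Run each conversion and record the output file size in bytes."""
    sizes = {}
    for out, argv in commands:
        subprocess.run(argv, check=True)
        sizes[out.name] = out.stat().st_size
    return sizes


if __name__ == "__main__":
    # Hypothetical input file and candidate sizes; substitute your own.
    cmds = build_commands(
        "sample.parquet", "tuning_out",
        row_group_sizes=[64 * MB, 128 * MB, 256 * MB],
        page_sizes=[512 * KB, 1 * MB],
    )
    if shutil.which("parquet") and Path("sample.parquet").exists():
        Path("tuning_out").mkdir(exist_ok=True)
        for name, size in sorted(run_all(cmds).items()):
            print(name, size)
    else:
        # Tool or sample file missing: just show what would be run.
        for _, argv in cmds:
            print(" ".join(argv))
```

Using a representative slice of a real table as the input should give
more meaningful numbers than synthetic data.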

rb

On Fri, Jan 12, 2018 at 10:43 AM, ALeX Wang <[EMAIL PROTECTED]> wrote:

--
Ryan Blue
Software Engineer
Netflix