Python supports tabular structures using pyarrow.

https://arrow.apache.org/docs/python/generated/pyarrow.schema.html

For nested structures like JSON you have to use C++ (parquet-cpp)

https://github.com/apache/parquet-cpp

We need more APIs developed to create nested JSON..

-----Original Message-----
From: Divya Gehlot [mailto:[EMAIL PROTECTED]]
Sent: Tuesday, June 12, 2018 5:25 AM
To: [EMAIL PROTECTED]
Subject: Re: Which perform better JSON or convert JSON to parquet format ?

[EXTERNAL EMAIL]
Hi David,
How to create the schema first using parquet library ?
Can you please give an example?

Thanks,
Divya

On Tue, 12 Jun 2018 at 00:03, Lee, David <[EMAIL PROTECTED]> wrote: