Hi,

parquet_writer.write_table(table)

This line writes a single file.
The documentation says:
This creates a single Parquet file. In practice, a Parquet dataset may
consist of many files in many directories. We can read a single file back
with read_table:

Is there a way for PyArrow to create a parquet file in the form of a
directory with multiple part files in it such as :

ls -lrt permit-inspections-recent.parquet
...  14:53 part-00001-bd5d902d-fac9-4e03-b63e-6a8dfc4060b6.snappy.parquet
...  14:53 part-00000-bd5d902d-fac9-4e03-b63e-6a8dfc4060b6.snappy.parquet

Regards,
Yash

Reply via email to