Hi Tao,

This jira [1] looks exactly what you are asking but it was merged recently 
(thanks to Anant Damle for working on this!) and it should be available only in 
Beam 2.28.0.

[1] https://issues.apache.org/jira/browse/BEAM-11460

Regards,
Alexey

> On 6 Jan 2021, at 18:57, Tao Li <[email protected]> wrote:
> 
> Hi beam community,
>  
> Quick question about ParquetIO 
> <https://beam.apache.org/releases/javadoc/2.25.0/org/apache/beam/sdk/io/parquet/ParquetIO.html>.
>  Is there a way to avoid specifying the avro schema when reading parquet 
> files? The reason is that we may not know the parquet schema until we read 
> the files. In comparison, spark parquet reader 
> <https://spark.apache.org/docs/latest/sql-data-sources-parquet.html> does not 
> require such a schema specification.
>  
> Please advise. Thanks a lot!

Reply via email to