Hi Tao,

This Jira issue [1] looks like exactly what you are asking for. It was merged only recently (thanks to Anant Damle for working on this!), so it should be available starting with Beam 2.28.0.

[1] https://issues.apache.org/jira/browse/BEAM-11460

Regards,
Alexey

> On 6 Jan 2021, at 18:57, Tao Li <[email protected]> wrote:
>
> Hi beam community,
>
> Quick question about ParquetIO
> <https://beam.apache.org/releases/javadoc/2.25.0/org/apache/beam/sdk/io/parquet/ParquetIO.html>.
> Is there a way to avoid specifying the avro schema when reading parquet
> files? The reason is that we may not know the parquet schema until we read
> the files. In comparison, spark parquet reader
> <https://spark.apache.org/docs/latest/sql-data-sources-parquet.html> does not
> require such a schema specification.
>
> Please advise. Thanks a lot!
