GitHub user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21742#discussion_r201569743

    --- Diff: dev/sparktestsupport/modules.py ---
    @@ -170,6 +170,16 @@ def __hash__(self):
         ]
     )

    +avro = Module(
    --- End diff --

    This is much cleaner, like what we did for kafka, which is also a built-in data source. Ideally, we should separate parquet, orc, and other built-in data sources from the sql module. We can do that refactoring in the future, if needed.
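
To illustrate why a dedicated module is cleaner, here is a rough sketch of how a per-source module can narrow the set of tests selected for a change. The `Module` class below is a simplified, hypothetical stand-in and not the real class from dev/sparktestsupport/modules.py; the field names, path patterns, test goals, and the `modules_for_files` helper are assumptions made for illustration only.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    """Simplified, hypothetical stand-in for a test-support module definition."""
    name: str
    dependencies: tuple = ()
    source_file_regexes: tuple = ()
    sbt_test_goals: tuple = ()

    def contains_file(self, filename):
        # A file belongs to the module if any pattern matches at the start of its path.
        return any(re.match(p, filename) for p in self.source_file_regexes)


# Assumed path patterns and test goals, for illustration only.
sql = Module(
    name="sql",
    source_file_regexes=("sql/",),
    sbt_test_goals=("sql/test",),
)

# A built-in data source defined as its own module that depends on sql.
avro = Module(
    name="avro",
    dependencies=(sql,),
    source_file_regexes=("external/avro",),
    sbt_test_goals=("avro/test",),
)


def modules_for_files(filenames, modules):
    """Return the names of the modules that own at least one changed file."""
    return {m.name for m in modules for f in filenames if m.contains_file(f)}
```

Under these assumptions, a change that only touches files under `external/avro` maps to the `avro` module alone, while a change under `sql/` maps to `sql`. Separating parquet and orc in the same way would presumably give them the same kind of isolation.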