sclukas77 commented on a change in pull request #12341:
URL: https://github.com/apache/beam/pull/12341#discussion_r469973877



##########
File path: sdks/java/extensions/sql/src/main/java/org/apache/beam/sdk/extensions/sql/meta/provider/SchemaIOTableProviderWrapper.java
##########
@@ -80,13 +80,17 @@ public BeamSqlTable buildBeamSqlTable(Table tableDefinition) {
     }
   }
 
-  private BeamTableStatistics getTableStatistics(PipelineOptions options) {
+  public BeamTableStatistics getTableStatistics(PipelineOptions options) {
     if (isBounded().equals(PCollection.IsBounded.BOUNDED)) {
       return BeamTableStatistics.BOUNDED_UNKNOWN;
     }
     return BeamTableStatistics.UNBOUNDED_UNKNOWN;
   }
 
+  public BeamTableStatistics getTableStatistics(PipelineOptions options, SchemaIO schemaIO) {
+    return getTableStatistics(options);
+  }
+

Review comment:
       I added this second getTableStatistics() overload because DataStoreV1 is 
the first IO whose getTableStatistics() relies on the schemaIO data; with the 
overload in place, SchemaIOTableWrapper#getTableStatistics can be overridden in 
DataStoreV1TableProvider. None of the other IOs so far have needed the 
schemaIO. Do you think we should drop the one-argument getTableStatistics() and 
always require a schemaIO, even when it isn't strictly necessary? Or should we 
support both cases?
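
To make the "support both" option concrete, here is a minimal, self-contained sketch of the pattern. Every name in it (OverloadSketch, FakeSchemaIO, BaseProvider, IoAwareProvider) is a hypothetical stand-in, not a real Beam class; statistics are modeled as plain strings and the PipelineOptions argument is left out for brevity. It only illustrates how the default delegation and a single override interact, not how the PR implements it.

```java
// Hypothetical stand-ins only; none of these are the real Beam classes.
final class OverloadSketch {
  /** Stand-in for an IO handle that may know something about its data. */
  static final class FakeSchemaIO {
    final long estimatedRows;

    FakeSchemaIO(long estimatedRows) {
      this.estimatedRows = estimatedRows;
    }
  }

  /** Stand-in for the wrapper: the IO-aware overload delegates by default. */
  static class BaseProvider {
    private final boolean bounded;

    BaseProvider(boolean bounded) {
      this.bounded = bounded;
    }

    String getTableStatistics() {
      return bounded ? "BOUNDED_UNKNOWN" : "UNBOUNDED_UNKNOWN";
    }

    // Default: ignore the IO handle, so existing providers need no changes.
    String getTableStatistics(FakeSchemaIO schemaIO) {
      return getTableStatistics();
    }
  }

  /** Stand-in for a provider whose statistics depend on the IO handle. */
  static final class IoAwareProvider extends BaseProvider {
    IoAwareProvider() {
      super(true);
    }

    @Override
    String getTableStatistics(FakeSchemaIO schemaIO) {
      return "BOUNDED_ROWS(" + schemaIO.estimatedRows + ")";
    }
  }

  public static void main(String[] args) {
    FakeSchemaIO io = new FakeSchemaIO(1000L);
    // A provider that does not override falls back to the IO-free result.
    System.out.println(new BaseProvider(false).getTableStatistics(io));
    // A provider that overrides uses the IO handle.
    System.out.println(new IoAwareProvider().getTableStatistics(io));
  }
}
```

In this sketch the caller always invokes the IO-aware overload, so only providers that actually need the schemaIO have to override anything. The alternative raised in the question (a single method that always takes a schemaIO) would remove the delegation step at the cost of an unused parameter in most providers.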





