damccorm opened a new issue, #20770:
URL: https://github.com/apache/beam/issues/20770

   There are multiple issues with the `withProjection` API:
   
   1. The current API requires an extra encoderSchema that is not needed when 
projecting data in Parquet. The simplest way to project with the Parquet API 
is to pass the projectionSchema as both the read schema and the requested 
projection:
   > AvroReadSupport.setAvroReadSchema(conf, projectionSchema);
   > AvroReadSupport.setRequestedProjection(conf, projectionSchema);
   > We can offer an alternative method `withProjection(Configuration conf, 
List<String> fields)` so users don't have to build their own projection 
schema, but historically we have let users rely on the upstream connector 
API. If we follow that approach, we can document in ParquetIO how to project 
fields using the Parquet APIs and avoid maintaining this extra code on 
the Beam side.
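
For illustration, here is a rough sketch of what users could do on their own with only the Avro and Parquet APIs. The class `ProjectionSketch` and the helpers `projectionOf` and `configureProjection` are hypothetical names made up for this example and are not part of Beam or Parquet. The sketch assumes a flat record schema and only projects top-level fields.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

import org.apache.avro.Schema;
import org.apache.avro.SchemaBuilder;
import org.apache.hadoop.conf.Configuration;
import org.apache.parquet.avro.AvroReadSupport;

public class ProjectionSketch {

  // Hypothetical helper: builds a projection schema that keeps only the
  // named top-level fields of the full record schema.
  static Schema projectionOf(Schema full, List<String> fields) {
    List<Schema.Field> kept = new ArrayList<>();
    for (String name : fields) {
      Schema.Field f = full.getField(name);
      if (f == null) {
        throw new IllegalArgumentException("Unknown field: " + name);
      }
      // A Field instance cannot be attached to a second record schema,
      // so each selected field is copied.
      kept.add(new Schema.Field(f.name(), f.schema(), f.doc(), f.defaultVal()));
    }
    return Schema.createRecord(
        full.getName(), full.getDoc(), full.getNamespace(), full.isError(), kept);
  }

  // Hypothetical helper: applies the two calls quoted in this issue.
  static Configuration configureProjection(Configuration conf, Schema projectionSchema) {
    AvroReadSupport.setAvroReadSchema(conf, projectionSchema);
    AvroReadSupport.setRequestedProjection(conf, projectionSchema);
    return conf;
  }

  public static void main(String[] args) {
    // Example schema, assumed only for this sketch.
    Schema full = SchemaBuilder.record("User").namespace("example").fields()
        .requiredLong("id")
        .requiredString("name")
        .optionalString("email")
        .endRecord();

    Schema projection = projectionOf(full, Arrays.asList("id", "name"));
    Configuration conf = configureProjection(new Configuration(false), projection);

    // Print the projection schema that was stored in the configuration.
    System.out.println(conf.get(AvroReadSupport.AVRO_REQUESTED_PROJECTION));
  }
}
```

The resulting `Configuration` would then be handed to whatever reader is in use. How that is wired into ParquetIO is exactly the part this issue proposes to document rather than wrap.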
   
    
   
   Imported from Jira 
[BEAM-11908](https://issues.apache.org/jira/browse/BEAM-11908). Original Jira 
may contain additional context.
   Reported by: iemejia.


