kennknowles opened a new issue, #18647: URL: https://github.com/apache/beam/issues/18647
Currently, `BeamRecord` has utility methods that get field values by name, e.g. `BeamRecord.getFieldValue(String name)`. Internally they call `fieldNamesArrayList.indexOf(fieldName)` to find the index of the field. This works as long as the record has only one field with that name. But joining two records can produce duplicate field names, with no way to distinguish them or to get the value of a specific field by name. `BeamRecordType` keeps no metadata that would help identify a field in this case, which can lead to obscure bugs.

We should probably keep more detailed schema information attached to the fields, so that they can be referenced with qualifiers like "[schemaA].[pcollectionB].[fieldC]".

Imported from Jira [BEAM-3574](https://issues.apache.org/jira/browse/BEAM-3574). Original Jira may contain additional context.
Reported by: kedin.
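
To illustrate the ambiguity, here is a minimal, self-contained sketch. `JoinedRecordSketch` is a hypothetical stand-in, not the real `BeamRecord` or `BeamRecordType` API. It assumes that field names are stored in a list and looked up with `indexOf`, as the issue describes. The per-field qualifier, along with the table and column names, is an assumption made only to show how a qualified lookup could disambiguate.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a joined record; not the real BeamRecord API.
class JoinedRecordSketch {
    // Assumed extra metadata: which input each field came from.
    private final List<String> qualifiers = new ArrayList<>();
    private final List<String> fieldNames = new ArrayList<>();
    private final List<Object> values = new ArrayList<>();

    JoinedRecordSketch add(String qualifier, String fieldName, Object value) {
        qualifiers.add(qualifier);
        fieldNames.add(fieldName);
        values.add(value);
        return this;
    }

    // Lookup by bare name via indexOf: the first field with that name wins,
    // so any later field with the same name is unreachable.
    Object getFieldValue(String name) {
        int index = fieldNames.indexOf(name);
        if (index < 0) {
            throw new IllegalArgumentException("No such field: " + name);
        }
        return values.get(index);
    }

    // Sketch of a qualified lookup: both the qualifier and the name must match.
    Object getFieldValue(String qualifier, String name) {
        for (int i = 0; i < fieldNames.size(); i++) {
            if (qualifiers.get(i).equals(qualifier) && fieldNames.get(i).equals(name)) {
                return values.get(i);
            }
        }
        throw new IllegalArgumentException("No such field: " + qualifier + "." + name);
    }

    public static void main(String[] args) {
        // Simulates the result of joining "orders" with "customers".
        JoinedRecordSketch joined = new JoinedRecordSketch()
            .add("orders", "id", 1001)
            .add("orders", "customer_id", 7)
            .add("customers", "id", 7)
            .add("customers", "name", "alice");

        // Bare name silently resolves to orders.id.
        System.out.println(joined.getFieldValue("id"));              // prints 1001
        // Qualified name reaches the otherwise hidden customers.id.
        System.out.println(joined.getFieldValue("customers", "id")); // prints 7
    }
}
```

In this sketch the unqualified call never fails. It returns the first match, which is why the problem tends to surface as a wrong value rather than an error.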
