AjayBoddeda4 commented on issue #542:
URL: https://github.com/apache/wayang/issues/542#issuecomment-4105475052
Hi, I am Ajay Boddeda, a GSoC 2026 applicant working on the DataFrames API
proposal for Apache Wayang.
This issue is directly relevant to my proposal. In the DataFrame API I am
designing, the Schema class maps column names to their types and positions —
which is exactly what would enable retrieving data by field name rather than by
index.
In the Row class I am planning to implement, the getAttribute(String
columnName) method would use the Schema to look up the correct index
internally. This way users never need to know the index — they just use column
names like df.select('age') and the Schema handles the position lookup
automatically.
This approach makes the DataFrame API much more user friendly and aligns
with how Spark Dataset[Row] works. Would love to contribute to this as part of
the broader DataFrame API work.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]