[
https://issues.apache.org/jira/browse/PIG-966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12781692#action_12781692
]
Dmitriy V. Ryaboy commented on PIG-966:
---------------------------------------
LoadFunc has a method called determineSchema, not getSchema. This implies some
sort of introspection, so I can see interpreting this as "if you are looking at
the data, use determineSchema, and if you have a metadata store/repo then
implement LoadMetadata".
But I agree this is clunky and potentially confusing.
I am of two minds about this. On one hand, moving the method make sense as it's
metadata-related. On the other hand, it makes implementations that work with
self-describing formats like Avro implement a heavy-looking interface, and
requires further changes to existing LoadFunc implementations that will have to
be ported.
Another issue is that LoadMetadata.getSchema() returns a ResourceSchema,
whereas LoadFunc.determineSchema() returns Pig's Schema. The two are compatible
(I have a translation from one to the other in PIG-760), but not the same.
> Proposed rework for LoadFunc, StoreFunc, and Slice/r interfaces
> ---------------------------------------------------------------
>
> Key: PIG-966
> URL: https://issues.apache.org/jira/browse/PIG-966
> Project: Pig
> Issue Type: Improvement
> Components: impl
> Reporter: Alan Gates
> Assignee: Alan Gates
>
> I propose that we rework the LoadFunc, StoreFunc, and Slice/r interfaces
> significantly. See http://wiki.apache.org/pig/LoadStoreRedesignProposal for
> full details
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.