[
https://issues.apache.org/jira/browse/PIG-443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shubham Chopra updated PIG-443:
-------------------------------
Attachment: pig_types_24_09.patch
* I have fixed the connection issue. I think I had misunderstood the way
PigContext works. Using a proper pigContext fixed the connection problem.
* Deducing a schema from sample data is a bit of a problem. Consider the
following case:
a = load 'somefile' as (x, y, z);
b = filter a by some-condition;
c = foreach b generate x;
illustrate c;
If 'b' is an empty set, 'c' would also be an empty set. I wouldn't be able to
depend on sample data alone to deduce the schemas. I might end up using the
same logic as the getSchema methods to deduce the schema for 'c' in this case.
Also, schemas are used only while generating synthetic data and displaying the
final set of examples.
* That issue with cogroup I guess is again because of a no-schema load. Like
pointed out earlier, the augmentation phase where I try to generate synthetic
data used schemas.
I have done a modification in the code. I now check for schema while reading
the data and throw an exception if I don't find one asking users to provide a
schema.
> Illustrate for the Types branch
> -------------------------------
>
> Key: PIG-443
> URL: https://issues.apache.org/jira/browse/PIG-443
> Project: Pig
> Issue Type: New Feature
> Affects Versions: types_branch
> Reporter: Shubham Chopra
> Assignee: Shubham Chopra
> Fix For: types_branch
>
> Attachments: pig_types_22_09.patch, pig_types_23_09.patch,
> pig_types_24_09.patch
>
>
> This implementation of Illustrate uses visitors to traverse the plans.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.