[ https://issues.apache.org/jira/browse/PIG-1271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Dai updated PIG-1271:
----------------------------
Description:
With [PIG-613|https://issues.apache.org/jira/browse/PIG-613], we are able to
load text files containing complex data types (map/bag/tuple) according to a
schema. However, the format of a complex data field is very strict. Users have
to use predetermined special characters to mark the beginning and end of each
field, and those special characters cannot be used in the content. The goals
of this issue are:
1. Provide a way for users to escape special characters
2. Make it easy for users to customize Utf8StorageConverter when they have
their own data format
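
To make the first goal concrete, here is a minimal sketch of escape-aware
field splitting. It is an illustration only, not Pig's implementation: the
backslash escape character and the names split_escaped and parse_flat_tuple
are assumptions made for this example, and it handles only a flat
(non-nested) tuple.

```python
def split_escaped(text, delimiter=",", escape="\\"):
    """Split text on unescaped delimiters and drop the escape character.

    Hypothetical helper: the backslash escape is an assumption for this
    sketch, not a format that PigStorage is known to accept.
    """
    fields, current = [], []
    chars = iter(text)
    for ch in chars:
        if ch == escape:
            # Take the next character literally; a trailing escape is kept.
            current.append(next(chars, escape))
        elif ch == delimiter:
            # An unescaped delimiter ends the current field.
            fields.append("".join(current))
            current = []
        else:
            current.append(ch)
    fields.append("".join(current))
    return fields


def parse_flat_tuple(text, delimiter=",", escape="\\"):
    """Parse a non-nested tuple such as '(a,b)' into a list of strings."""
    if not (text.startswith("(") and text.endswith(")")):
        raise ValueError("tuple must be wrapped in parentheses")
    return split_escaped(text[1:-1], delimiter, escape)
```

Under these assumptions, parse_flat_tuple(r"(x\),y)") yields the two fields
"x)" and "y", so a closing parenthesis can appear inside the content. Passing
the delimiter and escape character as parameters hints at the second goal:
a user with a different data format could supply different markers.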
This is a candidate project for Google Summer of Code 2012. More information
about the program can be found at
https://cwiki.apache.org/confluence/display/PIG/GSoc2012
was:
With [PIG-613|https://issues.apache.org/jira/browse/PIG-613], we are able to
load text files containing complex data types (map/bag/tuple) according to a
schema. However, the format of a complex data field is very strict. Users have
to use predetermined special characters to mark the beginning and end of each
field, and those special characters cannot be used in the content. The goals
of this issue are:
1. Provide a way for users to escape special characters
2. Make it easy for users to customize Utf8StorageConverter when they have
their own data format
> Provide a more flexible data format to load complex field (bag/tuple/map) in
> PigStorage
> ---------------------------------------------------------------------------------------
>
> Key: PIG-1271
> URL: https://issues.apache.org/jira/browse/PIG-1271
> Project: Pig
> Issue Type: Bug
> Affects Versions: 0.7.0
> Reporter: Daniel Dai
> Assignee: Daniel Dai
> Labels: gsoc2012