[ 
https://issues.apache.org/jira/browse/CHUKWA-316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eric Yang updated CHUKWA-316:
-----------------------------

    Attachment: CHUKWA-316.patch

The current behavior when loading a sequence file into the database:

     SQ          DB
     K1 ------> K1   (not defined in the dictionary: by default the field is
                      loaded if its name matches a database column)
     K2 ------> K1
     K3 ------> K1

To avoid this hidden default rule, we should explicitly define an empty 
mapping for K1 so that it is not loaded:

     K1 ------> ""
     K2 ------> K1
     K3 ------> K1
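
The rule above can be sketched in a few lines of Python. This is only an
illustration of the proposed lookup order, not the code in the attached patch:
the function names, the dictionary layout, and the sample values are all
made up for the example.

```python
def resolve_column(field, dictionary, db_columns):
    """Return the database column for a sequence-file field, or None to skip it.

    Hypothetical helper: `dictionary` maps field names to column names, and an
    empty string means "explicitly do not load this field".
    """
    if field in dictionary:
        # Explicit entry wins; "" disables loading for this field.
        return dictionary[field] or None
    # Default rule: load only if the field name matches a column name.
    return field if field in db_columns else None


def map_record(record, dictionary, db_columns):
    """Build a database row from one sequence-file record."""
    row = {}
    for field, value in record.items():
        column = resolve_column(field, dictionary, db_columns)
        if column is not None:
            row[column] = value  # a later field overwrites an earlier one
    return row


# Made-up sample record: one field holds the content, the other the path.
record = {"JOBCONF-JSON": '{"a": 1}', "jobconf": "/path/job.xml"}
db_columns = {"jobconf"}

# Without an explicit entry, "jobconf" also matches by name and wins.
implicit = map_record(record, {"JOBCONF-JSON": "jobconf"}, db_columns)

# With jobconf -> "", only the JSON content reaches the column.
explicit = map_record(
    record, {"jobconf": "", "JOBCONF-JSON": "jobconf"}, db_columns
)
```

Here `implicit` ends up holding the path and `explicit` holds the JSON
content, which is the difference the empty mapping is meant to make.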


> jobconf Error in mr_job_* table
> -------------------------------
>
>                 Key: CHUKWA-316
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-316
>             Project: Hadoop Chukwa
>          Issue Type: Bug
>          Components: Data Processors
>    Affects Versions: 0.2.0
>         Environment: Redhat EL 5,1, Java 6
>            Reporter: Eric Yang
>            Assignee: Eric Yang
>            Priority: Blocker
>         Attachments: CHUKWA-316.patch
>
>
> In the current mr_job table, the jobconf column contains the content of the 
> jobconf for some rows and the path of the jobconf for other rows.
> In the sequence file, the jobconf field holds the path of the jobconf, and 
> the JOBCONF-JSON field holds its content.  In the database table, the 
> jobconf column holds the content of the jobconf.  Unfortunately, the new 
> automatic mapping detects that the sequence file "jobconf" field matches the 
> "jobconf" column in the database, so both fields are written to that column 
> and the last update wins.
> To work around this problem, we need the ability to exclude a sequence file 
> field from being matched to a database column.
> Personally, I think indirect data translation mapping is the wrong thing to 
> do.  The mapping should be as straightforward as possible.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
