Pig scripts should be able to handle scenarios where input datasets are not 
present or are empty before running
------------------------------------------------------------------------------------------------------

                 Key: PIG-869
                 URL: https://issues.apache.org/jira/browse/PIG-869
             Project: Pig
          Issue Type: Improvement
          Components: impl
    Affects Versions: 0.2.0
         Environment: grid environment testing of pig 2.2
            Reporter: Rekha
            Priority: Minor
             Fix For: 0.2.0


Pig 2.2 does not handle situations where a dataset is not present (for 
example, a missing file) or where the input file is empty.

It would be great if Pig could enforce some data checks within scripts.
This could be any simple construct, like the one below, that can easily be 
wrapped around all input sources:

if (datapath_valid && data_present && file_not_empty) {
    run the rest of the script
} else {
    throw an exception/error code
    -- this should be an easily trappable value/code in the logs
}
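
Until Pig offers such a check itself, the three conditions above could be 
approximated by a small pre-flight wrapper that runs before the script is 
submitted. The following is only a sketch under stated assumptions: it checks 
paths on the local filesystem (inputs on HDFS would need the equivalent 
Hadoop filesystem calls), and the function names, exit codes, and log line 
format are hypothetical, chosen just to illustrate an "easily trappable" 
code. None of this is part of Pig.

```python
import os
import sys

# Hypothetical exit codes, one per failed condition, so each is easy to trap in logs.
EXIT_OK = 0
EXIT_PATH_MISSING = 2   # datapath_valid failed
EXIT_NO_DATA = 3        # data_present failed
EXIT_EMPTY = 4          # file_not_empty failed


def check_input(path):
    """Return an exit code describing the state of one input path."""
    if not os.path.exists(path):
        return EXIT_PATH_MISSING
    if os.path.isdir(path):
        # A directory input counts as present only if it contains regular files.
        candidates = [os.path.join(path, name) for name in sorted(os.listdir(path))]
        files = [f for f in candidates if os.path.isfile(f)]
    else:
        files = [path]
    if not files:
        return EXIT_NO_DATA
    # Treat the input as empty only when every file in it has zero bytes.
    if all(os.path.getsize(f) == 0 for f in files):
        return EXIT_EMPTY
    return EXIT_OK


def check_inputs(paths):
    """Return the code of the first failing path, logging it; EXIT_OK if all pass."""
    for path in paths:
        code = check_input(path)
        if code != EXIT_OK:
            # Single-line, greppable message (format is an assumption).
            print("INPUT_CHECK_FAILED code=%d path=%s" % (code, path), file=sys.stderr)
            return code
    return EXIT_OK
```

A caller would run check_inputs() over every input path and launch the Pig 
script only when it returns EXIT_OK, exiting with the returned code 
otherwise.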

This improvement would benefit our DQ (data quality) checks.
