[ 
https://issues.apache.org/jira/browse/PIG-4670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rohini Palaniswamy updated PIG-4670:
------------------------------------
    Description: 
PIG-3204 fixed pig script parsing to parse in batches instead of line by line. 
But the fix in BoundScript is not right and it is still parsing line by line. 
That makes parsing take long time for very large pig scripts using PigStorage 
when there is no schema file stored and without -noschema as it tries to find 
the schema file lots of times.

It should be grunt.parseStopOnError(false); instead of 
grunt.parseStopOnError(true); to make it parse statements in batch.

  was:PIG-3204 fixed pig script parsing to parse in batches instead of line by 
line. But the fix in BoundScript is not right and it is still parsing line by 
line. That makes parsing take long time for very large pig scripts using 
PigStorage when there is no schema file stored and without -noschema as it 
tries to find the schema file lots of times.


> Embedded Python scripts still parse line by line
> ------------------------------------------------
>
>                 Key: PIG-4670
>                 URL: https://issues.apache.org/jira/browse/PIG-4670
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Rohini Palaniswamy
>            Assignee: Rohini Palaniswamy
>             Fix For: 0.16.0
>
>         Attachments: PIG-4670-1.patch
>
>
> PIG-3204 fixed pig script parsing to parse in batches instead of line by 
> line. But the fix in BoundScript is not right and it is still parsing line by 
> line. That makes parsing take long time for very large pig scripts using 
> PigStorage when there is no schema file stored and without -noschema as it 
> tries to find the schema file lots of times.
> It should be grunt.parseStopOnError(false); instead of 
> grunt.parseStopOnError(true); to make it parse statements in batch.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to