[ 
https://issues.apache.org/jira/browse/PIG-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13727988#comment-13727988
 ] 

Cheolsoo Park commented on PIG-3359:
------------------------------------

All the unit tests pass. I also tested with some of production scripts and 
verified that "pig -dryrun" generates the same output. Awesome!

1. The only difference that I see is that a lot more warnings are printed when 
there are many macro files. For example,
{code}
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_SOURCE_NETFLIX_SEASONS. Using value 1
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_ATTRIBUTE_ORIGINAL_COUNTRY. Using value 1
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_ATTRIBUTE_SEASON_SEQUENCE_NBR. Using value 5
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_ATTRIBUTE_RELEASE_YEAR. Using value 6
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_ATTRIBUTE_EPISODE_COUNT. Using value 8
2013-08-02 19:05:46,158 [main] WARN  
org.apache.pig.tools.parameters.PreprocessorContext - Warning : Multiple values 
found for GCI_ATTRIBUTE_TITLE_TYPE. Using value 20
{code}
This makes sense because you load params every time when importing a macro 
file. Can you please lower the log level to debug for these messages? This may 
unnecessarily scare the user.

2. Can you update the doc? I think you can simply remove the following lines 
from the [macro section|http://pig.apache.org/docs/r0.11.0/cont.html#macros]:
{code}
- Macros can only contain Pig Latin statements. The REGISTER statement is not 
supported. The shell commands (used with Grunt) are not supported.
- Parameter substitution cannot be used inside of macros. Parameters should be 
explicitly passed to macros and parameter substitution used only at the top 
level.
{code}
You can add your examples too, but I will let you decide on that. :-)

I will commit this as soon as you update your patch. Thanks a lot! 
                
> Register Statements and Param Substitution in Macros
> ----------------------------------------------------
>
>                 Key: PIG-3359
>                 URL: https://issues.apache.org/jira/browse/PIG-3359
>             Project: Pig
>          Issue Type: Bug
>          Components: parser
>            Reporter: Jonathan Packer
>            Assignee: Jonathan Packer
>         Attachments: PIG-3359_test.tar.gz, PIG-3359-v1.diff, 
> PIG-3359-v2.diff, PIG-3359-v3.diff, PIG-3359-v3-test-failures.txt, 
> PIG-3359-v4.diff, PIG-3359-v5.diff, PIG-3359-v6.diff
>
>
> There are some gaps in the functionality of macros that I've made a patch to 
> address. The goal is to provide everything you'd need to make reusable 
> algorithms libraries.
> 1. You can't register udfs inside a macro
> 2. Paramater substitutions aren't done inside macros
> 3. Resources (including macros) should not be redundantly acquired if they 
> are already present.
> Rohini's patch https://issues.apache.org/jira/browse/PIG-3204 should address 
> problem 3 where Pig reparses everything every time it reads a line, but there 
> still would be a problem if two separate files import the same macro / udf 
> file.
> To get this working, I moved methods for registering jars/udfs and param 
> substitution from PigServer to PigContext so they can be accessed in 
> QueryParserDriver which processes macros (QPD was already passed a PigContext 
> reference). Is that ok?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to