[ 
https://issues.apache.org/jira/browse/DATAFU-88?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14317197#comment-14317197
 ] 

Matthew Hayes commented on DATAFU-88:
-------------------------------------

Yes, if a user downloads the datafu-pig JAR and attempts to use this UDF 
without separately downloading the GPL v3 library, then this will fail at 
runtime.   All other UDFs would be unaffected.

Another option is to ship a separate datafu-pig-whatever JAR that has UDFs that 
require additional dependencies to be downloaded.

> Port Stanford Core NLP Functionality to DataFu
> ----------------------------------------------
>
>                 Key: DATAFU-88
>                 URL: https://issues.apache.org/jira/browse/DATAFU-88
>             Project: DataFu
>          Issue Type: New Feature
>    Affects Versions: 1.3.0
>            Reporter: Russell Jurney
>            Assignee: Russell Jurney
>              Labels: lemmatizer, nlp, pig, pig_udf, stanford, stemmer
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> For starters I need the Stanford Core NLP stemmer and lemmatizer. 
> It looks like maybe I can add something generic and feed arguments to code 
> like: props.put("annotators", "tokenize, ssplit, pos, lemma");
> Helpful example of lemmatizing at 
> http://stackoverflow.com/questions/1578062/lemmatization-java



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to