[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-11 Thread Joern Kottmann (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081931#comment-16081931
 ] 

Joern Kottmann commented on OPENNLP-1013:
-

Doesn't seem likely that the POS Tagger fails on model loading, the unit tests 
we have are doing that a lot.
The openNLP R stuff, even tough it is called OpenNLP R, is not provided by this 
project.

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-09 Thread Bruno P. Kinoshita (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079528#comment-16079528
 ] 

Bruno P. Kinoshita commented on OPENNLP-1013:
-

Not a problem. The rJava is just the bridge. First I would check with the [CRAN 
opennlp R package|https://cran.r-project.org/package=openNLP]. They will be 
able to thoroughly analyse this issue, and point whether there is a) an 
internal change that is necessary, b) something wrong with rJava, or c) a bug 
in the Java OpenNLP code.

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-09 Thread Iuri Deolindo Nogueira (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079515#comment-16079515
 ] 

Iuri Deolindo Nogueira commented on OPENNLP-1013:
-

Ops, so sorry. My misunderstood. 
I'll check from my side, and if need, I'll go getting in touch with rJava 
package responsables to see if they can give me some more detailled logs to 
understand better the situation.
Thank you again for your concern about it.

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-08 Thread Bruno P. Kinoshita (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079455#comment-16079455
 ] 

Bruno P. Kinoshita commented on OPENNLP-1013:
-

Not fixed [~iurinog]. I added a comment, and the fix version was bumped to the 
next release. Though, again, I believe there is no easy way to troubleshoot and 
maybe fix the issue on our part. Better report an issue for the R package, and 
understand what's happening in the R code first. Maybe the Java API changed and 
now someone needs to update the R code, or maybe the language option wasn't 
really much used/tested before in the R code?

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-08 Thread Iuri Deolindo Nogueira (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079339#comment-16079339
 ] 

Iuri Deolindo Nogueira commented on OPENNLP-1013:
-

Thank you for fixing the problem.
I'll try with the newest version !!

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models

2017-07-07 Thread Bruno P. Kinoshita (JIRA)

[ 
https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078014#comment-16078014
 ] 

Bruno P. Kinoshita commented on OPENNLP-1013:
-

I believe this issue could be in the R module, and not in OpenNLP.

Looking at the code around [this 
part|https://github.com/cran/openNLP/blob/a1709dea5f8a92757fcfa5bf672aa922041dc119/R/pos.R#L54],
 it appears for English (default language value) we have the right fields in 
the meta var.

But when you give it a different language, it is trying to load the models in a 
different way. I am not sure what could be the problem exactly, but it looks to 
be in the R code, of that package, not in OpenNLP code.

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
>  Issue Type: Bug
>  Components: POS Tagger
>Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
>Reporter: Iuri Deolindo Nogueira
>
> When using French models in R language, I'm receving a "subscript out of 
> bound" issue. I'm going to detail:
> -
> Well, I'm using French models to NLP in R environment. To get the french 
> models, I'm using binaries compiled and develloped by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem it happens only with the POS function. This is how I call the 
> function and respective issue:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = 
> paste0(, "fr-pos.bin"))
> Issue: 
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I deleted the language parameter, the issue does not happen 
> anymore:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = 
> paste0(, "fr-pos.bin"))



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)