[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16081931#comment-16081931 ]

Joern Kottmann commented on OPENNLP-1013:
---

It doesn't seem likely that the POS Tagger fails on model loading; the unit tests we have do that a lot. The openNLP R stuff, even though it is called OpenNLP R, is not provided by this project.

> [OpenNLP][R Language][1.5.3-2] Bug when using French models
> ---
>
> Key: OPENNLP-1013
> URL: https://issues.apache.org/jira/browse/OPENNLP-1013
> Project: OpenNLP
> Issue Type: Bug
> Components: POS Tagger
> Affects Versions: tools-1.5.3
> Environment: R Language, RStudio
> Reporter: Iuri Deolindo Nogueira
> Fix For: 1.8.2
>
> When using French models in the R language, I'm receiving a "subscript out of
> bounds" error. Details:
> -
> I'm using French models for NLP in the R environment. To get the French
> models, I'm using binaries compiled and developed by Nicolas:
> https://sites.google.com/site/nicolashernandez/resources/opennlp
> http://enicolashernandez.blogspot.fr/2012/12/apache-opennlp-fr-models.html
> https://drive.google.com/drive/folders/0B4AyWQriFkxgWHR6QzlvcmxmdE0
> -
> The problem happens only with the POS function. This is how I call the
> function, followed by the resulting error:
> Maxent_POS_Tag_Annotator(language = "fr", probs = TRUE, model = paste0(, "fr-pos.bin"))
> Issue:
> Error in environment(f)$meta[[tag]] : subscript out of bounds
> -
> However, if I delete the language parameter, the error no longer occurs:
> Maxent_POS_Tag_Annotator(probs = TRUE, model = paste0(, "fr-pos.bin"))

--
This message was sent by Atlassian JIRA (v6.4.14#64029)
[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079528#comment-16079528 ]

Bruno P. Kinoshita commented on OPENNLP-1013:
---

Not a problem. rJava is just the bridge. First I would check with the [CRAN opennlp R package|https://cran.r-project.org/package=openNLP]. They will be able to analyse this issue thoroughly and point out whether there is a) an internal change that is necessary, b) something wrong with rJava, or c) a bug in the Java OpenNLP code.
[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079515#comment-16079515 ]

Iuri Deolindo Nogueira commented on OPENNLP-1013:
---

Oops, so sorry, my misunderstanding. I'll check on my side and, if needed, get in touch with the rJava package maintainers to see whether they can give me more detailed logs to better understand the situation. Thank you again for looking into this.
[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079455#comment-16079455 ]

Bruno P. Kinoshita commented on OPENNLP-1013:
---

Not fixed, [~iurinog]. I added a comment, and the fix version was bumped to the next release. Again, though, I believe there is no easy way for us to troubleshoot, and maybe fix, the issue on our side. It would be better to report an issue against the R package and first understand what is happening in the R code. Maybe the Java API changed and someone now needs to update the R code, or maybe the language option was not much used or tested in the R code before?
[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16079339#comment-16079339 ]

Iuri Deolindo Nogueira commented on OPENNLP-1013:
---

Thank you for fixing the problem. I'll try the newest version!
[jira] [Commented] (OPENNLP-1013) [OpenNLP][R Language][1.5.3-2] Bug when using French models
[ https://issues.apache.org/jira/browse/OPENNLP-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16078014#comment-16078014 ]

Bruno P. Kinoshita commented on OPENNLP-1013:
---

I believe this issue could be in the R module, and not in OpenNLP. Looking at the code around [this part|https://github.com/cran/openNLP/blob/a1709dea5f8a92757fcfa5bf672aa922041dc119/R/pos.R#L54], it appears that for English (the default language value) the meta variable has the right fields, but when you give it a different language, it tries to load the models in a different way. I am not sure exactly what the problem is, but it looks to be in the R code of that package, not in the OpenNLP code.