[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-10-10 Thread T Jake Luciani (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13124343#comment-13124343 ] T Jake Luciani commented on SOLR-1979: build on 3x branch still failing because

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-10-10 Thread Jan Høydahl (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13124366#comment-13124366 ] Jan Høydahl commented on SOLR-1979: Fixed overview.html in branch

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102520#comment-13102520 ] Markus Jelsma commented on SOLR-1979: Hi Jan, Can we also use the mapping feature

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102573#comment-13102573 ] Jan Høydahl commented on SOLR-1979: @Markus: Sure. If you put your pre-known language

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102578#comment-13102578 ] Markus Jelsma commented on SOLR-1979: Hi. This is not what I understood from reading

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102646#comment-13102646 ] Jan Høydahl commented on SOLR-1979: Yep, it will skip detection if the field defined in

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-12 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102723#comment-13102723 ] Jan Høydahl commented on SOLR-1979: Any changes you'd like before committing this?

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-11 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102374#comment-13102374 ] Jan Høydahl commented on SOLR-1979: An updated documentation of the Processor is now at

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-09-09 Thread Lance Norskog (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13101612#comment-13101612 ] Lance Norskog commented on SOLR-1979: I'm impressed! This is a lot of work and

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-06-22 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13053227#comment-13053227 ] Jan Høydahl commented on SOLR-1979: One question regarding the JUnit test: I now use

[jira] [Commented] (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2011-06-03 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13043448#comment-13043448 ] Jan Høydahl commented on SOLR-1979: Continuing on this implementing the ideas above...

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971322#action_12971322 ] Grant Ingersoll commented on SOLR-1979: bq. What about leveraging payloads (we can

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971338#action_12971338 ] Jan Høydahl commented on SOLR-1979: {quote} Jan, do you have any updates to the patch?

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-14 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12971400#action_12971400 ] Tommaso Teofili commented on SOLR-1979: bq. Keep it basic in first version. Allow for

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-08 Thread Erik Hatcher (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12969404#action_12969404 ] Erik Hatcher commented on SOLR-1979: What about leveraging payloads (we can output

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968633#action_12968633 ] Tommaso Teofili commented on SOLR-1979: bq. However, have you considered extending

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968786#action_12968786 ] Robert Muir commented on SOLR-1979: bq. Kind of random that Thai is thrown in there! I

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967211#action_12967211 ] Jan Høydahl commented on SOLR-1979: @Grant: I dropped the outputField setting and a

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967214#action_12967214 ] Grant Ingersoll commented on SOLR-1979: bq. There should be a way to output the

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968445#action_12968445 ] Yonik Seeley commented on SOLR-1979: bq. In skimming the current patch, it looks like

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-06 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12968528#action_12968528 ] Grant Ingersoll commented on SOLR-1979: bq. So for all unmapped languages, you may

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966955#action_12966955 ] Grant Ingersoll commented on SOLR-1979: See

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966964#action_12966964 ] Jan Høydahl commented on SOLR-1979: Simply allowing to set the threshold for

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966970#action_12966970 ] Jan Høydahl commented on SOLR-1979: The idField input parameter is just used for decent

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966972#action_12966972 ] Robert Muir commented on SOLR-1979: bq. cause that distance measure is kind of an

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12966978#action_12966978 ] Robert Muir commented on SOLR-1979: We really need to not be using ISO 639-1 here. For

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967010#action_12967010 ] Grant Ingersoll commented on SOLR-1979: bq. I would like to see RFC 3066 instead

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967011#action_12967011 ] Grant Ingersoll commented on SOLR-1979: Another thought, here, is that, over time,

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Yonik Seeley (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967016#action_12967016 ] Yonik Seeley commented on SOLR-1979: bq. The new field is made by concatenating the

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967019#action_12967019 ] Robert Muir commented on SOLR-1979: bq. Yeah, that makes sense, however, I believe Tika

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967032#action_12967032 ] Jan Høydahl commented on SOLR-1979: @Robert: Yes, there must be a way to tell whether or

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967046#action_12967046 ] Grant Ingersoll commented on SOLR-1979: bq. @Grant: I actually planned to do the

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967048#action_12967048 ] Grant Ingersoll commented on SOLR-1979: Note, the patch still needs more tests and

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-12-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12967076#action_12967076 ] Robert Muir commented on SOLR-1979: {quote} It makes sense to allow for detecting

[jira] Commented: (SOLR-1979) Create LanguageIdentifierUpdateProcessor

2010-08-17 Thread Jan Høydahl (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12899568#action_12899568 ] Jan Høydahl commented on SOLR-1979: I have implemented a first shot patch using the Tika