Tom Lampert created MAHOUT-1633:
-----------------------------------

             Summary: Failure to execute query when solr index contains 
documents with different fields
                 Key: MAHOUT-1633
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1633
             Project: Mahout
          Issue Type: Bug
          Components: CLI
    Affects Versions: 1.0
         Environment: CentOS 6.5
            Reporter: Tom Lampert
            Priority: Critical
             Fix For: 1.0


When using Lucene2Seq on a lucene Index that contains documents that have 
different fields the following error is output:

java.lang.IllegalArgumentException: Could not create query scorer for query: 
tableName:code
        at 
org.apache.mahout.text.LuceneSegmentRecordReader.initialize(LuceneSegmentRecordReader.java:69)
        at 
org.apache.mahout.text.LuceneSegmentInputFormat.createRecordReader(LuceneSegmentInputFormat.java:76)
        at 
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.<init>(MapTask.java:492)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:735)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:340)
        at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:167)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1557)
        at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:162)

The query that is used executes fine in Solr upon the same index. If the index 
does not contain documents having different fields (from the same source) the 
function executes without a problem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to