Terry Blankers created MAHOUT-1521:
--------------------------------------
Summary: lucene2seq - Error trying to load data from stored field (when non-indexed)
Key: MAHOUT-1521
URL: https://issues.apache.org/jira/browse/MAHOUT-1521
Project: Mahout
Issue Type: Bug
Affects Versions: 0.9
Reporter: Terry Blankers
Fix For: 1.0
When using lucene2seq to load data from a field that is stored but not indexed,
I receive the following error:
{noformat}IllegalArgumentException: Field 'body' does not exist in the index{noformat}
Field is described in schema.xml as:
{noformat}<field name="body" type="string" stored="true" indexed="false"/>{noformat}
BTW, the field is copied to the 'content' field for searching. schema.xml snippet:
{noformat}<copyField source="body" dest="content" />{noformat}
Copy field is described in schema.xml as:
{noformat}<field name="content" type="text" stored="false" indexed="true" multiValued="true"/>{noformat}
If I try to load data from the copy field, lucene2seq runs with no errors but I
receive empty data for each key/doc:
{noformat}Key class: class org.apache.hadoop.io.Text Value Class: class org.apache.hadoop.io.Text
Key: 96C4C76CF9D7449C724CA77CB8F650EAFD33E31C: Value:
Key: D6842B81B8D09733B50BEDB4767C2A5C49E43B20: Value:{noformat}
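To make the two symptoms easier to reason about, here is a toy Python model of the schema described in this report. It is only a sketch and is not Mahout, Lucene, or Solr code: `FieldDef`, `add_document`, `field_has_terms`, and `read_stored` are hypothetical names, and the idea that the existence check looks only at indexed terms is an assumption that this report does not confirm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldDef:
    """Toy stand-in for a schema.xml <field> entry (hypothetical, not a Solr class)."""
    stored: bool
    indexed: bool


# Flags taken from the schema.xml snippets in this report.
SCHEMA = {
    "body": FieldDef(stored=True, indexed=False),
    "content": FieldDef(stored=False, indexed=True),
}


def add_document(text):
    """Return (stored_values, indexed_terms) for one document.

    'body' receives the text; the copyField rule also feeds it to 'content'.
    """
    stored, terms = {}, {}
    for name in ("body", "content"):
        field = SCHEMA[name]
        if field.stored:
            stored[name] = text  # original value can be read back
        if field.indexed:
            terms[name] = set(text.lower().split())  # searchable terms only
    return stored, terms


def field_has_terms(terms, name):
    """Hypothetical existence check that only looks at indexed terms (assumption)."""
    return name in terms


def read_stored(stored, name):
    """Stored-field lookup; returns an empty string when nothing was stored."""
    return stored.get(name, "")


stored, terms = add_document("Hello Mahout")
print(field_has_terms(terms, "body"))        # 'body' has no indexed terms
print(repr(read_stored(stored, "content")))  # 'content' has no stored value
```

Under these assumptions, a check based on indexed terms would reject 'body' even though its value is stored, and reading 'content' would succeed but yield nothing, which matches the two behaviours described above.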