Hi folks,

It looks like the handle leak may be real - Simon Willnauer has been looking at 
it and could not find an explanation for the behavior I have been seeing.  But 
before we got too far on that problem, I encountered what appears to be an even 
more serious problem.  Specifically, I'm losing field data out of some records.

The index I'm building is fairly large - some 25M records when complete.  What 
I'm seeing is that the main searchable field ("value") is not finding all the 
records it should.  I was able to locate one such record just now:

curl 
"http://localhost:8983/solr/nose/standard?fl=*,score&q=id:\"POI|DEU:205:20187477:1014564|brandenburger+tor\""
<?xml version="1.0" encoding="UTF-8"?>
<response>
<lst name="responseHeader"><int name="status">0</int><int 
name="QTime">95</int><lst name="params"><str 
name="q">id:"POI|DEU:205:20187477:1014564|brandenburger tor"</str><str 
name="fl">*,score</str></lst></lst><result name="response" numFound="1" 
start="0" maxScore="17.335964"><doc><float name="score">17.335964</float><str 
name="entityid">POI|DEU:205:20187477:1014564|brandenburger tor</str><str 
name="id">POI|DEU:205:20187477:1014564|brandenburger tor</str><str 
name="reference">brandenburger tor, potsdam, deutschland</str><str 
name="type">poi</str> ... </doc></result>
</response>

.. but it is completely missing the supposedly required "value" field:

   <!-- The value field.  This contains the actual string that will be 
matched.-->
   <field name="value" type="string_idx"  required="true" stored="false"/>

The code that does the indexing is straightforward, and *some* of the records 
of this class are indeed searchable via the "value" field, but others aren't.  
I know the "value" field is non-empty, because it is used to construct the "id" 
field, which is correct above.

Simon is also looking into this one, but if anyone else has advice for figuring 
out what's going wrong, please let me know.  FWIW, this is a trunk build from 
Monday morning.

Karl

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to