[jira] Commented: (LUCENE-509) Performance optimization when retrieving a single field from a document

Doug Cutting (JIRA) Thu, 02 Mar 2006 10:19:02 -0800

    [ 
http://issues.apache.org/jira/browse/LUCENE-509?page=comments#action_12368559 ]


Doug Cutting commented on LUCENE-509:
-------------------------------------

This seems like a fine idea.  But unless I'm mistaken, there's a bug when 
fields are strings that contain characters > 127.  With strings, the length 
written is (unfortunately) the number of Java characters, not the number of 
bytes.  There has been a lot of discussion about eventually changing this to be 
the number of bytes, but that has not yet happened.  So, until that happens, 
we'd have to scan the UTF8 for string values, counting characters, rather than 
simply seeking ahead.

> Performance optimization when retrieving a single field from a document
> -----------------------------------------------------------------------
>
>          Key: LUCENE-509
>          URL: http://issues.apache.org/jira/browse/LUCENE-509
>      Project: Lucene - Java
>         Type: Improvement
>   Components: Index
>     Versions: 1.9, 2.0
>     Reporter: Steven Tamm
>  Attachments: DocField.patch
>
> If you just want to retrieve a single field from a Document, the only way to 
> do it is to retrieve all the fields from the Document and then search it.  
> This patch is an optimization that allows you retrieve a specific field from 
> a document without instantiating a lot of field and string objects.  This 
> reduces our memory consumption on a per query basis by around around 20% when 
> a lot of documents are returned.
> I've added a lot of comments saying you should only call it if you only ever 
> need one field.  There's also a unit test.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

[jira] Commented: (LUCENE-509) Performance optimization when retrieving a single field from a document

Reply via email to