On Sun, Mar 17, 2013 at 5:56 PM, eksdev <eks...@googlemail.com> wrote:
> Hi Adrian,

Hi eksdev,

> I cannot tell if such thing would make it less or more robust, just thinking 
> aloud  :)
>
> I am thinking of it as a way to somehow postpone byte->type conversion to the 
> moment where it is really needed.  Simply, keep byte[] around as long as 
> possible.
> *Theoretically*, this should improve gc() and memory footprint for some types 
> of downstream processing. It all depends how easy would something like that 
> be.
>
> There is already a way to achieve this by using binary field type, …  hmmm, 
> maybe some lucene.expert hack to make Lucene think every field is binary wold 
> be simple and robust enough?
> e.g. Visitor.transportOnlySerializedValuesWithoutTypeConversion()

Sorry, but I think it would do more harm than good:
 - Stored fields encoding is an implementation detail so someone could
write a StoredFieldsFormat that serializes strings in UTF-16 to avoid
decoding overhead at read time, how
would.transportOnlySerializedValuesWithoutTypeConversion know the
actual encoding used by the underlying StoredFieldsFormat?
 - It would make users think that this kind of optimization is
valuable performance-wise while I think it's unnoticeable.

-- 
Adrien

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to