"Is it good idea to store complete Json as string to Lucene DB. If we store as 
separate fields then we have around 30 fields. There will be 30 seeks to get 
complete stored fields”

This is not true. Under the covers, all the stored fields are compressed and 
stored as a blob and Lucene does the magic of un-compressing that blob and 
extracting the stored field when you ask for it.

Further, while you’re right that storing lots of things will bloat the index, 
that’s not very important. Stored data is kept in separate files (*.fdx) in 
each segment and has little to no impact on search performance. That data is 
not accessed unless you ask for the field to be returned, i.e. it’s not part of 
the data used to get the top N documents. Say you have a search that has 
10,000,000 hits and return the top 10. _Only_ the stored data for those top 10 
hits is accessed, and that only after all the scoring is done.

I think this is premature optimization, try using the least-complex way 
organizing your data and measure.

Best,
Erick

> On Apr 22, 2020, at 1:00 AM, ganesh m <emailg...@yahoo.co.in.INVALID> wrote:
> 
> Is it good idea to store complete Json as string to Lucene DB. If we store as 
> separate fields then we have around 30 fields. There will be 30 seeks to get 
> complete stored fields


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

Reply via email to