Multiple fields vs one field

Albert Vila Mon, 06 Aug 2007 05:12:39 -0700

Hi all

 My data looks like:


        Document 1
                code, title, content, type, language, date, ...
        Document 2
                code, title, content, type, language, date, ...
        ...
        Document n
                code, title, content, type, language, date, ...

Now all document types share the same fields, but in a future weneed to add more document types with specific fields. I allways sortdocuments by date. I have 200.000 new documents each day and 130million documents. The janaury index size is 4.2Gb (the data size isabout 10Gb).


 I was wondering how to index the new document types.

Option 1: One index for each document type. Each index will have itsfields.Problem: I will have to perfom a search for each index, and sortresults by date.Option 2: One big index containing all fields. A field could beempty if the field is not applicable for that document type.Option 3: One big index containing all common fields, and adding andextra field named metadata. Inside this field I will add all specificfields (field1:value1 field2:value2).

Comments, pros and contras will be appreciated. I don't know exactlythe diference between option 2 and option 3.


Thanks

Albert

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Multiple fields vs one field

Reply via email to