Hi again....
Another dumb question :) (actually I'm too busy to look at the code :) )
In the index, is the datastructure of termDocs (is that the right
term), sorted by anything? Or is it just insertion order ? I could
see how one might want to sort by the Doc with the highest term
frequency ? But I can also see why
it might not help.
e.g. Token1 -> doc1 (2) [occurences] -> doc2 (6) -> doc3 (3)
or is it like this ?
Token1 -> doc2 (6) -> doc3 (3) -> doc1 (2) ?
I have an idea for an optimization I want to make, but I'm not sure
exactly whether it is warrants investigation.
Winton
Winton Davies
Lead Engineer, Overture (NSDQ: OVER)
1820 Gateway Drive, Suite 360
San Mateo, CA 94404
work: (650) 403-2259
cell: (650) 867-1598
http://www.overture.com/
--
To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>