I am going to leave them off for now.

>>> [EMAIL PROTECTED] 02/17/04 04:03PM >>>
Grant Ingersoll wrote:
> Do you see any reason to write position information at all for the term vectors?

It could be useful to some folks, for example if you only want to 
expand a query with terms that occur near query terms, as in automatic 
phrase identification.  In general, the vector stuff is just a constant 
factor improvement over re-tokenizing the text of the document, but 
hopefully a substantial one.  If folks are doing computations which 
require positional information, but don't require the actual text (e.g., 
they don't need user-readable fragments), then positions could be handy.
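
To make the query expansion case concrete, here is a rough sketch of how 
stored positions could be used to pick expansion candidates without 
re-tokenizing.  It is not Lucene code: the dict-of-positions layout, the 
nearby_terms helper and the window parameter are all assumptions made up 
for illustration.

```python
from collections import defaultdict


def nearby_terms(term_positions, query_terms, window=2):
    """Count occurrences of non-query terms within `window` positions
    of any query term occurrence.

    term_positions is a hypothetical positional term vector for one
    document: {term: [position, ...]}.
    """
    query_terms = set(query_terms)
    # Collect every position at which some query term occurs.
    query_positions = [
        p for t in query_terms for p in term_positions.get(t, [])
    ]
    counts = defaultdict(int)
    for term, positions in term_positions.items():
        if term in query_terms:
            continue  # never suggest a term that is already in the query
        for p in positions:
            if any(abs(p - q) <= window for q in query_positions):
                counts[term] += 1
    return dict(counts)


# Hypothetical vector for "the quick brown fox jumps over the lazy dog".
doc_vector = {
    "the": [0, 6], "quick": [1], "brown": [2], "fox": [3],
    "jumps": [4], "over": [5], "lazy": [7], "dog": [8],
}
candidates = nearby_terms(doc_vector, ["fox"], window=1)
```

With only term frequencies stored, the same question would mean 
re-tokenizing the document text to recover where each term occurred.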

But, certainly, most applications for term vectors do not need 
positions, and I would not be upset if these were left out of the first 
version.

Doug

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED] 
For additional commands, e-mail: [EMAIL PROTECTED] 


