I am going to leave them off for now.

>>> [EMAIL PROTECTED] 02/17/04 04:03PM >>>
Grant Ingersoll wrote:
> Do you see any reason to write position information at all for the term vectors?

It could be useful to some folks, for example if you only want to expand a query with terms that occur near query terms, as in automatic phrase identification. In general, the vector stuff is just a constant-factor improvement over re-tokenizing the text of the document, but hopefully a substantial one. If folks are doing computations which require positional information but don't require the actual text (e.g., they don't need user-readable fragments), then positions could be handy.

But, certainly, most applications for term vectors do not need positions, and I would not be upset if these were left out of the first version.

Doug

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
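
To make the positional use case in the message above concrete, here is a rough sketch of expanding a query with terms that occur near a query term. It does not use any Lucene API: the dictionary layout (term mapped to a list of token positions), the function name `nearby_terms`, and the `window` parameter are all assumptions made for illustration only.

```python
from collections import defaultdict

def nearby_terms(term_positions, query_terms, window=2):
    """Count, per term, the occurrences within `window` positions of a query term.

    `term_positions` is a hypothetical per-document term vector with
    positions: a dict mapping each term to its list of token positions.
    """
    # Collect every position at which any query term occurs in the document.
    query_positions = [p for t in query_terms for p in term_positions.get(t, [])]

    counts = defaultdict(int)
    for term, positions in term_positions.items():
        if term in query_terms:
            continue  # only non-query terms are expansion candidates
        for p in positions:
            # An occurrence counts if it is close to at least one query position.
            if any(abs(p - q) <= window for q in query_positions):
                counts[term] += 1
    return dict(counts)

# Hypothetical document vector (illustrative data, not from a real index).
doc_vector = {
    "term": [0, 7],
    "vector": [1, 8],
    "position": [3],
    "index": [12],
}

expansion = nearby_terms(doc_vector, {"vector"}, window=2)
```

Without stored positions, the same computation would require re-tokenizing the document text to recover where each term occurs, which is the cost the message describes.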