SIPs and CAPs

Erik Hatcher Thu, 14 Jul 2005 03:45:44 -0700

Has anyone developed code to extract SIPs (statistically improbable phrases) and CAPs (capitalized phrases) from a Lucene index, as Amazon does with its books, as shown here?

<http://www.amazon.com/exec/obidos/tg/detail/-/0764526413/ref=sip_top_dp/102-8573693-0514548?%5Fencoding=UTF8&v=glance>

I'm curious as it is something I'd like to do with some of my work. Of course CAPs would be impossible to extract from an index that used a lowercasing analyzer, so that is a special case that would require work during indexing. But SIPs could be extracted from an existing index.
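
To make the SIP idea concrete, here is a rough sketch of one way it might be scored. It is not Amazon's method, which is not described here, and it does not use Lucene. It assumes a "phrase" is a two-word sequence and that a phrase is "improbable" when it is much more frequent in one document than in a background corpus. The names sip_scores, bigrams, and min_count are made up for illustration, and the token lists stand in for whatever counts would be pulled from a real index.

```python
import math
from collections import Counter


def bigrams(tokens):
    """Return the list of adjacent word pairs in a token list."""
    return list(zip(tokens, tokens[1:]))


def sip_scores(doc_tokens, corpus_docs, min_count=2):
    """Rank a document's bigrams by log(P(phrase | doc) / P(phrase | corpus)).

    Assumption: corpus_docs is a list of token lists used as the background.
    Add-one smoothing keeps unseen corpus phrases from dividing by zero.
    """
    doc_counts = Counter(bigrams(doc_tokens))
    corpus_counts = Counter()
    for tokens in corpus_docs:
        corpus_counts.update(bigrams(tokens))

    doc_total = sum(doc_counts.values())
    corpus_total = sum(corpus_counts.values())
    # Number of distinct phrases seen anywhere, used by the smoothing term.
    vocab = len(set(doc_counts) | set(corpus_counts))

    scores = {}
    for phrase, count in doc_counts.items():
        if count < min_count:
            continue  # skip phrases too rare in the document to be meaningful
        p_doc = count / doc_total
        p_corpus = (corpus_counts[phrase] + 1) / (corpus_total + vocab)
        scores[phrase] = math.log(p_doc / p_corpus)

    # Highest score first: most over-represented relative to the corpus.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)
```

With a lowercased index this still works, since it relies only on phrase counts and not on case. How the phrase counts would be obtained from an existing Lucene index is left out of the sketch.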


Thanks,
    Erik

