Yes. This can be done. It isn't necessarily real simple to do. See http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.56.7275 for an old (but still pretty good) example.
On Tue, Jun 23, 2009 at 6:45 AM, Paul Jones <paul_jone...@yahoo.co.uk>wrote: > Imagine we have crawled 100K webpages, and we have 100 pages which show > "red" and 100 which show "crimson" and then 100 which show both "red and > crimson" is there a way to deduce that there maybe (albeit weak) > relationship between red AND crimson. Of course we can pre-seed this info, > which then gets weighted by actual results. >