On Thu, Mar 12, 2009 at 5:15 AM, Dave Long <[email protected]> wrote: >> We find that proper-nouns >> constitute 40% of query terms, and proper nouns and nouns together >> constitute over 70% of query terms. We also show that the majority of >> queries are nounphrases, not unstructured collections of terms. > > Way back when, we had a web-authoring environment that proposed likely > hyperlinks by picking noun phrases out of a draft webpage and matching them > against a full-text index of other content on the same server. At the time > NCSA's "what's new" page was state-of-the-art in finding third-party web > content; it might be interesting were someone else to take a run at that > fence using modern search engine queries...
I recently read this about Erwise - apparently the world's first graphical browser.[1] "Kim Nyberg, Kari Sydänmaanlakka, Teemu Rantanen, and Kati Borgers were the four college students who developed the first graphical point-and-click web browser in Helsinki, Finland, in 1991, and completed it in April 1992. It had all sorts of abilities that Mosaic, released a year later, did not have, such as word search on webpages. This search system was interesting because if it couldn't find a word on the currently displayed webpage, it would trail the internet to find it in other pages - 12 at the time. More impressive was Erwise's ability to load multiple pages at the same time; you could click on a hyperlink and another window would open the linked page. " -- Vinayak 1. http://www.osnews.com/story/21076/The_World_s_First_Graphical_Browser_Erwise
