Probably some of you already saw this, which Sven Aerts sent to some of us...
It seems it is much related to the database we were speaking about... Proabably a way of getting into it would be better by trying to join efforts with Google (and resources). We should make a paper describing what the current search engine is lacking, and how to overcome it, and then suggest it to the Google people... Carlos Gershenson... Centrum Leo Apostel, Vrije Universiteit Brussel Krijgskundestraat 33. B-1160 Brussels, Belgium http://homepages.vub.ac.be/~cgershen/ “Knowledge brings more questions than answers” This is a forwarded message From: Ricardo Barbosa <[EMAIL PROTECTED]> Date: Monday, November 22, 2004, 7:25:22 PM Subject: Fwd: Scientists get their own Google ===8<==============Original message text=============== Scientists get their own Google Declan Butler <http://www.nature.com/news/about/aboutus.html#Butler> *New search engine ranks papers by importance, and finds the free versions.* Google Scholar searches for scientific articles instead of web pages. /© Google/ Media box <javascript:void(0)> Imagine searching the Internet and being able to restrict your results to academic texts. Today Google launched a free search engine that aims to do just that. Google Scholar searches only journal articles, theses, books, preprints, and technical reports across any area of research. A test version of the search engine is available at http://scholar.google.com <http://scholar.google.com/>, so you can try it out. In a search for the phrase "human genome", for example, a normal Google web search throws back 450,000 or so hits, with genome centres and databases and other websites ranked top. In contrast, Google Scholar returns just 113,000 hits, and all the top-ranked items are not websites but seminal papers on the subject. In fact, the number one hit is the landmark article "Initial sequencing and analysis of the human genome"^1 <http://www.nature.com/news/2004/041115/full/041115-13.html#B1> published in /Nature/ in 2001. *On the links* The tool is based on principles similar to those of Google's web search. The original search manages to make the most useful references appear at the top of the page using algorithms that exploit the structure of the links between web pages. Pages with many links pointing to them are considered 'authorities', and ranked highest in search returns. The ranking is refined by taking into account the importance of the origins of links to a paper. "We don't just look at the number of links," says Sergey Brin, a cofounder of Google. "A link from the Nature home page will be given more weight than a link from my home page," he explains. Google Scholar works in much the same way, using the citations at the end of each paper, rather than web links. It automatically identifies the format and content of scientific texts from around the web, extracts the references and builds automatic citation analyses for all the papers indexed. This approach has been pioneered in computer science by ResearchIndex, software produced by the information technology company NEC. *Search for success* Much of the peer-reviewed material has been made available to Google by publishers, including Nature Publishing Group, the Association for Computing Machinery and the Institute of Electrical and Electronics Engineers, through a pilot cross-publisher search engine called CrossRef Search. Publishers have arranged for Google robots to scan the full texts of their articles. Users clicking on a hit returned by Google Scholar are directed to the article on the publisher's site, where subscribers can access full text and non-subscribers get an abstract or information on how to buy an article. Google Scholar has a subversive feature, however. Each hit also links to all the free versions of the article it has found saved on other sites, for example on personal home pages, elsewhere on the Internet. ===8<===========End of original message text===========
Scientists get their own Google
A test version of the search engine is available at http://scholar.google.com, so you can try it out. In a search for the phrase "human genome", for example, a normal Google web search throws back 450,000 or so hits, with genome centres and databases and other websites ranked top. In contrast, Google Scholar returns just 113,000 hits, and all the top-ranked items are not websites but seminal papers on the subject. In fact, the number one hit is the landmark article "Initial sequencing and analysis of the human genome"1 published in Nature in 2001. On the links The tool is based on principles similar to those of Google's web search. The original search manages to make the most useful references appear at the top of the page using algorithms that exploit the structure of the links between web pages. Pages with many links pointing to them are considered 'authorities', and ranked highest in search returns. The ranking is refined by taking into account the importance of the origins of links to a paper. "We don't just look at the number of links," says Sergey Brin, a cofounder of Google. "A link from the Nature home page will be given more weight than a link from my home page," he explains. Google Scholar works in much the same way, using the citations at the end of each paper, rather than web links. It automatically identifies the format and content of scientific texts from around the web, extracts the references and builds automatic citation analyses for all the papers indexed. This approach has been pioneered in computer science by ResearchIndex, software produced by the information technology company NEC. Search for success Much of the peer-reviewed material has been made available to Google by publishers, including Nature Publishing Group, the Association for Computing Machinery and the Institute of Electrical and Electronics Engineers, through a pilot cross-publisher search engine called CrossRef Search. Publishers have arranged for Google robots to scan the full texts of their articles. Users clicking on a hit returned by Google Scholar are directed to the article on the publisher's site, where subscribers can access full text and non-subscribers get an abstract or information on how to buy an article. Google Scholar has a subversive feature, however. Each hit also links to all the free versions of the article it has found saved on other sites, for example on personal home pages, elsewhere on the Internet. |