Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by MichaelStack: http://wiki.apache.org/nutch/FAQ The comment on the change is: Edit. Add mention of RSS 2.0. ------------------------------------------------------------------------------ Anchor text makes a large contribution to document score (You can see the anchor text for a page by browsing to "explain" then editing the URL to put in place "anchors.jsp" in place of "explain.jsp"). ==== What is the RSS symbol in search results all about? ==== - Clicking on the RSS symbol sends the current query back to Nutch to a servlet named OpenSearchServlet. OpenSearchServlet reruns the query and returns the results formatted instead as RSS (XML). The RSS format is based on [http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch RSS 1.0] from [http://www.a9.com a9.com]: "OpenSearch RSS 1.0 is an extension to the RSS 2.0 standard, conforming to the guidelines for RSS extensibility as outlined by the RSS 2.0 specification" (See also [http://opensearch.a9.com/ OpenSearch]). Nutch in turn makes extension to OpenSearch. The Nutch extensions are identified by the 'nutch' namespace prefix and add to OpenSearch navigation information, the original query, and all fields that are available at search result time including the Nutch page boost, the name of the segment the page resides in, etc. + Clicking on the RSS symbol sends the current query back to Nutch to a servlet named [http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html OpenSearchServlet]. [http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html OpenSearchServlet] reruns the query and returns the results formatted instead as RSS (XML). The RSS format is based on [http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch RSS 1.0] from [http://www.a9.com a9.com]: "[http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch] RSS 1.0 is an extension to the RSS 2.0 standard, conforming to the guidelines for RSS extensibility as outlined by the RSS 2.0 specification" (See also [http://opensearch.a9.com/ opensearch]). Nutch in turn makes extension to [http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch]. The Nutch extensions are identified by the 'nutch' namespace prefix and add to [http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch] navigation information, the original query, and all fields that are available at search result time including the Nutch page boost, the name of the segment the page resides in, etc. - Results as RSS (XML) rather than HTML are easier for programmatic clients to parse: such clients will query against OpenSearchServlet rather than search.jsp. Results as XML can also be transformed using XSL stylesheets, the likely direction of UI development going forward. + Results as RSS (XML) rather than HTML are easier for programmatic clients to parse: such clients will query against [http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html OpenSearchServlet] rather than search.jsp. Results as XML can also be transformed using XSL stylesheets, the likely direction of UI development going forward. === Crawling === ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nutch-cvs mailing list Nutch-cvs@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nutch-cvs