Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by MichaelStack:
http://wiki.apache.org/nutch/FAQ

The comment on the change is:
Edit.  Add mention of RSS 2.0.

------------------------------------------------------------------------------
  Anchor text makes a large contribution to document score (You can see the 
anchor text for a page by browsing to "explain" then editing the URL to put in 
place "anchors.jsp" in place of "explain.jsp").
  
  ==== What is the RSS symbol in search results all about? ====
- Clicking on the RSS symbol sends the current query back to Nutch to a servlet 
named OpenSearchServlet.  OpenSearchServlet reruns the query and returns the 
results formatted instead as RSS (XML).  The RSS format is based on 
[http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch RSS 1.0] from 
[http://www.a9.com a9.com]: "OpenSearch RSS 1.0 is an extension to the RSS 2.0 
standard, conforming to the guidelines for RSS extensibility as outlined by the 
RSS 2.0 specification" (See also [http://opensearch.a9.com/ OpenSearch]). Nutch 
in turn  makes extension to OpenSearch.  The Nutch extensions are identified by 
the 'nutch' namespace prefix and add to OpenSearch navigation information, the 
original query, and all fields that are available at search result time 
including the Nutch page boost, the name of the segment the page resides in, 
etc. 
+ Clicking on the RSS symbol sends the current query back to Nutch to a servlet 
named 
[http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html
 OpenSearchServlet].  
[http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html
 OpenSearchServlet] reruns the query and returns the results formatted instead 
as RSS (XML).  The RSS format is based on 
[http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch RSS 1.0] from 
[http://www.a9.com a9.com]: "[http://a9.com/-/spec/opensearchrss/1.0/ 
OpenSearch] RSS 1.0 is an extension to the RSS 2.0 standard, conforming to the 
guidelines for RSS extensibility as outlined by the RSS 2.0 specification" (See 
also [http://opensearch.a9.com/ opensearch]). Nutch in turn  makes extension to 
[http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch].  The Nutch extensions are 
identified by the 'nutch' namespace prefix and add to 
[http://a9.com/-/spec/opensearchrss/1.0/ OpenSearch] navigation information,
  the original query, and all fields that are available at search result time 
including the Nutch page boost, the name of the segment the page resides in, 
etc. 
  
- Results as RSS (XML) rather than HTML are easier for programmatic clients to 
parse: such clients will query against OpenSearchServlet rather than 
search.jsp.  Results as XML can also be transformed using XSL stylesheets, the 
likely direction of UI development going forward.
+ Results as RSS (XML) rather than HTML are easier for programmatic clients to 
parse: such clients will query against 
[http://lucene.apache.org/nutch/apidocs/org/apache/nutch/searcher/OpenSearchServlet.html
 OpenSearchServlet] rather than search.jsp.  Results as XML can also be 
transformed using XSL stylesheets, the likely direction of UI development going 
forward.
  
  === Crawling ===
  


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs

Reply via email to