otis 2003/10/15 06:51:38 Modified: xdocs/lucene-sandbox index.xml docs/lucene-sandbox index.html Log: - Fixed indentation and organized better. Revision Changes Path 1.10 +144 -144 jakarta-lucene/xdocs/lucene-sandbox/index.xml Index: index.xml =================================================================== RCS file: /home/cvs/jakarta-lucene/xdocs/lucene-sandbox/index.xml,v retrieving revision 1.9 retrieving revision 1.10 diff -u -r1.9 -r1.10 --- index.xml 15 Oct 2003 13:50:03 -0000 1.9 +++ index.xml 15 Oct 2003 13:51:38 -0000 1.10 @@ -1,149 +1,149 @@ <?xml version="1.0"?> <document> -<properties> -<author>Otis Gospodentic</author> -<title>Lucene Sandbox</title> -</properties> -<body> - -<section name="Lucene Sandbox"> -<p> -Lucene project also contains a workspace, Lucene Sandbox, that is open to all Lucene committers, as well -as a few other developers. The purpose of the Sandbox is to host various third party contributions, -and to serve as a place to try out new ideas and prepare them for inclusion into the core Lucene -distribution.<br/> -Users are free to experiment with the components developed in the Sandbox, but Sandbox components will -not necessarily be maintained, particularly in their current state. -</p> - -<p> -You can access the Lucene Sandbox CVS repository at -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/">http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/</a>. -</p> - -<subsection name="Snowball Stemmers for Lucene"> -<p> -This project provides pre-compiled versions of the Snowball stemmers -for Lucene. -</p> - -<p> -More information can be found -<a href="http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/">here</a>. -</p> - -<p> -<a href="http://snowball.tartarus.org/">Background information on Snowball</a>, -which is a language for stemmers developed by Martin Porter. -</p> - -</subsection> - -<subsection name="SearchBean"> -<p> -SearchBean is a UI component that can be used to browse through the results of a Lucene search. -The SearchBean searches the index for a given query string, retrieves the hits, and then brings -them into the HitsIterator class, which can be used for paging and sorting through search results. - -</p> -<p> -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchbean/">The -CVS repository for the SearchBean contribution.</a> -</p> - -</subsection> - -<subsection name="Ant"> -<p> -The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also -contains an example HTML parser that uses JTidy. -</p> -<p> -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The -CVS repository for the Ant contribution.</a> -</p> -</subsection> - -<subsection name="WordNet/Synonyms"> -<p> -The Lucene WordNet code consists of a single class which parses a prolog file -from the WordNet site that contains a list of English words and synonyms. -The class builds a Lucene index from the synonyms file. Your querying code could -hit this index to build up a set of synonyms for the terms in the -search query. -</p> -<p> -More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>. -<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains -synonyms, definitions, and various relationships between synonym sets. -</p> -<p> -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/"> -CVS for the WordNet module.</a> -</p> -</subsection> - -<subsection name="SAX/DOM XML Indexing demo"> -<p> -This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates -a new Document object for each file, and then populates the Document with a Field for each XML element, recursively. -There are examples included for both SAX and DOM. -</p> -<p> - -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/"> -CVS for the XML Indexing Demo.</a> -</p> -</subsection> - -<subsection name="LARM"> -<p> -LARM is a web crawler optimized for large intranets with up to a couple of hundred hosts. -</p> -<a href="larm/overview.html">Technical Overview</a>. -See also: <a href="http://larm.sourceforge.net/">LARM's home page on SourceForge</a>. - -</subsection> - -<subsection name="Javascript Query Constructor"> -<p> -Javascript library to support client-side query-building. Provides support for a user interface similar to -<a href="http://www.google.com.sg/advanced_search">Google's Advanced Search</a>. -</p> -<p> - -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryConstructor/"> -CVS for the files.</a> -</p> -</subsection> - -<subsection name="Javascript Query Validator"> -<p> -Javascript library to support client-side query validation. Lucene doesn't like malformed queries and tends to -throw ParseException, which are often difficult to interpret and pass on to the user. This library hopes to -alleviate that problem. -</p> -<p> - -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryValidator/"> -CVS for files.</a> -</p> -</subsection> - -<subsection name="High Frequency Terms"> -<p> -The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines -what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go -into a custom stop word list for better search results. -</p> -<p> - -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/"> -CVS for miscellaneous classes.</a> -</p> -</subsection> + <properties> + <author>Otis Gospodentic</author> + <title>Lucene Sandbox</title> + </properties> + <body> + + <section name="Lucene Sandbox"> + <p> + Lucene project also contains a workspace, Lucene Sandbox, that is open to all Lucene committers, as well + as a few other developers. The purpose of the Sandbox is to host various third party contributions, + and to serve as a place to try out new ideas and prepare them for inclusion into the core Lucene + distribution.<br/> + Users are free to experiment with the components developed in the Sandbox, but Sandbox components will + not necessarily be maintained, particularly in their current state. + </p> + + <p> + You can access the Lucene Sandbox CVS repository at + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/">http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/</a>. + </p> + + <subsection name="Snowball Stemmers for Lucene"> + <p> + This project provides pre-compiled versions of the Snowball stemmers + for Lucene. + </p> + + <p> + More information can be found + <a href="http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/">here</a>. + </p> + + <p> + <a href="http://snowball.tartarus.org/">Background information on Snowball</a>, + which is a language for stemmers developed by Martin Porter. + </p> + + </subsection> + + <subsection name="SearchBean"> + <p> + SearchBean is a UI component that can be used to browse through the results of a Lucene search. + The SearchBean searches the index for a given query string, retrieves the hits, and then brings + them into the HitsIterator class, which can be used for paging and sorting through search results. + + </p> + <p> + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchbean/">The + CVS repository for the SearchBean contribution.</a> + </p> + + </subsection> + + <subsection name="Ant"> + <p> + The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also + contains an example HTML parser that uses JTidy. + </p> + <p> + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The + CVS repository for the Ant contribution.</a> + </p> + </subsection> + + <subsection name="WordNet/Synonyms"> + <p> + The Lucene WordNet code consists of a single class which parses a prolog file + from the WordNet site that contains a list of English words and synonyms. + The class builds a Lucene index from the synonyms file. Your querying code could + hit this index to build up a set of synonyms for the terms in the + search query. + </p> + <p> + More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>. + <a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains + synonyms, definitions, and various relationships between synonym sets. + </p> + <p> + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/"> + CVS for the WordNet module.</a> + </p> + </subsection> + + <subsection name="SAX/DOM XML Indexing demo"> + <p> + This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates + a new Document object for each file, and then populates the Document with a Field for each XML element, recursively. + There are examples included for both SAX and DOM. + </p> + <p> + + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/"> + CVS for the XML Indexing Demo.</a> + </p> + </subsection> + + <subsection name="LARM"> + <p> + LARM is a web crawler optimized for large intranets with up to a couple of hundred hosts. + </p> + <a href="larm/overview.html">Technical Overview</a>. + See also: <a href="http://larm.sourceforge.net/">LARM's home page on SourceForge</a>. + + </subsection> + + <subsection name="Javascript Query Constructor"> + <p> + Javascript library to support client-side query-building. Provides support for a user interface similar to + <a href="http://www.google.com.sg/advanced_search">Google's Advanced Search</a>. + </p> + <p> + + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryConstructor/"> + CVS for the files.</a> + </p> + </subsection> + + <subsection name="Javascript Query Validator"> + <p> + Javascript library to support client-side query validation. Lucene doesn't like malformed queries and tends to + throw ParseException, which are often difficult to interpret and pass on to the user. This library hopes to + alleviate that problem. + </p> + <p> + + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryValidator/"> + CVS for files.</a> + </p> + </subsection> + + <subsection name="High Frequency Terms"> + <p> + The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines + what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go + into a custom stop word list for better search results. + </p> + <p> + + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/"> + CVS for miscellaneous classes.</a> + </p> + </subsection> -</section> + </section> -</body> + </body> </document> 1.18 +91 -94 jakarta-lucene/docs/lucene-sandbox/index.html Index: index.html =================================================================== RCS file: /home/cvs/jakarta-lucene/docs/lucene-sandbox/index.html,v retrieving revision 1.17 retrieving revision 1.18 diff -u -r1.17 -r1.18 --- index.html 9 Oct 2003 14:40:30 -0000 1.17 +++ index.html 15 Oct 2003 13:51:38 -0000 1.18 @@ -121,29 +121,37 @@ <tr><td> <blockquote> <p> -Lucene project also contains a workspace, Lucene Sandbox, that is open to all Lucene committers, as well -as a few other developers. The purpose of the Sandbox is to host various third party contributions, -and to serve as a place to try out new ideas and prepare them for inclusion into the core Lucene -distribution.<br /> -Users are free to experiment with the components developed in the Sandbox, but Sandbox components will -not necessarily be maintained, particularly in their current state. -</p> - <p> -You can access the Lucene Sandbox CVS repository at -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/">http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/</a>. -</p> + Lucene project also contains a workspace, Lucene Sandbox, that is open to all Lucene committers, as well + as a few other developers. The purpose of the Sandbox is to host various third party contributions, + and to serve as a place to try out new ideas and prepare them for inclusion into the core Lucene + distribution.<br /> + Users are free to experiment with the components developed in the Sandbox, but Sandbox components will + not necessarily be maintained, particularly in their current state. + </p> + <p> + You can access the Lucene Sandbox CVS repository at + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/">http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/</a>. + </p> <table border="0" cellspacing="0" cellpadding="2" width="100%"> <tr><td bgcolor="#828DA6"> <font color="#ffffff" face="arial,helvetica,sanserif"> - <a name="LARM"><strong>LARM</strong></a> + <a name="Snowball Stemmers for Lucene"><strong>Snowball Stemmers for Lucene</strong></a> </font> </td></tr> <tr><td> <blockquote> <p> -LARM is a web crawler optimized for large intranets with up to a couple of hundred hosts. -</p> - <a href="larm/overview.html">Technical Overview</a> + This project provides pre-compiled versions of the Snowball stemmers + for Lucene. + </p> + <p> + More information can be found + <a href="http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/">here</a>. + </p> + <p> + <a href="http://snowball.tartarus.org/">Background information on Snowball</a>, + which is a language for stemmers developed by Martin Porter. + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -151,23 +159,21 @@ <table border="0" cellspacing="0" cellpadding="2" width="100%"> <tr><td bgcolor="#828DA6"> <font color="#ffffff" face="arial,helvetica,sanserif"> - <a name="Snowball Stemmers for Lucene"><strong>Snowball Stemmers for Lucene</strong></a> + <a name="SearchBean"><strong>SearchBean</strong></a> </font> </td></tr> <tr><td> <blockquote> <p> -This project provides pre-compiled versions of the Snowball stemmers -for Lucene. -</p> - <p> -More information can be found -<a href="http://jakarta.apache.org/lucene/docs/lucene-sandbox/snowball/">here</a>. -</p> - <p> -<a href="http://snowball.tartarus.org/">Background information on Snowball</a>, -which is a language for stemmers developed by Martin Porter. -</p> + SearchBean is a UI component that can be used to browse through the results of a Lucene search. + The SearchBean searches the index for a given query string, retrieves the hits, and then brings + them into the HitsIterator class, which can be used for paging and sorting through search results. + + </p> + <p> + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchbean/">The + CVS repository for the SearchBean contribution.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -181,13 +187,13 @@ <tr><td> <blockquote> <p> -The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also -contains an example HTML parser that uses JTidy. -</p> - <p> -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The -CVS repository for the Ant contribution.</a> -</p> + The Ant project is a useful Ant task that creates a Lucene index out of an Ant fileset. It also + contains an example HTML parser that uses JTidy. + </p> + <p> + <a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/ant/">The + CVS repository for the Ant contribution.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -195,25 +201,27 @@ <table border="0" cellspacing="0" cellpadding="2" width="100%"> <tr><td bgcolor="#828DA6"> <font color="#ffffff" face="arial,helvetica,sanserif"> - <a name="SearchBean"><strong>SearchBean</strong></a> + <a name="WordNet/Synonyms"><strong>WordNet/Synonyms</strong></a> </font> </td></tr> <tr><td> <blockquote> <p> -SearchBean is a UI component that can be used to browse through the results of a Lucene search. -The SearchBean searches the index for a given query string, retrieves the hits, and then brings -them into the HitsIterator class, which can be used for paging and sorting through search results. - -</p> - <p> -<a href="http://cvs.apache.org/viewcvs/jakarta-lucene-sandbox/contributions/searchBean/">The -CVS repository for the SearchBean contribution.</a> -</p> - <p> -<a href="http://snowball.tartarus.org/">Background information on Snowball</a>, -which is a language for stemmers developed by Martin Porter. -</p> + The Lucene WordNet code consists of a single class which parses a prolog file + from the WordNet site that contains a list of English words and synonyms. + The class builds a Lucene index from the synonyms file. Your querying code could + hit this index to build up a set of synonyms for the terms in the + search query. + </p> + <p> + More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>. + <a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains + synonyms, definitions, and various relationships between synonym sets. + </p> + <p> + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/"> + CVS for the WordNet module.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -221,27 +229,21 @@ <table border="0" cellspacing="0" cellpadding="2" width="100%"> <tr><td bgcolor="#828DA6"> <font color="#ffffff" face="arial,helvetica,sanserif"> - <a name="WordNet/Synonyms"><strong>WordNet/Synonyms</strong></a> + <a name="SAX/DOM XML Indexing demo"><strong>SAX/DOM XML Indexing demo</strong></a> </font> </td></tr> <tr><td> <blockquote> <p> -The Lucene WordNet code consists of a single class which parses a prolog file -from the WordNet site that contains a list of English words and synonyms. -The class builds a Lucene index from the synonyms file. Your querying code could -hit this index to build up a set of synonyms for the terms in the -search query. -</p> - <p> -More information on the <a href="http://www.tropo.com/techno/java/lucene/wordnet.html">Lucene WordNet package</a>. -<a href="http://www.cogsci.princeton.edu/~wn/">WordNet</a> is an online database of English language words that contains -synonyms, definitions, and various relationships between synonym sets. -</p> - <p> -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/WordNet/"> -CVS for the WordNet module.</a> -</p> + This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates + a new Document object for each file, and then populates the Document with a Field for each XML element, recursively. + There are examples included for both SAX and DOM. + </p> + <p> + + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/"> + CVS for the XML Indexing Demo.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -249,21 +251,16 @@ <table border="0" cellspacing="0" cellpadding="2" width="100%"> <tr><td bgcolor="#828DA6"> <font color="#ffffff" face="arial,helvetica,sanserif"> - <a name="SAX/DOM XML Indexing demo"><strong>SAX/DOM XML Indexing demo</strong></a> + <a name="LARM"><strong>LARM</strong></a> </font> </td></tr> <tr><td> <blockquote> <p> -This contribution is some sample code that demonstrates adding simple XML documents into the index. It creates -a new Document object for each file, and then populates the Document with a Field for each XML element, recursively. -There are examples included for both SAX and DOM. -</p> - <p> - -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/XML-Indexing-Demo/"> -CVS for the XML Indexing Demo.</a> -</p> + LARM is a web crawler optimized for large intranets with up to a couple of hundred hosts. + </p> + <a href="larm/overview.html">Technical Overview</a> + <a href="http://larm.sourceforge.net/">LARM's home page on SourceForge</a> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -277,14 +274,14 @@ <tr><td> <blockquote> <p> -Javascript library to support client-side query-building. Provides support for a user interface similar to -<a href="http://www.google.com.sg/advanced_search">Google's Advanced Search</a>. -</p> + Javascript library to support client-side query-building. Provides support for a user interface similar to + <a href="http://www.google.com.sg/advanced_search">Google's Advanced Search</a>. + </p> <p> -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryConstructor/"> -CVS for the files.</a> -</p> + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryConstructor/"> + CVS for the files.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -298,15 +295,15 @@ <tr><td> <blockquote> <p> -Javascript library to support client-side query validation. Lucene doesn't like malformed queries and tends to -throw ParseException, which are often difficult to interpret and pass on to the user. This library hopes to -alleviate that problem. -</p> + Javascript library to support client-side query validation. Lucene doesn't like malformed queries and tends to + throw ParseException, which are often difficult to interpret and pass on to the user. This library hopes to + alleviate that problem. + </p> <p> -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryValidator/"> -CVS for files.</a> -</p> + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/javascript/queryValidator/"> + CVS for files.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr> @@ -320,15 +317,15 @@ <tr><td> <blockquote> <p> -The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines -what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go -into a custom stop word list for better search results. -</p> + The miscellaneous package is for classes that don't fit anywhere else. The only class in it right now determines + what terms occur the most inside a Lucene index. This could be useful for analyzing which terms may need to go + into a custom stop word list for better search results. + </p> <p> -<a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/"> -CVS for miscellaneous classes.</a> -</p> + <a href="http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/miscellaneous/src/java/org/apache/lucene/misc/"> + CVS for miscellaneous classes.</a> + </p> </blockquote> </td></tr> <tr><td><br/></td></tr>
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]