text-query.mdtext

ajs6f Sun, 21 May 2017 06:39:08 -0700

Author: ajs6f
Date: Sun May 21 13:38:37 2017
New Revision: 1795700

URL: http://svn.apache.org/viewvc?rev=1795700&view=rev
Log:
Adding docs for Elasticsearch text indexes


Modified:
    jena/site/trunk/content/documentation/query/text-query.mdtext

Modified: jena/site/trunk/content/documentation/query/text-query.mdtext
URL: 
http://svn.apache.org/viewvc/jena/site/trunk/content/documentation/query/text-query.mdtext?rev=1795700&r1=1795699&r2=1795700&view=diff
==============================================================================
--- jena/site/trunk/content/documentation/query/text-query.mdtext (original)
+++ jena/site/trunk/content/documentation/query/text-query.mdtext Sun May 21 
13:38:37 2017
@@ -7,7 +7,7 @@ SPARQL queries. Text indexes are additio
 accessing the RDF graph.
 
 The text index can be either [Apache Lucene](http://lucene.apache.org/core) 
for a
-same-machine text index, or [Apache Solr](http://lucene.apache.org/solr/)
+same-machine text index, or [Elasticsearch](https://www.elastic.co/)
 for a large scale enterprise search application.
 
 Some example code is [available 
here](https://github.com/apache/jena/tree/master/jena-text/src/main/java/examples/).
@@ -55,7 +55,7 @@ external content with only additional RD
 The text index uses the native query language of the index:
 [Lucene query 
format](http://lucene.apache.org/core/4_1_0/queryparser/org/apache/lucene/queryparser/classic/package-summary.html#package_description)
 or
-[Solr query format](http://wiki.apache.org/solr/SolrQuerySyntax).
+[Elasticsearch query 
format](https://www.elastic.co/guide/en/elasticsearch/reference/5.2/query-dsl.html).
 
 A text-supporting dataset is configured with a description of which
 properties work with.  When data is added, any properties matching the
@@ -84,7 +84,7 @@ The maintenance of the index is external
 
 ### External applications
 
-By using Solr, in either pattern A (RDF data indexed) or pattern B
+By using Elasticsearch, in either pattern A (RDF data indexed) or pattern B
 (external content indexed), other applications can share the
 text index with SPARQL search.
 
@@ -159,11 +159,11 @@ The usual way to describe an index is wi
 [Jena assembler description](../assembler/index.html).  Configurations can
 also be built with code. The assembler describes a 'text
 dataset' which has an underlying RDF dataset and a text index. The text
-index describes the text index technology (Lucene or Solr) and the details
+index describes the text index technology (Lucene or Elasticsearch) and the 
details
 needed for for each.
 
 A text index has an "entity map" which defines the properties to
-index, the name of the lucene/solr field and field used for storing the URI
+index, the name of the Lucene/Elasticsearch field and field used for storing 
the URI
 itself.
 
 For common RDF use, there will be one field, mapping a property to a text
@@ -196,8 +196,8 @@ The following is an example of a TDB dat
     text:TextDataset      rdfs:subClassOf   ja:RDFDataset .
     # Lucene index
     text:TextIndexLucene  rdfs:subClassOf   text:TextIndex .
-    # Solr index
-    text:TextIndexSolr    rdfs:subClassOf   text:TextIndex .
+    # Elasticsearch index
+    text:TextIndexES    rdfs:subClassOf   text:TextIndex .
 
     ## ---------------------------------------------------------------
     ## This URI must be fixed - it's used to assemble the text dataset.
@@ -244,10 +244,9 @@ needs to identify the text dataset by it
 ### Configuring an Analyzer
 
 Text to be indexed is passed through a text analyzer that divides it into 
tokens 
-and may perform other transformations such as eliminating stop words. If a 
Solr text
-index is used, the analyzer used is determined by the Solr configuration.
-If a Lucene text index is used, then by default a `StandardAnalyzer` is used. 
However, 
-it can be replaced by another analyzer with the `text:analyzer` property. 
+and may perform other transformations such as eliminating stop words. If a 
Lucene
+or Elasticsearch text index is used, then by default a `StandardAnalyzer` is 
used.
+However, it can be replaced by another analyzer with the `text:analyzer` 
property. 
 For example with a `SimpleAnalyzer`:   
 
     <#indexLucene> a text:TextIndexLucene ;
@@ -357,6 +356,8 @@ the `TextIndexLucene` resource:
         text:queryParser text:AnalyzingQueryParser .
 
 
+Elasticsearch currently doesn't support Analyzers beyond Standard Analyzer. 
+
 ### Configuration by Code
 
 A text dataset can also be constructed in code as might be done for a
@@ -733,4 +734,14 @@ then use the following maven dependency:
     </dependency>
 
 adjusting the version <code>X.Y.Z</code> as necessary.  This will automatically
-include a compatible version of Lucene and the Solr java client, but not Solr 
server.
\ No newline at end of file
+include a compatible version of Lucene.
+
+For Elasticsearch implementation, you can include the following Maven 
Dependency:
+
+    <dependency>
+      <groupId>org.apache.jena</groupId>
+      <artifactId>jena-text-es</artifactId>
+      <version>X.Y.Z</version>
+    </dependency>
+
+adjusting the version <code>X.Y.Z</code> as necessary.
\ No newline at end of file

svn commit: r1795700 - /jena/site/trunk/content/documentation/query/text-query.mdtext

Reply via email to