[ 
https://issues.apache.org/jira/browse/JENA-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15926063#comment-15926063
 ] 

ASF GitHub Bot commented on JENA-1305:
--------------------------------------

Github user anujgandharv commented on a diff in the pull request:

    https://github.com/apache/jena/pull/227#discussion_r106152940
  
    --- Diff: jena-text/src/main/resources/text-config-es.ttl ---
    @@ -0,0 +1,65 @@
    +    # Licensed to the Apache Software Foundation (ASF) under one
    +    # or more contributor license agreements.  See the NOTICE file
    +    # distributed with this work for additional information
    +    # regarding copyright ownership.  The ASF licenses this file
    +    # to you under the Apache License, Version 2.0 (the
    +    # "License"); you may not use this file except in compliance
    +    # with the License.  You may obtain a copy of the License at
    +    #
    +    #     http://www.apache.org/licenses/LICENSE-2.0
    +    #
    +    # Unless required by applicable law or agreed to in writing, software
    +    # distributed under the License is distributed on an "AS IS" BASIS,
    +    # WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or 
implied.
    +    # See the License for the specific language governing permissions and
    +    # limitations under the License.
    +
    + ## Example of a TDB dataset and text index for ElasticSearch
    +
    +@prefix :        <http://localhost/jena_example/#> .
    +@prefix rdf:     <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
    +@prefix rdfs:    <http://www.w3.org/2000/01/rdf-schema#> .
    +@prefix tdb:     <http://jena.hpl.hp.com/2008/tdb#> .
    +@prefix ja:      <http://jena.hpl.hp.com/2005/11/Assembler#> .
    +@prefix text:    <http://jena.apache.org/text#> .
    +
    +# TDB
    +[] ja:loadClass "org.apache.jena.tdb.TDB" .
    +tdb:DatasetTDB  rdfs:subClassOf  ja:RDFDataset .
    +tdb:GraphTDB    rdfs:subClassOf  ja:Model .
    +
    +# Text
    +[] ja:loadClass "org.apache.jena.query.text.TextQuery" .
    +text:TextDataset      rdfs:subClassOf   ja:RDFDataset .
    +text:TextIndexES      rdfs:subClassOf   text:TextIndex .
    +
    +## ---------------------------------------------------------------
    +## This URI must be fixed - it's used to assemble the text dataset.
    +
    +:text_dataset rdf:type     text:TextDataset ;
    +    text:dataset   <#dataset> ;
    +    text:index     <#indexES> ;
    +    .
    +
    +<#dataset> rdf:type      tdb:DatasetTDB ;
    +    tdb:location "--mem--" ;
    +    .
    +
    +<#indexES> a text:TextIndexES ;
    +    text:serverList "127.0.0.1:9300" ; # A comma-separated list of 
Host:Port values of the ElasticSearch Cluster nodes.
    +    text:clusterName "elasticsearch" ; # Name of the ElasticSearch 
Cluster. If not specified defaults to 'elasticsearch'
    +    text:shards "1" ;                  # The number of shards for the 
index. Defaults to 1
    +    text:replicas "1" ;                # The number of replicas for the 
index. Defaults to 1
    +    text:indexName "jena-text" ;       # Name of the Index. defaults to 
jena-text
    +    text:multilingualSupport true ;
    +    text:entityMap <#entMap> ;
    +    .
    +
    +<#entMap> a text:EntityMap ;
    +    text:entityField      "intel" ; # Defines the Document Type in the ES 
Index
    --- End diff --
    
    I will change the value to "uri". In TextIndexES, I use the entity field 
name to define the name of the Document Type, thus the reason for the comment.


> Elastic Search Support for Apache Jena Text 
> --------------------------------------------
>
>                 Key: JENA-1305
>                 URL: https://issues.apache.org/jira/browse/JENA-1305
>             Project: Apache Jena
>          Issue Type: New Feature
>          Components: Text
>    Affects Versions: Jena 3.2.0
>            Reporter: Anuj Kumar
>            Assignee: Osma Suominen
>              Labels: elasticsearch
>   Original Estimate: 240h
>  Remaining Estimate: 240h
>
> This Jira tracks the development of Jena Text ElasticSearch Implementation.
> The goal is to extend Jena Text capability to index, at scale, in 
> ElasticSearch. This implementation would be similar to the Lucene and Solr 
> implementations.
> We will use ES version 5.2.1 for the implementation.
> The following functionalities would be supported:
> * Indexing Literal values
> * Updating indexed values
> * Deleting Indexed values
> * Custom Analyzer Support
> * Configuration using Assembler as well as Java techniques.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to