[ 
https://issues.apache.org/jira/browse/VXQUERY-32?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14378179#comment-14378179
 ] 

Steven Jacobs commented on VXQUERY-32:
--------------------------------------

The code that has been implemented so far is as follows:
1. There is a new syntax which will create a full Lucene index on a given 
collection's current state
2. There is a new syntax to run an arbitrary query on the index rather than the 
collection itself

This code is currently slightly out of sync with the codebase (but will not be 
very hard to reintegrate). After that the next steps would be to find ways of 
optimizing query creation and usage, For example, dynamically deciding 
when/where an index can/should be used for a query. You will need to 
familiarize yourself with the VXQuery codebase.

> VXQuery integration with Apache Lucene
> --------------------------------------
>
>                 Key: VXQUERY-32
>                 URL: https://issues.apache.org/jira/browse/VXQUERY-32
>             Project: VXQuery
>          Issue Type: Improvement
>            Reporter: Vinayak Borkar
>            Assignee: Preston Carman
>              Labels: gsoc, gsoc2015, java, mentor, xml, xquery
>
> XML documents contain textual content and the need to search large corpora of 
> documents is a fairly common task. The intent of this project is to leverage 
> Apache Lucene's indexing and search capability so that users of the VXQuery 
> engine can express and run text-search queries.
> This project will have two parts.
> 1. Design and implement the ability for users to create and manage text 
> indices on collections of XML documents.
> 2. Implement functions in VXQuery to exploit these text-indices to execute 
> relevant queries efficiently.
> As a starting point, the system does not need to do automatic index selection 
> (decide to use the text index when the query did not actually refer to an 
> index). Instead the functions would be used directly in the query and the 
> system would have to use the said indices.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to