Re: Using Lucene for Semantic search

2006-07-20 Thread Chuck Williams
I have built such a system, although not with Lucene at the time. I doubt you need to modify anything in Lucene to achieve this. You may want to index words, stems and/or concepts from the ontology. Concepts from the ontology may relate to words or phrases. Lucene's token structure is flexible,

Re: Flexible index format / Payloads Cont'd

2006-07-20 Thread Marvin Humphrey
On Jul 19, 2006, at 10:26 AM, Nicolas Lalevée wrote: Then I looked deeper in the Lucene file format, and I manage to introduce some generic field metadata without breaking the file format compatibility. I just used another bit of the "Bits" to mark that there is or not some metadata on th

Re: Using Lucene for Semantic search

2006-07-20 Thread karl wettin
On Thu, 2006-07-20 at 14:19 -0400, Chris Wildgoose wrote: > I have been working with Lucene for some time, and I have an interest > in developing a Semantic Search solution. I was looking into extending > lucene for this. I know this would involve some significant > re-engineering of the indexing p

Using Lucene for Semantic search

2006-07-20 Thread Chris Wildgoose
I have been working with Lucene for some time, and I have an interest in developing a Semantic Search solution. I was looking into extending lucene for this. I know this would involve some significant re-engineering of the indexing procedure to support the ability to assign words to nodes within

Re: GData - Server, Indexing entries

2006-07-20 Thread Simon Willnauer
Hi Otis, On 7/20/06, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: Hi Simon, I have to admit that I'm not sure what exactly you are asking here. :) Well that is what I do ask myself sometimes. The Gdata Server is unfortunately a bit different to the rest of the lucene work. I just can discuss wh

[jira] Commented: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-07-20 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-550?page=comments#action_12422363 ] Karl Wettin commented on LUCENE-550: A comment on memory usage: about 2x a RAMDirectory (900MB and 1800MB) on a 150,000 document corpus (when the corpus term c

[jira] Commented: (LUCENE-550) InstanciatedIndex - faster but memory consuming index

2006-07-20 Thread Karl Wettin (JIRA)
[ http://issues.apache.org/jira/browse/LUCENE-550?page=comments#action_12422359 ] Karl Wettin commented on LUCENE-550: To make this index work flawless (I hope), remove the if-statement around the following row in InstatiatedIndexWriter (row