Re: Use the lucene for searching in the Semantic Web.

2006-01-17 Thread xing jiang
Hi, I think one problem with the existing method is that, to query RDF files or similar structures, we have to form SQL-like queries. However, to search text files, we only need to type a few keywords. Can we combine the two methods, and if so, how? For

Re: Use the lucene for searching in the Semantic Web.

2006-01-18 Thread xing jiang
Hi, I have surveyed some work on information retrieval on the Semantic Web (I may have missed many papers; most of the papers I used were published at recent WWW and CIKM conferences :). 1. A typical way of using the ontology is to select exact terms from the domain ontology to form queries. The first

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread xing jiang
Hi Mathias, Can you give more details? Is your application for text + ontology, or ontology only? regards jiang xing On 1/19/06, Mathias Lux [EMAIL PROTECTED] wrote: Hi! (1) I'm working on a similar problem, but based on MPEG-7 Semantic Description Graphs. I've already a prototype for

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread xing jiang
of the ontology best? The selection of concepts / classes / node types (whatever :) depends on the use case. Hope this helps a bit, mathias -----Original Message----- From: xing jiang [mailto:[EMAIL PROTECTED] Sent: Thursday, 19 January 2006 12:14 To: java-user@lucene.apache.org

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread xing jiang
On 1/19/06, Mathias Lux [EMAIL PROTECTED] wrote: -----Original Message----- From: xing jiang [mailto:[EMAIL PROTECTED] Sent: Thursday, 19 January 2006 13:11 To: java-user@lucene.apache.org Subject: Re: Use the lucene for searching in the Semantic Web. Hi, I am

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread xing jiang
On 1/19/06, Mathias Lux [EMAIL PROTECTED] wrote: Actually, my problem is that, for instance, for a document d, its feature vector may contain both keywords and concepts. I don't know how to weight the two kinds of items. Right now I use a naive method: given a document d, I can obtain a rank D

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread xing jiang
On 1/20/06, Klaus [EMAIL PROTECTED] wrote: Hi, Actually, my problem is that, for instance, for a document d, its feature vector may contain both keywords and concepts. What exactly do you mean by feature vector? You are referring to the predicate-object pairs connected to one subject node, don't

How does the lucene normalize the score?

2006-01-26 Thread xing jiang
Hi, I want to know how Lucene normalizes the score. I see that the Hits class has a function to get each document's score, but I don't know how Lucene calculates the normalized score. Lucene in Action only says that it is the normalized score of the nth top-scoring document. -- Regards Jiang Xing

Re: How does the lucene normalize the score?

2006-01-27 Thread xing jiang
/ topDocs.getMaxScore(); } int end = scoreDocs.length < length ? scoreDocs.length : length; for (int i = hitDocs.size(); i < end; i++) { hitDocs.addElement(new HitDoc(scoreDocs[i].score * scoreNorm, scoreDocs[i].doc)); } On 1/27/06, xing jiang
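The fragment quoted above suggests that each raw score is multiplied by a factor derived from the maximum score. A minimal standalone sketch of that idea follows. It does not use Lucene; the class name `ScoreNormSketch` and the method `normalize` are hypothetical, and the guard that rescales only when the top score exceeds 1.0 is an assumption, since the quoted fragment begins after that condition.

```java
final class ScoreNormSketch {

    /**
     * Scales raw scores by 1 / maxScore so the best hit becomes 1.0.
     * Assumption: scores are rescaled only when the maximum exceeds 1.0;
     * otherwise they are returned unchanged.
     */
    static float[] normalize(float[] rawScores) {
        float maxScore = 0.0f;
        for (float score : rawScores) {
            maxScore = Math.max(maxScore, score);
        }

        float scoreNorm = 1.0f;
        if (rawScores.length > 0 && maxScore > 1.0f) {
            scoreNorm = 1.0f / maxScore;
        }

        float[] normalized = new float[rawScores.length];
        for (int i = 0; i < rawScores.length; i++) {
            normalized[i] = rawScores[i] * scoreNorm;
        }
        return normalized;
    }
}
```

Under this assumption the normalized values depend on the top hit of each query, so they would be comparable within one result list but not across different queries.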

Re: Related searches

2006-01-31 Thread xing jiang
I think you should build a domain-specific dictionary first. It should state, for instance, that automobile = car. This approach can satisfy your requirement. On 1/30/06, Leon Chaddock [EMAIL PROTECTED] wrote: Hi, Does anyone know if it is possible to show related searches with lucene, for

Re: Related searches

2006-02-01 Thread xing jiang
Hi, I have a question about doing the related search. For instance, suppose I want to say Support Vector Machine == SVM. How can I use this information when retrieving documents? I don't think it can be added in the Synonym Filter. On 2/1/06, Dave Kor [EMAIL PROTECTED] wrote: On 1/30/06, Leon
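One possible way to use an equivalence such as Support Vector Machine == SVM, sketched here without Lucene, is to expand the query before it is parsed rather than at the token level. Everything in this sketch is hypothetical: the class `RelatedTermsSketch`, its methods, and the tiny hard-coded dictionary (which reuses the automobile = car example from the thread). The output is a string of quoted phrases joined with `OR`, on the assumption that the query parser in use accepts that syntax.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

final class RelatedTermsSketch {

    // Hypothetical domain dictionary: each inner list is one group of equivalent phrases.
    private static final List<List<String>> EQUIVALENTS = Arrays.asList(
            Arrays.asList("support vector machine", "svm"),
            Arrays.asList("automobile", "car"));

    /** Returns every phrase equivalent to the query, or just the query if none is known. */
    static List<String> expand(String query) {
        String key = query.trim().toLowerCase(Locale.ROOT);
        for (List<String> group : EQUIVALENTS) {
            if (group.contains(key)) {
                return group;
            }
        }
        return Collections.singletonList(key);
    }

    /** Joins the expanded phrases as quoted alternatives separated by OR. */
    static String toQueryString(String query) {
        StringBuilder sb = new StringBuilder();
        for (String variant : expand(query)) {
            if (sb.length() > 0) {
                sb.append(" OR ");
            }
            sb.append('"').append(variant).append('"');
        }
        return sb.toString();
    }
}
```

With this dictionary, `RelatedTermsSketch.toQueryString("SVM")` yields `"support vector machine" OR "svm"`. Expanding on the query side sidesteps the multi-word problem raised above, at the cost of matching only whole queries that appear in the dictionary.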

Two problems of lucene.

2006-02-04 Thread xing jiang
Hi, I have two questions about Lucene. 1. How does Lucene calculate each term's weight in the query? Is it a simple boolean value? 2. Can I change the similarity measure in Lucene? For instance, I would use only the term frequency instead of the tf/idf value to give weight to each term in the

two problems of using the lucene.

2006-02-04 Thread xing jiang
Hi, I have two questions about using Lucene and may need your help. 1. For each word, how does Lucene calculate its weight? I only know that each word in the document is weighted by its tf/idf value. 2. Can I modify Lucene so that I use the term frequency instead of the tf/idf value to
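To make the difference between the two weightings concrete, here is a small standalone sketch that compares a tf/idf-style weight with a tf-only weight. It does not call Lucene. The class `TermWeightSketch` and its methods are hypothetical, and the specific formulas (square-root damping of the term frequency, and an idf of ln(numDocs / (docFreq + 1)) + 1) are assumptions chosen for illustration, not a statement of what any particular Lucene version computes.

```java
final class TermWeightSketch {

    /** Assumed tf/idf-style weight: sqrt(freq) * (ln(numDocs / (docFreq + 1)) + 1). */
    static double tfIdfWeight(int freq, int docFreq, int numDocs) {
        double tf = Math.sqrt(freq);
        double idf = Math.log(numDocs / (double) (docFreq + 1)) + 1.0;
        return tf * idf;
    }

    /** tf-only weight: the same damped term frequency with the idf factor fixed at 1. */
    static double tfOnlyWeight(int freq) {
        return Math.sqrt(freq);
    }
}
```

In this sketch, switching to term frequency only amounts to holding the idf factor constant at 1, so a rare term and a common term with the same in-document frequency receive the same weight.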