Re: Rebuilding Document from index?

Mathieu Lecarme Tue, 26 Feb 2008 13:18:49 -0800

Yes, I've found a tester!
A patch was submited for this kind of job :
https://issues.apache.org/jira/browse/LUCENE-1190


And here is the svn work in progress :
https://admin.garambrogne.net/subversion/revuedepresse/trunk/src/java/lexicon

And the web version :
https://admin.garambrogne.net/projets/revuedepresse/browser/trunk/src/java/lexicon


Le 26 févr. 08 à 17:33, Itamar Syn-Hershko a écrit :


Implementing something like MoreLikeThis for Hebrew. Non-Hebrew

implementations are relevant, but much less accurate since a wordlike PURIMcan show up in the actual document with initials (LPURIM, BPURIMetc.) oreven with 1-4 letters after it which all reffer to the same term,and then

the score it will get upon analyzing using the current MoreLikeThis
implementation will not reflect its real importance.

I'm still trying to engineer the best possible solution for Lucenewith

Hebrew, right now my path is NOT using a stemmer by default, only by
explicit request of the user. MoreLikeThis would only return relevant
results if I will use a non-stemmed scoring and lookup.

Itamar.

-----Original Message-----
From: Erick Erickson [mailto:[EMAIL PROTECTED]
Sent: Tuesday, February 26, 2008 4:29 PM
To: java-user@lucene.apache.org
Subject: Re: Rebuilding Document from index?

See TermDocs/TermEnum. Or perhaps TermFreqVector. I admit I haven'tused

that last, but that family of methods ought to fix you up.

What problem are you trying to solve? Perhaps there are bettersolutions to

suggest....

Best
Erick

On Mon, Feb 25, 2008 at 6:04 PM, Itamar Syn-Hershko <[EMAIL PROTECTED]>

wrote:

Hello again,

If I wanted to do this programmatically, how would I do this(retrieve

a list of all terms in a field for a specific document - better if it
was in alphabettic order and with frequency data)?

Thanks,

Itamar.

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Friday, February 22, 2008 3:22 PM
To: java-user@lucene.apache.org
Subject: RE: Rebuilding Document from index?

You can use Luke to rebuild the document. It will show you the terms
of the analyzed document, not the original content.
And this is what you want, if I understood you correctly.

-----Original Message-----
From: Itamar Syn-Hershko [mailto:[EMAIL PROTECTED]
Sent: Freitag, 22. Februar 2008 14:02
To: java-user@lucene.apache.org
Subject: Rebuilding Document from index?

Hi,

Is it possible to re-create a document from an index, if its not
stored?
What I'm looking for is a way to have a text document with the text
AFTER it was analyzed, so I can see how my analyzer handles certain
cases. So that means I don't care if I will not get the original
document. I want to see the document as the index knows it.

Thanks in advance,

Itamar.



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]





---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]




---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Rebuilding Document from index?

Reply via email to