2011 10:11
To: solr-user@lucene.apache.org
Subject: Re: Using MLT feature
Couldn't you extend the TextProfileSignature and modify the TokenComparator
class to use lexical order when token have the same frequency ?
Ludovic.
2011/4/8 Frederico Azeiteiro [via Lucene] <
ml-node+2794604-1
de=2794604&i=0&by-user=t>]
>
> Sent: sexta-feira, 8 de Abril de 2011 09:49
> To: [hidden
> email]<http://user/SendEmail.jtp?type=node&node=2794604&i=1&by-user=t>
> Subject: Re: Using MLT feature
>
> It seems that tokens are sorted by frequencies :
gt; Thank you for all your help,
> Frederico
>
>
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Using-MLT-feature-tp2774454p2794585.h
tml
Sent from the Solr - User mailing list archive at Nabble.com.
for all your help,
> Frederico
>
>
-
Jouve
France.
--
View this message in context:
http://lucene.472066.n3.nabble.com/Using-MLT-feature-tp2774454p2794585.html
Sent from the Solr - User mailing list archive at Nabble.com.
okens by some hashmap internal sort method that I can't understand :), and so,
impossible to copy to C# implementation.
Thank you for all your help,
Frederico
-Original Message-
From: Lance Norskog [mailto:goks...@gmail.com]
Sent: quinta-feira, 7 de Abril de 2011 04:09
To: solr-use
my apps (Java and C#) return the same signature but SOLR returns a
> different one..
> Can anyone understand what I should be doing wrong?
>
> Thank you once again.
>
> Frederico
>
> -Original Message-
> From: Markus Jelsma [mailto:markus.jel...@openindex.io]
>
Jelsma [mailto:markus.jel...@openindex.io]
Sent: terça-feira, 5 de Abril de 2011 15:20
To: solr-user@lucene.apache.org
Cc: Frederico Azeiteiro
Subject: Re: Using MLT feature
If you check the code for TextProfileSignature [1] your'll notice the init
method reading params. You can set those pa
5
>
> On the processor tag.
>
> Best regards,
> Frederico
>
>
> -Original Message-
> From: Markus Jelsma [mailto:markus.jel...@openindex.io]
> Sent: terça-feira, 5 de Abril de 2011 12:01
> To: solr-user@lucene.apache.org
> Cc: Frederico Azeiteiro
> S
essor tag.
Best regards,
Frederico
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: terça-feira, 5 de Abril de 2011 12:01
To: solr-user@lucene.apache.org
Cc: Frederico Azeiteiro
Subject: Re: Using MLT feature
On Tuesday 05 April 2011 12:19:33 Fred
k you,
> Frederico
>
>
> -----Original Message-
> From: Markus Jelsma [mailto:markus.jel...@openindex.io]
> Sent: segunda-feira, 4 de Abril de 2011 16:47
> To: solr-user@lucene.apache.org
> Cc: Frederico Azeiteiro
> Subject: Re: Using MLT feature
>
> > Hi
at these parameters can help creating the same sig for
the above example?
Is anyone using the TextProfileSignature with success?
Thank you,
Frederico
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: segunda-feira, 4 de Abril de 2011 16:47
To: solr-user@
--Original Message-
> From: Frederico Azeiteiro [mailto:frederico.azeite...@cision.com]
> Sent: segunda-feira, 4 de Abril de 2011 11:59
> To: solr-user@lucene.apache.org
> Subject: RE: Using MLT feature
>
> Thank you Markus it looks great.
>
Azeiteiro [mailto:frederico.azeite...@cision.com]
Sent: segunda-feira, 4 de Abril de 2011 11:59
To: solr-user@lucene.apache.org
Subject: RE: Using MLT feature
Thank you Markus it looks great.
But the wiki is not very detailed on this.
Do you mean if I:
1. Create:
true
false
t: segunda-feira, 4 de Abril de 2011 10:48
To: solr-user@lucene.apache.org
Subject: Re: Using MLT feature
http://wiki.apache.org/solr/Deduplication
On Monday 04 April 2011 11:34:52 Frederico Azeiteiro wrote:
> Hi,
>
> The ideia is don't index if something similar (headline+bodyte
in a temp index)
> and then use the MLT feature to find similar docs before adding to final
> index?
>
> Thanks,
> Frederico
>
>
> -Original Message-
> From: Chris Fauerbach [mailto:chris.fauerb...@gmail.com]
> Sent: segunda-feira, 4 de Abril de 2011 10:22
>
ssage-
From: Chris Fauerbach [mailto:chris.fauerb...@gmail.com]
Sent: segunda-feira, 4 de Abril de 2011 10:22
To: solr-user@lucene.apache.org
Subject: Re: Using MLT feature
Do you want to not index if something similar? Or don't index if exact.
I would look into a hash code of the docum
Do you want to not index if something similar? Or don't index if exact. I
would look into a hash code of the document if you don't want to index exact.
Similar though, I think has to be based off a document in the index.
On Apr 4, 2011, at 5:16, Frederico Azeiteiro
wrote:
> Hi,
>
>
Hi,
I would like to hear your opinion about the MLT feature and if it's a
good solution to what I need to implement.
My index has fields like: headline, body and medianame.
What I need to do is, before adding a new doc, verify if a similar doc
exists for this media.
My idea is to use t
18 matches
Mail list logo