Hi, Yuta-san,
>> Now I use own Analyzer which based on "MeCab" (It's open source
>> Japanese morphological analyzer).
>> I try to modify it to support surrogate pairs.
>>
>> And I'm expecting the next release!
Cool!
I look forward to that. Is there a link somewhere to your project? I am very
http://en.wikipedia.org/wiki/Delegation_pattern
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Message-
> From: Peter Keegan [mailto:peterlkee...@gmail.com]
> Sent: Thursday, March 11, 2010 9:41 PM
> To: java-user@lucen
Hi, I'm using a custom analyser based on standardanalyser with good
results to search artists (i.e rolling stones/beatles) but it fails to
match some weird artists names such as '!!!', this is not suprising
because the analyser ignores punctuation which is what I want it to
normally. I just won
Hi,
I'm using Lucene 2.4.1 with Hibernate Search 3.1.1. I have objects in the
index, for each field I index as follows:
@Fields( value = {
@Field(index = Index.TOKENIZED, store = Store.YES),
@Field(name = "name_forSort", index = Index.UN_TOKENIZED, store =
Store.NO),
})
T
Can you just lowercase a dedicated sort field and leave the others alone.
--
Ian.
On Fri, Mar 12, 2010 at 10:47 AM, Kev Kilroy wrote:
>
> Hi,
>
> I'm using Lucene 2.4.1 with Hibernate Search 3.1.1. I have objects in the
> index, for each field I index as follows:
>
> @Fields( value = {
>
Really, your app should not drop things on the floor and hope for the
best you should explicitly close your IRs when you're done with
them.
I think the relevant change here was the removal of finalizers, under
this issue:
http://issues.apache.org/jira/browse/LUCENE-1715
[Simple]FSDir's I
Hello there! We are indexing metadata for our medias. One ideia is that each
user adds its own metadata, so each document may have different
number/name/type of fields. Is this ok on Lucene? I mean, is Lucene ok with
the this relax approach.
Also, considering that each user may define its own meta
You are right. My test was faulty. I do get descriptor leak even with
SimpleFSDir. I guess I have some work to do.
Thanks!
Alexey
-Original Message-
From: Michael McCandless [mailto:luc...@mikemccandless.com]
Sent: Friday, March 12, 2010 5:13 PM
To: java-user@lucene.apache.org
Subject:
There's no requirement that all documents have the same
fields, Lucene is fine with different docs having different
fields.
There's no limit on the number of different fields allowed
that I know of, but I'm sure someone will chime in if there
is
HTH
Erick
On Fri, Mar 12, 2010 at 7:51 AM, Vin
You get memory problems if you turn on norms for all those fields (as norms are
large byte[] arrays per field). But this is not a hard limitation, but you
should take care.
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Me
There is some bottleneck when you have a large number of fields and of
words. Each field has its own list of terms which means that the
dictionary, in the worst case, could be of size n*m (with n the number
of fields, and m the number of terms).
This can lead to some overhead when looking up a t
Ok, thanks. I got stuck on trying to extend TopFieldDocCollector and didn't
notice it's also a TopDocsCollector. A couple of questions about Solr:
1. In Solr's DocSetDelegateCollector, a lot of code is duplicated. Why not
this:
public void collect(int doc) throws IOException {
collector.collect
Just to follow up on our previous discussion, here are a few runs in which we
have tested some of the Lucene different scoring mechanisms and other options.
We used Lucene's patches for LnbLtcSimilarity and BM25 and contrib module for
the SweetSpotSimilarity.
Lucene Default: 0.149
Lucene BM25:
Thanks Hoss for the useful info.
Acording the coord(q,d) definition it's calculated at document level. It's
said:
is a score factor based on how many of the query terms are found in the
specified document
If I am just searching for a term, "ipod" in this case, how would be coord
computed? Would i
I just want to report that the download of the zip sources from the old-archive
directory now works for me. I'm not sure what the problem was, but it's gone
now. Thanks to those who replied.
An Hong
-Original Message-
From: Scott Ribe [mailto:scott_r...@killerbytes.com]
Sent: Wednesd
15 matches
Mail list logo