We're running an index of 14M documents. For each document we have:
<field name="id" type="sint" indexed="true" stored="true" required="true" />
<field name="title" type="text_ngram" indexed="true" stored="true" omitNorms="true" />
field name=cat_id
Hello,
is there an existing FAQ or how-to on setting up the improved (and new?)
Chinese analyzer on Solr 1.4?
I'd appreciate any help you may provide on this.
Thanks,
We are trying SOLR 1.3 with the Paoding Chinese Analyzer, and after reindexing
the index size went from 1.5 GB to 2.7 GB.
Is that expected behavior?
Is there any switch or trick to avoid the index nearly doubling in size?
Koji Sekiguchi-2 wrote:
CharFilter can normalize (convert)
Is it correct to assume that using field compression will cause performance
issues if we decide to allow search over this field?
i.e.:
<field name="id" type="sint" indexed="true" stored="true" required="true" />
<field name="title" type="text" indexed="true" stored="true" omitNorms="true" />
wrote:
I can't tell what that analyzer does, but I'm guessing it uses n-grams?
Maybe consider trying https://issues.apache.org/jira/browse/LUCENE-1629
instead?
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
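If the analyzer does emit overlapping n-grams, as guessed above, the number of indexed terms per field grows quickly, which could be one reason for a larger index. The sketch below only illustrates that arithmetic with plain character n-grams; it is not how Paoding or the LUCENE-1629 analyzer actually tokenizes, and the function names are made up for this example.

```python
def char_unigrams(text: str) -> list[str]:
    """Split text into single characters (one token per character)."""
    return list(text)


def char_bigrams(text: str) -> list[str]:
    """Split text into overlapping two-character tokens."""
    return [text[i:i + 2] for i in range(len(text) - 1)]


def token_count_with_bigrams(text: str) -> int:
    """Tokens produced when both unigrams and bigrams are indexed."""
    return len(char_unigrams(text)) + len(char_bigrams(text))
```

For a field of n characters, unigrams alone give n tokens, while unigrams plus bigrams give 2n - 1, so the token count roughly doubles before any other index overhead is considered.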
- Original Message
From: Fer-Bj fernando.b
Here is what we have:
for all the documents we have a field called small_body, which is a text
field of at most 60 characters where we store the abstract for each article.
We have about 8,000,000 documents indexed, and we usually display this
small_body on our listing pages.
For each listing page we
?
On Tuesday, June 2, 2009, Fer-Bj fernando.b...@gmail.com wrote:
I'm sending 3 files:
- schema.xml
- solrconfig.xml
- error.txt (with the error description)
I can now confirm that this error is caused by characters that are invalid in
XML (ASCII 0 or 11).
However, this problem now
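One common way to deal with invalid XML characters like the ones mentioned above (ASCII 0 or 11) is to strip them from each field value before building the update XML. The following is a minimal sketch of that idea, not something taken from the attached files; the function name is hypothetical, and the character ranges follow the XML 1.0 definition of a legal character.

```python
import re

# Characters that XML 1.0 does not allow in a document:
# control characters other than tab (0x09), LF (0x0A) and CR (0x0D),
# lone surrogates, and the noncharacters U+FFFE and U+FFFF.
_INVALID_XML_CHARS = re.compile(
    r"[\x00-\x08\x0b\x0c\x0e-\x1f\ud800-\udfff\ufffe\uffff]"
)


def strip_invalid_xml_chars(text: str) -> str:
    """Return text with characters that XML 1.0 forbids removed."""
    return _INVALID_XML_CHARS.sub("", text)
```

Applying this to every field value before it is placed in the document sent to the update handler should keep ASCII 0 and 11 out of the request, while leaving tabs, newlines, and Chinese text untouched.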
how to, but I wouldn't think it would be much different from
1.2
-Grant
On May 31, 2009, at 10:31 PM, Fer-Bj wrote:
Hello,
is there an existing how-to for getting SOLR 1.3 up and
running
for a Chinese-language website?
Currently our site is using SOLR 1.2, and we tried
Hello,
is there an existing how-to for getting SOLR 1.3 up and running
for a Chinese-language website?
Currently our site is using SOLR 1.2, and we tried to move to 1.3, but we
couldn't complete our reindex, as it seems 1.3 is stricter when it
comes to special characters.
I would