On Wed, 2006-12-20 at 21:52 -0800, sanjeev wrote:
Hello,
My crawl index is not being created correctly using the new settings.
https://issues.apache.org/jira/browse/SOLR-88
Although the log shows no errors - I am not able to open using Luke,
it says index corrupt, access denied, invalid
...:-(
And no info in the logs about how many documents were indexed and all - the
logfile
pattern in 0.8.1 seems different from nutch 0.7.2 - am i right or wrong ?
please help as i'm going despo here ...
Thanks.
sanjeev.
--
View this message in context:
http://www.nabble.com/implement-thai-language-indexing
/
--
View this message in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html#a7827701
Sent from the Nutch - Dev mailing list archive at
Nabble.com.
__
Do You Yahoo!?
Tired of spam? Yahoo! Mail has the best
is
found for the identified
language, it is used to analyze the document.
Regards
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/
--
View this message in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html
sided if a suitable analyzer is found for the identified
language, it is used to analyze the document.
Regards
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/
--
View this message in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172
this message in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html#a7671727
Sent from the Nutch - Dev mailing list archive at Nabble.com.
i used an existing ThaiAnalyzer which was in lucene package.
ok - i renamed the lucene.analysis.th.* to nutch.analysis.th.* - compiled
and
placed all class files in a jar - analysis-th.jar (do i need to bundle the
ngp file in the jar as well ?)
1. You don't have to refactor the lucene analyzer.
in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html#a7375203
Sent from the Nutch - Dev mailing list archive at Nabble.com.
ok. I was able to enable the language identifier plugin by adding the
value
in plugin.includes attribute
in nutch-site.xml - but i'm not sure just by doing that I can have thai
text
recognized and tokenized
properly.
What else do I have to do ? Please help me.
1. You must create a thai NGP
--
View this message in context:
http://www.nabble.com/implement-thai-language-indexing-and-search-tf2641172.html#a7375925
Sent from the Nutch - Dev mailing list archive at Nabble.com.
10 matches
Mail list logo