hi
thanks for your reply. Please suggest me what to do now.
i want to index the document which contains multiple languages.
I really waiting for this to complete with your help.
Please,please help me
Erick Erickson wrote:
>
> I'm puzzled why you say
>
> "By the above out put we can say that
I'm puzzled why you say
"By the above out put we can say that StandardAnalyzer is
enough to get rid of danish elements."
It does NOT get rid of the accents, according to your own output.
If your goal is to go ahead and index multiple language documents
in a single index then search it, I'd recom
Hi Thanks for your reply.
After gone threw with the site which you given... i understood that
StandardAnalyzer is enough to handle these special characters.
i'm attaching one class called AnalysisDemo.java. By executing that class
i'm able to say the above sentance(i.e StandardAnalyzer is enough
OK, this is a much different problem than you were originally
asking about, effectively "how to index/search mixed language
documents".
This topic has been discussed multiple times on the user list, I
think your first step should be to search the archive. I *was*
going to find the old searchable m
HI
Here are the details about my goals.
1. I want to use this lucene for mixed languages.
2. I want to make indexes of the documents which are either english or
danish etc.
I'm attaching my IndexFiles.java file.
When i'm searching i'm giving the index path location as well as doucmets
folder.
HI
Here are the details about my goals.
1. I want to use this lucene for mixed languages.
2. I want to make indexes of the documents which are either english or
danish etc.
I'm attaching my IndexFiles.java file.
When i'm searching i'm giving the index path location as well as doucmets
folder.
Are you *also* using the DutchAnalyzer for your *query*?
Please show us the index and search code (simplified as much
as possible), then we'll be able to provide better suggestions.
Also, tell us a bit more about your goals here. Is this an
index entirely of Dutch documents? Or is it a mixed-lang
Hi
Thanks for your reply.
I'm able to see the DutchAnalyzer.
When i'm indexing my documents i given instace of DutchAnalyzer as an
argument to IndexWriter Class.
After this when i search for the
http://www.nabble.com/file/p23170710/IndexFiles.java IndexFiles.java
contains the danish elements
Take a look at DutchAnalyzer. The problem you'll have is if you're indexing
this document along with a bunch of documents from other languages.
You could search the mail archive for extensive discussions of indexing/
searching documents from several languages.
Best
Erick
On Tue, Apr 21, 2009 at