Stop words in Unicode/UTF8 format is not working with Lucene .Net 2.9.2 -----------------------------------------------------------------------
Key: LUCENENET-386 URL: https://issues.apache.org/jira/browse/LUCENENET-386 Project: Lucene.Net Issue Type: Wish Environment: Windows XP, Lucene .Net Reporter: Satish P Wani We are using Lucene .Net version 2.9.2. We tried giving Unicode/UTF8 stop words (words in Devnagari) using following 3 cases. All below cases are not working. Please let us know whether any one has tried this. // Case 1 string StopWordsFilePath="D:\\lucene\stopword.txt" FileInfo StopWordsList = new FileInfo(StopWordsFilePath); StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(Lucene.Net.Util.Version.LUCENE_CURRENT, StopWordsList); IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag); // Case 2 string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is UTF-8 string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.UTF8); StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords); IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag); // Case 3 string StopWordsFilePath="D:\\lucene\stopword.txt" // File type of this file is Unicode string[] stopWords = File.ReadAllLines(StopWordsFilePath, Encoding.Unicode); StandardAnalyzer _oStandardAnalyzer = new StandardAnalyzer(stopWords); IndexWriter indexWriter = new IndexWriter(IndexFolderPath, _oStandardAnalyzer, UpdateIndexFlag); -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.