Dear fellow Java developers: I have a very basic question about indexing text with Lucene. I am indexing a large amount of text that includes names containing punctuation (e.g. "Jane Doe-Smith", "Sa'eed"). Will the punctuation throw off the indexer, so that it splits these names into separate tokens when it shouldn't? Or will the indexer treat punctuation inside a name like any other character, so that it does not hinder a user's ability to search for that name? Are there any precautions I should take to avoid problems?
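To make the concern concrete, here is a minimal sketch in plain Java. It does not use Lucene, and I am not claiming it reproduces what any Lucene analyzer does. The class and method names are made up for illustration. It only contrasts two hypothetical splitting rules, one that breaks on every non-alphanumeric character and one that breaks on whitespace only, to show the two outcomes I am asking about.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only: NOT Lucene code and NOT a model of any Lucene analyzer.
final class PunctuationTokenDemo {

    // Rule A (assumed): every character that is not a letter or digit ends the current token.
    static List<String> splitOnNonAlphanumerics(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    // Rule B (assumed): only whitespace ends a token, so inner punctuation is kept.
    static List<String> splitOnWhitespace(String text) {
        List<String> tokens = new ArrayList<>();
        for (String part : text.trim().split("\\s+")) {
            if (!part.isEmpty()) {
                tokens.add(part.toLowerCase(Locale.ROOT));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        String[] names = {"Jane Doe-Smith", "Sa'eed"};
        for (String name : names) {
            System.out.println(name + " | rule A: " + splitOnNonAlphanumerics(name)
                    + " | rule B: " + splitOnWhitespace(name));
        }
    }
}
```

Under rule A, "Sa'eed" becomes the two tokens "sa" and "eed", so a search for the whole name would have to be split the same way at query time to match. Under rule B it stays one token, "sa'eed". What I would like to know is which of these outcomes, if either, the analyzer I pick will produce.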
I hope this question is clear and makes sense. Thanks in advance to all who reply.