Hi Amit, Anyone can edit any Solr Wiki page - just create an account (I think the link to that is in the page footer) and edit.
Otis ---- Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch Lucene ecosystem search :: http://search-lucene.com/ ----- Original Message ---- > From: Amit Nithian <anith...@gmail.com> > To: solr-user@lucene.apache.org > Sent: Sat, July 31, 2010 4:41:44 PM > Subject: DIH, UTF8 and default DIH encoding value > > All, > > I am not sure if this is overly obvious or not (it wasn't to me) but in > trying to index some international characters from XML files using the DIH, > I found that setting the encoding attribute on the dataSource element to > "UTF-8" fixed my problem. > > <dataSource type="FileDataSource" encoding="UTF-8"/> > > My question is why the default isn't UTF-8 or if there is a good reason, can > the DIH wiki be made more clear that this encoding attribute can affect the > indexing of international characters? If I can get access to edit this wiki > page, I can add a section to that effect.. perhaps under a troubleshooting > section? > > Thanks! > Amit >