Try this in mysql: mysql> SHOW VARIABLES LIKE 'character_set%'; From: http://forums.mysql.com/read.php?45,362257,362257
>From the same page: do you have utf8 set in your 'my.cnf' (shouldn't effect JDBC but will effect the server)? > [client] > default-character-set = utf8 > [mysql] > default-character-set=utf8 > [mysqld] > character_set_server = utf8 > character_set_client = utf8 -Glen On Mon, Dec 27, 2010 at 6:15 PM, Mark <static.void....@gmail.com> wrote: > I tried both of those with no such luck. > > On 12/27/10 2:49 PM, Glen Newton wrote: >> >> 1 - Verify your mysql is set up using UTF-8 >> 2 - Does your JDBC connect string contain: >> useUnicode=true&characterEncoding=UTF-8 >> See: >> http://dev.mysql.com/doc/refman/5.0/en/connector-j-reference-charsets.html >> >> Glen >> http://zzzoot.blogspot.com/ >> >> On Mon, Dec 27, 2010 at 5:15 PM, Mark<static.void....@gmail.com> wrote: >>> >>> Solr: 1.4.1 >>> JDBC driver: Connector/J 5.1.14 >>> >>> Looks like its the JDBC driver because It doesn't even work with a simple >>> java program. I know this is a little off subject now, but do you have >>> any >>> clues? Thanks again >>> >>> >>> On 12/27/10 1:58 PM, Erick Erickson wrote: >>>> >>>> More data please. >>>> >>>> Which jdbc driver? Have you tried just printing out the results of using >>>> that >>>> driver in a simple Java program? >>>> >>>> Solr should handle UTF-8 just fine, but the servlet container may have >>>> to >>>> have some settings tweaked, which one of those are you using? >>>> >>>> What version of Solr? >>>> >>>> Best >>>> Erick >>>> >>>> On Mon, Dec 27, 2010 at 3:05 PM, Mark<static.void....@gmail.com> >>>> wrote: >>>> >>>>> Seems like I am missing some configuration when trying to use DIH to >>>>> import >>>>> documents with chinese characters. All the documents save crazy >>>>> nonsense >>>>> like "这是测试" instead of actual chinese characters. >>>>> >>>>> I think its at the JDBC level because if I hardcode one of the fields >>>>> within data-config.xml (using a template transformer) the characters >>>>> show >>>>> up >>>>> correctly. >>>>> >>>>> Any ideas? Thanks >>>>> >> >> > -- -