#559: BibUpload: Cannot bibupload file containing UTF-8 chars
------------------------+------------------
  Reporter:  grfavre    |      Owner:
      Type:  defect     |     Status:  new
  Priority:  critical   |  Milestone:  v1.0
 Component:  BibUpload  |    Version:
Resolution:             |   Keywords:
------------------------+------------------

Comment (by simko):

 Replying to [comment:3 grfavre]:
 > Maybe the error could come from my database ? I migrated mine from
 cdsware 0.5 to 0.7 3 years ago and to invenio 1.0 using the provided sql
 commands.  Is there a way to check this?

 1) First, you may want to check if your database runs in UTF-8 mode.
 Here is an example:

 {{{
 $ /opt/invenio/bin/inveniocfg --detect-system-details
 >>> Going to detect system details...
 * Hostname: pcuds33
 * Invenio version: 1.0.0-rc0.145-6639
 * Python version: 2.6.6 (r266:84292, Dec 27 2010, 00:02:40)  [GCC 4.4.5]
 * Apache version: Apache/2.2.17 (Debian) [/usr/sbin/apache2]
 * MySQLdb version: 1.2.2
 * MySQL version:
     - version: 5.1.49-3
     - character_set_client: utf8
     - character_set_connection: utf8
     - character_set_database: utf8
     - character_set_results: utf8
     - character_set_server: latin1
     - character_set_system: utf8
     - collation_connection: utf8_general_ci
     - collation_database: utf8_general_ci
     - collation_server: latin1_swedish_ci
 >>> System details detected successfully.
 }}}

 (The Latin-1 bits are OK here, as far as the Invenio database and the
 client connections are in UTF-8.)

 2) If OK, then maybe you were affected by Latin-1 to UTF-8
 transformations during your upgrades?  Did you convert your table
 content from Latin-1 to UTF-8 along the way?  (Briefly speaking, I did
 this by mysqldump'ing old data in Latin-1 charset, creating new tables
 with proper UTF-8 default encoding, and loading the dump, and MySQL
 would convert the dump into proper UTF-8.)

 You can run some SELECT statements on your bibxxx tables in a UTF-8
 capable terminal in order to see if your accents are properly stored
 as UTF-8, and not as mangled UTF-8 or something.

-- 
Ticket URL: <http://invenio-software.org/ticket/559#comment:6>
Invenio <http://invenio-software.org>

Reply via email to