Re: [libreoffice-users] Problem with data conversion

2011-08-30 Thread James Wilde

On Aug 29, 2011, at 16:12 , webmaster for Kracked Press Productions wrote:

 
 Would you be willing to download/install 3.3.4 from 
 http://www.libreoffice.org/download/?
 I do not use the repository version, and I use Ubuntu 64bit 10.04.  You would 
 need to install in via the terminal, but 
 http://www.libreoffice.org/get-help/installation/linux/ will walk you through 
 this.  No need to uninstall anything either.

Not ready for that yet.
 
 I wonder if adding the language packs could help you?  You will need to go to 
 3.3.4, unless someone has archived the nl - Nederlands pack for 3.3.3.  I 
 do not understand Asian characters, so I do not know if there is a Mandarin 
 language pack listed.

I have the Swedish and Chinese language packs installed already.
 
 You could also try this Dutch dictionary and see it it works.
 http://libreoffice-na.us/English/add-on-dictionaries-large-list/Dutch_spelling_and_hyphenation_Dictionary--nl-dict-v2.00g2010-09-22.oxt

:)  I take it you're confusing this thread with another about a problem with 
the Dutch language.  I assume you mean Swedish.

 
 As for the characters showing in LibreOffice - I am taking a guess here - 
 could it be something to do with the default font you are using for LO's 
 Calc?  Format  Character should place you to the dialog to set your default 
 font.  I would make sure your font is set for the same one you use in the 
 other applications.  As someone who has worked with collecting fonts [over 
 100,000 of them over the years] I find that many do not have some of the 
 non-English characters that are common in other languages. I would make sure 
 you use a good Unicode font [like Arial Unicode] and see if the problem goes 
 away.

Good try but no cigar.  I always use the same font, and it has both the Swedish 
letters and Mandarin characters.

//James
-- 
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted



Re: [libreoffice-users] Problem with data conversion

2011-08-30 Thread James Wilde
A little more information:

On Aug 29, 2011, at 10:22 , James Wilde wrote:

 Can't think of a better subject line.
 
 This is about the Ubuntu repository version of LibO running under Ubuntu 
 11.04 linux.  The repository version is described as follows:
 
 LibreOffice 3.3.3
 OOO 330m 19 (Build:301)
 LibreOffice 3.3.3.1 Ubuntu package 1.3.3.3ubuntu2
 
 I have been testing Gnucash on linux, and in particular the ability to export 
 data to an html file and read that into other programs, specifically 
 LibreOffice Calc.  The whole process went excellently with one small 
 exception: the translation of the Swedish characters, å, ä and ö (and of 
 course the capitals, Å, Ä and Ö which I have not tested).

The html file quite clearly includes the following at the beginning:

meta http-equiv=content-type content=text-html; charset=utf-8

However, I have had the explanation that the conversion below is the result of 
LibO taking the input as ISO-8859-1 and converting it to UTF-8.  If someone in 
here can confirm this, I will write this up as a bug.  As mentioned below in 
the original mail, this does not occur on Windows, nor on the Mac. (I just 
tested it on the Mac).
 
 Instead of seeing each of the three letters, in each case I saw two 
 characters, Ã¥, for example instead of å.  I checked the three characters in 
 a hex editor, and found the following:
 
 å was represented by C3A5 in other applications, but as C383C2A5 in LibO
 ä was represented by C3A4 in other applications, but as C383C2A4 in LibO
 ö was represented by C3B6 in other applications, but as C383C2B6 in LibO
 
 I checked the original data in Gnucash, the intermediary form in the html 
 file and the contents of content.xml in LibO.  The first two had the short 
 form, LibO had the second form.
 
 I have also checked the same information in Windows, although I have not yet 
 used a hex editor in Windows.  Both Excel and LibO Calc imported the data 
 correctly, or rather, displayed it correctly.
 
 Before I report this as a bug, I'd like to be sure it's not something I'm 
 missing in my configuration.  All programs are set to use UTF-8, insofar as 
 one can set it in the program rather than in the operating system.
 
 A late thought occurs to me.  I use LibO Writer for writing Mandarin and have 
 set 'Enabled for Asian languages' under Languages.  Can this have affected 
 the way LibO translates the non-English characters?  Although LibO has 
 actually _added_ two bytes to the characters.
 
//James


-- 
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted


[libreoffice-users] Problem with data conversion

2011-08-29 Thread James Wilde
Can't think of a better subject line.

This is about the Ubuntu repository version of LibO running under Ubuntu 11.04 
linux.  The repository version is described as follows:

LibreOffice 3.3.3
OOO 330m 19 (Build:301)
LibreOffice 3.3.3.1 Ubuntu package 1.3.3.3ubuntu2

I have been testing Gnucash on linux, and in particular the ability to export 
data to an html file and read that into other programs, specifically 
LibreOffice Calc.  The whole process went excellently with one small exception: 
the translation of the Swedish characters, å, ä and ö (and of course the 
capitals, Å, Ä and Ö which I have not tested).

Instead of seeing each of the three letters, in each case I saw two characters, 
Ã¥, for example instead of å.  I checked the three characters in a hex editor, 
and found the following:

å was represented by C3A5 in other applications, but as C383C2A5 in LibO
ä was represented by C3A4 in other applications, but as C383C2A4 in LibO
ö was represented by C3B6 in other applications, but as C383C2B6 in LibO

I checked the original data in Gnucash, the intermediary form in the html file 
and the contents of content.xml in LibO.  The first two had the short form, 
LibO had the second form.

I have also checked the same information in Windows, although I have not yet 
used a hex editor in Windows.  Both Excel and LibO Calc imported the data 
correctly, or rather, displayed it correctly.

Before I report this as a bug, I'd like to be sure it's not something I'm 
missing in my configuration.  All programs are set to use UTF-8, insofar as one 
can set it in the program rather than in the operating system.

A late thought occurs to me.  I use LibO Writer for writing Mandarin and have 
set 'Enabled for Asian languages' under Languages.  Can this have affected the 
way LibO translates the non-English characters?  Although LibO has actually 
_added_ two bytes to the characters.

Regards

//James


-- 
For unsubscribe instructions e-mail to: users+h...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted