[fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Henrik Genssen
Hi,

I am using DOM and XMLRead units for parsing XML files.
The XML file is UTF-8. The text-nodes in the file contain german umlaute.
The file is valid XML!

When it try to get the textnodes containing german umlaute, I get an empty 
string, although
the length of the string seems to be correct.

I uses Lazarus from trunk this morning, FPC 2.2.4.
I even tried fcl-xml from FPC svn trunk with no changes...

-- Hinnack
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Valdas Jankūnas

Henrik Genssen rašė:

Hi,

I am using DOM and XMLRead units for parsing XML files.
The XML file is UTF-8. The text-nodes in the file contain german umlaute.
The file is valid XML!

When it try to get the textnodes containing german umlaute, I get an empty 
string, although
the length of the string seems to be correct.

I uses Lazarus from trunk this morning, FPC 2.2.4.
I even tried fcl-xml from FPC svn trunk with no changes...

-- Hinnack


 Maybe this helps you:

http://news.gmane.org/find-root.php?message_id=%3c4734B336.1080806%40erdves.lt%3e

 I solved this problem by using UTF8Decode/UTF8Encode routines to 
convert between UTF8 - Widestring.


--
  Valdas Jankūnas
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread JoshyFun
Hello FPC-Pascal,

Monday, July 6, 2009, 8:13:56 AM, you wrote:

HG I am using DOM and XMLRead units for parsing XML files.
HG The XML file is UTF-8. The text-nodes in the file contain german umlaute.
HG The file is valid XML!

I do not work with XML DOM but if the file is UTF-8 and the text node
contains an ANSI german umlaute the XML is not valid as the text is
not UTF-8 conformant.

fpc UTF-8 decoder clear any UTF-8 string with wrong encode (I think
trunk one replace wrong encodes with '?' mark).

-- 
Best regards,
 JoshyFun

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Henrik Genssen
As I said, the file is valid XML.

It parses well in:
- Firefox
- IE
- libxml
- msxml

regards

Hinnack

reply to message:
date: 06.07.2009 13:17:05
from: JoshyFun joshy...@gmail.com
to: FPC-Pascal users discussions fpc-pascal@lists.freepascal.org
subject: Re: [fpc-pascal] XML with lazarus UTF8 problem

Hello FPC-Pascal,

Monday, July 6, 2009, 8:13:56 AM, you wrote:

HG I am using DOM and XMLRead units for parsing XML files.
HG The XML file is UTF-8. The text-nodes in the file contain german umlaute.
HG The file is valid XML!

I do not work with XML DOM but if the file is UTF-8 and the text node
contains an ANSI german umlaute the XML is not valid as the text is
not UTF-8 conformant.

fpc UTF-8 decoder clear any UTF-8 string with wrong encode (I think
trunk one replace wrong encodes with '?' mark).

-- 
Best regards,
 JoshyFun

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Michael Van Canneyt



On Mon, 6 Jul 2009, Henrik Genssen wrote:


As I said, the file is valid XML.


No-one disputes this, the question is how the codepage is used in the rest
of your program. You may need to do a manual transformation e.g. to
widestrings in order to be able to use the XML objects in your application.

Michael.



I do not work with XML DOM but if the file is UTF-8 and the text node
contains an ANSI german umlaute the XML is not valid as the text is
not UTF-8 conformant.

fpc UTF-8 decoder clear any UTF-8 string with wrong encode (I think
trunk one replace wrong encodes with '?' mark).

--
Best regards,
JoshyFun

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Henrik Genssen
using UTF8Encode / UTF8Decode does not make any change
I used that already - even leaving it out does not work.

regards

Hinnack


reply to message:
date: 06.07.2009 12:42:46
from: Valdas Jankunas zmu...@gmail.com
to: FPC-Pascal users discussions fpc-pascal@lists.freepascal.org
subject: Re: [fpc-pascal] XML with lazarus UTF8 problem

Henrik Genssen raše:
 Hi,
 
 I am using DOM and XMLRead units for parsing XML files.
 The XML file is UTF-8. The text-nodes in the file contain german umlaute.
 The file is valid XML!
 
 When it try to get the textnodes containing german umlaute, I get an empty 
 string, although
 the length of the string seems to be correct.
 
 I uses Lazarus from trunk this morning, FPC 2.2.4.
 I even tried fcl-xml from FPC svn trunk with no changes...
 
 -- Hinnack

  Maybe this helps you:

http://news.gmane.org/find-root.php?message_id=%3c4734B336.1080806%40erdves.lt%3e

  I solved this problem by using UTF8Decode/UTF8Encode routines to 
convert between UTF8 - Widestring.

-- 
   Valdas Jankunas
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Marc Santhoff
Moin Hinnack,

Am Montag, den 06.07.2009, 11:28 +0200 schrieb Henrik Genssen:
 using UTF8Encode / UTF8Decode does not make any change
 I used that already - even leaving it out does not work.

I have used that routines for using XML files encoded similarly to
yours.

After reading the file to a DOM treee I'm putting some strings into
GTK-components like this:

  stitle := UTF8Encode(widestring(entry.Title));

(Where entry is a structure pointing to the DOM tree.)

And the umlauts are shown as expected. Did you add cwstring to the
uses clause of your program?

uses
  {$IFDEF UNIX}{$IFDEF UseCThreads}
  cthreads,
  {$ENDIF}{$ENDIF}
  Interfaces, // this includes the LCL widgetset
  Forms
  { you can add units after this }, cwstring, ...

HTH,
-- 
Marc Santhoff m.santh...@web.de

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Henrik Genssen
Moin, Moin,

sounds interesting. Where do I get cwstring?

on this thread someone pointed out, that one needs it only on linux - do I need 
it on darwin, too?
http://lists.freepascal.org/lists/fpc-devel/2007-November/012047.html

Why do I have to Encode and not to Decode UTF8 if the file is UTF8. This is my 
mistake. It works now!
But I do not understand that. Can someone explain?

regards

Hinnack

reply to message:
date: 06.07.2009 14:14:31
from: Marc Santhoff m.santh...@web.de
to: FPC-Pascal users discussions fpc-pascal@lists.freepascal.org
subject: RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

Moin Hinnack,

Am Montag, den 06.07.2009, 11:28 +0200 schrieb Henrik Genssen:
 using UTF8Encode / UTF8Decode does not make any change
 I used that already - even leaving it out does not work.

I have used that routines for using XML files encoded similarly to
yours.

After reading the file to a DOM treee I'm putting some strings into
GTK-components like this:

  stitle := UTF8Encode(widestring(entry.Title));

(Where entry is a structure pointing to the DOM tree.)

And the umlauts are shown as expected. Did you add cwstring to the
uses clause of your program?

uses
  {$IFDEF UNIX}{$IFDEF UseCThreads}
  cthreads,
  {$ENDIF}{$ENDIF}
  Interfaces, // this includes the LCL widgetset
  Forms
  { you can add units after this }, cwstring, ...

HTH,
-- 
Marc Santhoff m.santh...@web.de

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


RE: RE: Re: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread Marc Santhoff
Am Montag, den 06.07.2009, 12:12 +0200 schrieb Henrik Genssen:
 Moin, Moin,
 
 sounds interesting. Where do I get cwstring?

It comes with fpc, you only need to activate it.

 on this thread someone pointed out, that one needs it only on linux - do I 
 need it on darwin, too?
 http://lists.freepascal.org/lists/fpc-devel/2007-November/012047.html

Dunno, I'm not using Darwin.

 Why do I have to Encode and not to Decode UTF8 if the file is UTF8. This is 
 my mistake. It works now!
 But I do not understand that. Can someone explain?

Don't aks me, I didn't name those functions. IIRC I asked myself the
same question. ;)

Munter bleiben,
-- 
Marc Santhoff m.santh...@web.de

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


Re[4]: [fpc-pascal] XML with lazarus UTF8 problem

2009-07-06 Thread JoshyFun
Hello FPC-Pascal,

Monday, July 6, 2009, 12:12:18 PM, you wrote:

HG sounds interesting. Where do I get cwstring?
HG on this thread someone pointed out, that one needs it only on
HG linux - do I need it on darwin, too?
HG http://lists.freepascal.org/lists/fpc-devel/2007-November/012047.html
HG Why do I have to Encode and not to Decode UTF8 if the file is
HG UTF8. This is my mistake. It works now!
HG But I do not understand that. Can someone explain?

So the problem was not reading the DOM but displaying the nodes. The
fpc DOM (if my memory serves me) always store everything in WideString
format, regardless the input encoding. That's the reason you cast it
to WideString and them UTF8 encode it to be displayed by the LCL. I
think the cast is not needed, but as the string is a DOMString maybe
it is a must (not checked).

-- 
Best regards,
 JoshyFun

___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


[fpc-pascal] New Allegro.pas release

2009-07-06 Thread Guillermo Martínez Jiménez
I've uploaded two now releases of Allegro.pas.

Allegro.pas is a wrapper to use Allegro with Pascal compilers (Delphi,
Free Pascal, etc). Allegro is a cross-platform library intended for
use in computer games and other types of multimedia programming.

The new release (4.4.0 Alpha2) fixes a bug, adds new functionality
(mainly a 3D software based renderer) and new documentation and
examples. Two of the improvements applies also to the stable versions
and since the newest one lost Delphi compatibility (which will be
hopefully restored in version 5, with the new API) I decided to
release a new 4.2 release too.

The Allegro.pas' project website is at
http://allegro.pas.sourceforge.net/. You can download any release file
from that site.

Guillermo Ñuño Martínez
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal


[fpc-pascal] Re: New Allegro.pas release

2009-07-06 Thread Guillermo Martínez Jiménez
There are an error in my last message.  The actual Allegro.pas'
 project website is at http://allegro-pas.sourceforge.net/

Sorry

Guillermo Ñuño Martínez
___
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-pascal