Sebastian Trüg wrote:
The raptor API says that all strings (URIs and literals) are utf8. However,
when parsing a file with encoding UTF-8 or encoding ISO8859-1 containing a
literal with a german umlaut, I do not get utf8 in either case.
Can you file a bug and attach that file (or something minimal that
demonstrates it)?
So before searching though the raptor code and trying to figure it out by
myself two questions:
- Does raptor ALWAYS produce utf8 strings?
Yes.
- Is the following code acceptable:
void raptorTriplesHandler( void* userData, const raptor_statement* triple )
{
[...]
switch( triple->object_type ) {
case RAPTOR_IDENTIFIER_TYPE_LITERAL:
fromUtf8( (const char*)triple->object );
[...]
}
[...]
I don't know what that does, but every raptor (& redland) literal string
and URI string are all UTF-8. Everywhere you see unsigned char*, basically.
Dave
_______________________________________________
redland-dev mailing list
[email protected]
http://lists.librdf.org/mailman/listinfo/redland-dev