Hi
I am a virtuoso fan (and newbie), I have installed and played a bit
with it and I think it is great!

Now I am experiencing some problems in IRI names when using non ASCII
characters.
I have collected as much information as I could, hope they are enough
to figure out thwe problem:
These are the two cases:

1) Differences between isql command line tool and Conductor's 'Sparql
Execution' tool
When I do the following statement I can see the triple correctly from
the same interface, but wrongly in the other.
For example, I do from isql

SQL> sparql insert into <http://localhost/test/charsets> {<http://ààà>
rdf:type owl:Thing};
callret-0
VARCHAR
_______________________________________________________________________________

Insert into <http://localhost/test/charsets>, 1 triples -- done

1 Rows. -- 11 msec.
SQL> sparql select * from <http://localhost/test/charsets> where {?a ?b ?c};
a
           b
                      c
VARCHAR
           VARCHAR
                      VARCHAR
_______________________________________________________________________________

http://ààà
           http://www.w3.org/1999/02/22-rdf-syntax-ns#type
                      http://www.w3.org/2002/07/owl#Thing

1 Rows. -- 2 msec.

Then I try to see the result from Conductor, but this is the result:

a       b       c
http://???      http://www.w3.org/1999/02/22-rdf-syntax-ns#type
        http://www.w3.org/2002/07/owl#Thing

The same example but in the opposite order, now I do the insert from Conductor:

insert into <http://localhost/test/charsets> {<http://ààà2> rdf:type owl:Thing}

and

select * from <http://localhost/test/charsets> where {?a ?b ?c}

a       b       c
http://???      http://www.w3.org/1999/02/22-rdf-syntax-ns#type
        http://www.w3.org/2002/07/owl#Thing
http://ààà2     http://www.w3.org/1999/02/22-rdf-syntax-ns#type
        http://www.w3.org/2002/07/owl#Thing

The triple inserted from Conductor displays correctly. But it is not from isql:

SQL> sparql select * from <http://localhost/test/charsets> where {?a ?b ?c};
a
           b
                      c
VARCHAR
           VARCHAR
                      VARCHAR
_______________________________________________________________________________

http://ààà
           http://www.w3.org/1999/02/22-rdf-syntax-ns#type
                      http://www.w3.org/2002/07/owl#Thing
http://ààà2
           http://www.w3.org/1999/02/22-rdf-syntax-ns#type
                      http://www.w3.org/2002/07/owl#Thing

2 Rows. -- 2 msec.


2) Wrong characters when using 'load <IRI>' statement from both interfaces

In both interfaces, when I use the sparql load <IRI> statement, I
cannot see IRI names correctly when some 'à', 'ò' etc... chars are in.
The public rdf/xml file is correct, its encoding is UTF-8. This is not
declared in the xml top declaration (but I have tried to add it
manually, and I obtained the same behaviour).
IRI are written in two ways, inside the rdf/xml:
- http://someasciicharsà
- http://someasciichars&#224;
In both cases the IRI displays wrong in both interfaces.

Other notes:
* I have seen some documentation about CHARSET parameter of the
connection, and tried to change it through ISQL, but I had the same
behaviour both in 1) and 2).
* I have noted that the HTTP header of the Conductor says UTF-8 while
the HTML meta tag says ISO-8859-1, but I do not know if this has some
influence on the general case (maybe there are multiple problems that
I collapse ;) )
* Virtuoso.ini file contains the following configuration
[HTTPServer]
Charset = UTF-8

Do I need to configure something?
Can anybody help me on figure out the problem?

Thank you in advance

Enrico



-- 
Enrico Daga
Technology Expert
--
Ufficio Sistemi Informativi  (DCSPI-USI)
National Research Council (CNR)
P.le Aldo Moro 7 - Rome, Italy
Tel +39 4993 3321
--
Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC-CNR)
Via Nomentana 56, Rome - Italy
--
http://stlab.istc.cnr.it/stlab/User:EnricoDaga
http://www.enridaga.net
skype: enri-pan

Reply via email to