Re: [Virtuoso-users] Unicode in Virtuoso

2018-03-30 Thread Erich Bremer
The fault was mine.  I forgot to specify the "UTF-8" encoding when creating
the file writer.  I changed

writer = new OutputStreamWriter(new GZIPOutputStream(output));

to

writer = new OutputStreamWriter(new GZIPOutputStream(output), "UTF-8");

The two files look nearly identical, and the first one even displays
correctly in some programs, but only the file written with the explicit
encoding loads correctly into Virtuoso.  - Erich
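
For anyone hitting the same problem: without the second argument,
OutputStreamWriter uses the platform's default charset, which is not
necessarily UTF-8.  Below is a minimal, self-contained sketch of the fix,
not the original program.  The class name, the helper methods, the in-memory
output stream and the urn:example IRIs are all assumptions made for
illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical demo class; not taken from the original program.
class Utf8GzipWriterSketch {

    // Gzip the text, encoding characters as UTF-8 regardless of the platform default.
    static byte[] writeGzippedUtf8(String text) throws IOException {
        ByteArrayOutputStream output = new ByteArrayOutputStream();
        // try-with-resources closes the writer, which also finishes the gzip stream.
        try (Writer writer = new OutputStreamWriter(
                new GZIPOutputStream(output), StandardCharsets.UTF_8)) {
            writer.write(text);
        }
        return output.toByteArray();
    }

    // Decompress and decode as UTF-8, to confirm the text survives the round trip.
    static String readGzippedUtf8(byte[] gzipped) throws IOException {
        ByteArrayOutputStream raw = new ByteArrayOutputStream();
        try (InputStream in = new GZIPInputStream(new ByteArrayInputStream(gzipped))) {
            byte[] buffer = new byte[4096];
            int n;
            while ((n = in.read(buffer)) != -1) {
                raw.write(buffer, 0, n);
            }
        }
        return new String(raw.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Made-up triple; the literal is the Greek capital omega (U+03A9) from the thread.
        String triple = "<urn:example:s> <urn:example:p> \"\u03A9\" .\n";
        byte[] gzipped = writeGzippedUtf8(triple);
        System.out.println(triple.equals(readGzippedUtf8(gzipped)));
    }
}
```

Passing StandardCharsets.UTF_8 instead of the string "UTF-8" is a small
design choice: it avoids the checked UnsupportedEncodingException and any
typo in the charset name.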

On Tue, Mar 27, 2018 at 7:29 PM, Hugh Williams wrote:

> Hi Erich,
>
> When you say "bulk update method", I presume you mean the Virtuoso RDF
> Bulk Loader, i.e.
> https://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/VirtBulkRDFLoader ?
>
> If so, can you provide a sample NT file that we can upload to recreate the
> problem locally?  We have not had such complaints, and Virtuoso is used to
> host datasets in many languages with Unicode characters that are uploaded
> using the bulk loader.
>
> Best Regards
> Hugh Williams
> Professional Services
> OpenLink Software, Inc.  //  http://www.openlinksw.com/
> Weblog   -- http://www.openlinksw.com/blogs/
> LinkedIn -- http://www.linkedin.com/company/openlink-software/
> Twitter  -- http://twitter.com/OpenLink
> Google+  -- http://plus.google.com/100570109519069333827/
> Facebook -- http://www.facebook.com/OpenLinkSoftware
> Universal Data Access, Integration, and Management Technology Providers
>
>
>
> On 27 Mar 2018, at 21:41, Erich Bremer  wrote:
>
> I've uploaded some Unicode characters via sparql-auth endpoint using:
>
> prefix : 
> insert data {graph  { :status
> "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"}}
>
> which displays fine:
>
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "Yay!"
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "Ω"
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"
>
> It seems that Unicode data uploaded via the bulk load method (a la isql)
> is not stored correctly: the characters get converted to question
> marks.  - E
>
> On Thu, Mar 22, 2018 at 8:07 PM, Erich Bremer  wrote:
>
>> I've uploaded some N-Triples files using the bulk uploader method to my
>> Virtuoso instance (07.20.3217).  The NT files look fine, but when I query
>> the data via the SPARQL endpoint, the Unicode characters are all converted
>> to question marks.  How do I properly upload Unicode data into Virtuoso?  -
>> Erich Bremer
>>
>
> 
> --
> Check out the vibrant tech community on one of the world's most
> engaging tech sites, Slashdot.org! http://sdm.link/slashdot___
> Virtuoso-users mailing list
> Virtuoso-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
>
>
>


Re: [Virtuoso-users] Unicode in Virtuoso

2018-03-27 Thread Hugh Williams
Hi Erich,

When you say "bulk update method", I presume you mean the Virtuoso RDF Bulk
Loader, i.e.
https://virtuoso.openlinksw.com/dataspace/doc/dav/wiki/Main/VirtBulkRDFLoader ?

If so, can you provide a sample NT file that we can upload to recreate the
problem locally?  We have not had such complaints, and Virtuoso is used to
host datasets in many languages with Unicode characters that are uploaded
using the bulk loader.

Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc.  //  http://www.openlinksw.com/



> On 27 Mar 2018, at 21:41, Erich Bremer  wrote:
> 
> I've uploaded some Unicode characters via sparql-auth endpoint using:
> 
> prefix : 
> insert data {graph  { :status "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"}}
> 
> which displays fine:
> 
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "Yay!"
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "Ω"
> urn:sbu:test  http://www.stonybrook.edu/ns/status  "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"
> 
> It seems that Unicode data uploaded via the bulk load method (a la isql) is
> not stored correctly: the characters get converted to question marks.  - E
> 
> On Thu, Mar 22, 2018 at 8:07 PM, Erich Bremer wrote:
> I've uploaded some N-Triples files using the bulk uploader method to my 
> Virtuoso instance (07.20.3217).  The NT files look fine, but when I query the 
> data via the SPARQL endpoint, the Unicode characters are all converted to 
> question marks.  How do I properly upload Unicode data into Virtuoso?  - 
> Erich Bremer
> 


Re: [Virtuoso-users] Unicode in Virtuoso

2018-03-27 Thread Erich Bremer
I've uploaded some Unicode characters via sparql-auth endpoint using:

prefix : 
insert data {graph  { :status
"卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"}}

which displays fine:

urn:sbu:test  http://www.stonybrook.edu/ns/status  "Yay!"
urn:sbu:test  http://www.stonybrook.edu/ns/status  "Ω"
urn:sbu:test  http://www.stonybrook.edu/ns/status  "卩尺乇ᐯ丨乇山 ㄒ乇乂ㄒ"

It seems that Unicode data uploaded via the bulk load method (a la isql) is
not stored correctly: the characters get converted to question marks.  - E
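
A file can look fine in an editor and still not be valid UTF-8, so one
possible check before bulk loading is to decode the decompressed bytes
strictly.  The Java sketch below does that with a CharsetDecoder set to
report errors rather than silently replace them.  The class and method names
are hypothetical, and the sample data is built in memory rather than read
from a real .nt.gz file.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical helper class for illustration only.
class GzipUtf8Check {

    // Returns true only if the bytes decode as UTF-8 without any malformed sequence.
    static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    // Reads a gzip stream fully and returns the decompressed bytes.
    static byte[] gunzip(InputStream compressed) throws IOException {
        ByteArrayOutputStream raw = new ByteArrayOutputStream();
        try (InputStream in = new GZIPInputStream(compressed)) {
            byte[] buffer = new byte[4096];
            int n;
            while ((n = in.read(buffer)) != -1) {
                raw.write(buffer, 0, n);
            }
        }
        return raw.toByteArray();
    }

    // Compresses raw bytes so the demo does not need a file on disk.
    static byte[] gzip(byte[] raw) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (GZIPOutputStream gz = new GZIPOutputStream(out)) {
            gz.write(raw);
        }
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        String omega = "\u03A9"; // Greek capital omega, as in the thread
        byte[] good = gzip(omega.getBytes(StandardCharsets.UTF_8));
        // The same character in UTF-16BE stands in for a file written in another encoding.
        byte[] bad = gzip(omega.getBytes(StandardCharsets.UTF_16BE));
        System.out.println(isValidUtf8(gunzip(new ByteArrayInputStream(good))));
        System.out.println(isValidUtf8(gunzip(new ByteArrayInputStream(bad))));
    }
}
```

Note the limit of this check: if the writer's charset could not represent a
character and already substituted '?' when the file was written, the file is
still valid UTF-8 and passes.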

On Thu, Mar 22, 2018 at 8:07 PM, Erich Bremer  wrote:

> I've uploaded some N-Triples files using the bulk uploader method to my
> Virtuoso instance (07.20.3217).  The NT files look fine, but when I query
> the data via the SPARQL endpoint, the Unicode characters are all converted
> to question marks.  How do I properly upload Unicode data into Virtuoso?  -
> Erich Bremer
>