Hello, 

If I try to load this XML file into Sedna: 

<?xml version="1.1" encoding="UTF-8" ?>
<a>&#x1;</a>

I get an error saying that I have a reference to an invalid character
number. The character U+0001 is valid in XML 1.1. Does this mean that
Sedna does not support XML 1.1? If that is the case, do you know of a
way how to detect if a string contains characters which are not allowed
by XML 1.0, so that I can convert them to base 64? I am using Xerces to
convert my data into XML. 

I also have a question regarding space consumption. I am running Sedna
3.5.95, I have loaded 100 XML files with the total size of 1.2GB into
the database and the size of the data folder of this database is now
15GB. Is this normal? What ratio of raw XML data size and database disk
usage should I expect?

Thank you,
Nedim Srndic

On Fri, 2011-11-18 at 20:14 +0400, Ivan Shcheklein wrote:
> Hi Nedim,
> 
> Maximum data which is handled by Sedna now (and we know about it) is
> the Wikipedia dump in the wikixmldb.org demo. 
> 
> 
> Some statistics:
> 
> 
> 1. Descriptive schema size ~100K.
> 2. Raw data: 45GB, loaded data ~ 200GB.
> 3. Wikixmldb.org works pretty well on quite complex queries (with help
> of indices) using 3GB database buffers.
> 
> 
> We didn't have experience with millions descriptive schema nodes in
> data. It should work, though I can't say anything about performance or
> database size. I would recommend you to try to upload your current
> data in Sedna. It's very easy to try. Let us know if you need some
> help.
> 
> 
> BTW, better to use the latest development build:
> 
> 
> http://modis.ispras.ru/FTPContent/sedna/development
> 
> 
> Ivan Shcheklein,
> Sedna Team

-- 
Nedim Srndic, M. Eng.

  Research Assistant
  University of Tübingen
  Wilhelm-Schickard-Institute for Computer Science 
  Dept. Cognitive Systems 
  Sand 1, D-72076 Tübingen, Germany
  tel: +49 7071 29 77175
  
  http://www.ra.cs.uni-tuebingen.de/mitarb/srndic/


------------------------------------------------------------------------------
Write once. Port to many.
Get the SDK and tools to simplify cross-platform app development. Create 
new or port existing apps to sell to consumers worldwide. Explore the 
Intel AppUpSM program developer opportunity. appdeveloper.intel.com/join
http://p.sf.net/sfu/intel-appdev
_______________________________________________
Sedna-discussion mailing list
Sedna-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sedna-discussion

Reply via email to