Hello Sedna team, 

I am looking for a suitable database for my dataset and Sedna is on the
short list. I am a little worried about scalability. 

I figured out that my descriptive schema will have around 1.8 million
nodes with a maximum depth of 25 to 100 levels, depending on where I
make the cut (a small percentage of my data goes up to the nesting depth
of >8,000, but I am ready to discard that). Do you know from previous
experience if this would work?

Also, the size of my dataset is 65GB and it's supposed to grow by about
10 to 20GB per month (so in one year I will have around 250GB). The size
of the individual XML data is usually between 10kB and 20MB, but can be
bigger. There are around 60,000 XML files to store, with 10,000 to
20,000 new ones per month. Do you often see Sedna being used in such
circumstances?

I have to say that I plan to insert data once a month and do no updates
or deletes. I will be doing lots of queries though. I'm still not sure
about the nature of the queries unfortunately. This is a research
project and I still have to research that part :) 

Greetings,
Nedim Srndic


------------------------------------------------------------------------------
All the data continuously generated in your IT infrastructure 
contains a definitive record of customers, application performance, 
security threats, fraudulent activity, and more. Splunk takes this 
data and makes sense of it. IT sense. And common sense.
http://p.sf.net/sfu/splunk-novd2d
_______________________________________________
Sedna-discussion mailing list
Sedna-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sedna-discussion

Reply via email to