Re: [Sedna-discussion] DB API performance issue - fn:doc-available

2013-04-12 Thread Ivan Lagunov
P.S. This is the corresponding commit:
http://sedna.org/gitweb/sedna.git/commit/a7306ad318593d7ff308628f8cfb542ddfa
672b2

As it's dated by today, I expect it'll be available in the night builds
since tomorrow.

 

@Ivan, please correct me if I'm wrong.

 

Best regards,

Ivan Lagunov

 

From: Ivan Lagunov [mailto:lagi...@gmail.com] 
Sent: Friday, April 12, 2013 5:30 PM
To: 'Charles Foster'
Cc: 'sedna-discussion'; 'Ivan Shcheklein'
Subject: XML:DB API performance issue - fn:doc-available

 

Hi Charles,

 

We're having serious performance issues, probably reached the critical
number of documents in the database. According to Ivan Shcheklein, it is
likely to be caused by using doc(documents) function that actually locks
all collections in the database, thus, drastically decreasing performance.
Your implementation of XML:DB API works great for small datasets but
according to event.log it uses doc(documents) quite a lot. So it may
indeed result in bad performance I'm facing. As an improvement, there have
been implemented a function fn:doc-available that checks if a document is
available in the given collection. It locks only one collection. This is an
example of usage:

 

doc-available(collection, document)

 

Could you please update the Sedna XML:DB API implementation using this
function? I understand it's recommended to migrate to XQJ implementation but
it's really hard to rewrite all the code we have. Nevertheless, I've planned
to investigate required efforts for this migration but we need some solution
on short notice. Thanks a lot!

 

Best regards,

Ivan Lagunov

--
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis  visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter___
Sedna-discussion mailing list
Sedna-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sedna-discussion


Re: [Sedna-discussion] DB API performance issue - fn:doc-available

2013-04-12 Thread Ivan Shcheklein
Charles,

Development build will be available tomorrow:
http://modis.ispras.ru/FTPContent/sedna/development/ . Version will be =
 3.5.602.

Or you can get latest sources: git clone git://modis.ispras.ru/sedna .

Thank you for your prompt response! As far as I remember, there are other
users waiting for this improvement.

Ivan Shcheklein,
Sedna Team

Ivan Shcheklein,
Sedna Team


On Fri, Apr 12, 2013 at 7:36 PM, Charles Foster char...@cfoster.net wrote:

 Hi Guys,

 Yes, I'll sort this out later and issue a new release.

 Just please tell me which specific version of Sedna this new functionality
 came in please?

 Kind Regards,

 Charles

 On 12 Apr 2013, at 16:33, Ivan Lagunov lagi...@gmail.com wrote:

 P.S. This is the corresponding commit:
 http://sedna.org/gitweb/sedna.git/commit/a7306ad318593d7ff308628f8cfb542ddfa672b2
 
 As it’s dated by today, I expect it’ll be available in the night builds
 since tomorrow.
 ** **
 @Ivan, please correct me if I’m wrong.
 ** **
 Best regards,
 Ivan Lagunov

 *From:* Ivan Lagunov [mailto:lagi...@gmail.com]
 *Sent:* Friday, April 12, 2013 5:30 PM
 *To:* 'Charles Foster'
 *Cc:* 'sedna-discussion'; 'Ivan Shcheklein'
 *Subject:* XML:DB API performance issue - fn:doc-available
 ** **
 Hi Charles,
 ** **
 We’re having serious performance issues, probably reached the critical
 number of documents in the database. According to Ivan Shcheklein, it is
 likely to be caused by using doc(“documents”) function that actually locks
 all collections in the database, thus, drastically decreasing performance.
 Your implementation of XML:DB API works great for small datasets but
 according to event.log it uses doc(“documents”) quite a lot. So it may
 indeed result in bad performance I’m facing. As an improvement, there have
 been implemented a function fn:doc-available that checks if a document is
 available in the given collection. It locks only one collection. This is an
 example of usage:
 ** **
 doc-available(“collection”, “document”)
 ** **
 Could you please update the Sedna XML:DB API implementation using this
 function? I understand it’s recommended to migrate to XQJ implementation
 but it’s really hard to rewrite all the code we have. Nevertheless, I’ve
 planned to investigate required efforts for this migration but we need some
 solution on short notice. Thanks a lot!
 ** **
 Best regards,
 Ivan Lagunov



--
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis  visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter___
Sedna-discussion mailing list
Sedna-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sedna-discussion


Re: [Sedna-discussion] DB API performance issue - fn:doc-available

2013-04-12 Thread Charles Foster
Hi Guys,

Yes, I'll sort this out later and issue a new release.

Just please tell me which specific version of Sedna this new functionality came 
in please?

Kind Regards,

Charles

On 12 Apr 2013, at 16:33, Ivan Lagunov lagi...@gmail.com wrote:

 P.S. This is the corresponding commit: 
 http://sedna.org/gitweb/sedna.git/commit/a7306ad318593d7ff308628f8cfb542ddfa672b2
 As it’s dated by today, I expect it’ll be available in the night builds since 
 tomorrow.
  
 @Ivan, please correct me if I’m wrong.
  
 Best regards,
 Ivan Lagunov
  
 From: Ivan Lagunov [mailto:lagi...@gmail.com] 
 Sent: Friday, April 12, 2013 5:30 PM
 To: 'Charles Foster'
 Cc: 'sedna-discussion'; 'Ivan Shcheklein'
 Subject: XML:DB API performance issue - fn:doc-available
  
 Hi Charles,
  
 We’re having serious performance issues, probably reached the critical number 
 of documents in the database. According to Ivan Shcheklein, it is likely to 
 be caused by using doc(“documents”) function that actually locks all 
 collections in the database, thus, drastically decreasing performance. Your 
 implementation of XML:DB API works great for small datasets but according to 
 event.log it uses doc(“documents”) quite a lot. So it may indeed result in 
 bad performance I’m facing. As an improvement, there have been implemented a 
 function fn:doc-available that checks if a document is available in the given 
 collection. It locks only one collection. This is an example of usage:
  
 doc-available(“collection”, “document”)
  
 Could you please update the Sedna XML:DB API implementation using this 
 function? I understand it’s recommended to migrate to XQJ implementation but 
 it’s really hard to rewrite all the code we have. Nevertheless, I’ve planned 
 to investigate required efforts for this migration but we need some solution 
 on short notice. Thanks a lot!
  
 Best regards,
 Ivan Lagunov

--
Precog is a next-generation analytics platform capable of advanced
analytics on semi-structured data. The platform includes APIs for building
apps and a phenomenal toolset for data science. Developers can use
our toolset for easy data analysis  visualization. Get a free account!
http://www2.precog.com/precogplatform/slashdotnewsletter___
Sedna-discussion mailing list
Sedna-discussion@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/sedna-discussion