Re: [ZODB-Dev] ZODB memory problems (was: processing a Very Large file)

2005-05-29 Thread Christian Theune
Am Samstag, den 21.05.2005, 17:38 +0200 schrieb Christian Heimes:
 Grab the Zope2 sources and read lib/python/OFS/Image.py. Zope's 
 OFS.Image.Image class (and also Zope3's implementation) is using a so 
 called possible large data class (Pdata) that is a subclass of Persistent.
 
 Pdata is using a simple and genious approach to minimize the memory 
 usage when storing large binary data in ZODB. The data is read from a 
 [...]

Actually Pdata has some drawbacks. When the blobsupport branch gets
declared stable (I think it's not gonna happen in 3.4, but nobody told
me otherwise) we'll have really good blob support without this black
magic.

Cheers,
Christian

-- 
gocept gmbh  co. kg - schalaunische str. 6 - 06366 koethen - germany
www.gocept.com - [EMAIL PROTECTED] - phone +49 3496 30 99 112 -
fax +49 3496 30 99 118 - zope and plone consulting and development


signature.asc
Description: This is a digitally signed message part
___
For more information about ZODB, see the ZODB Wiki:
http://www.zope.org/Wikis/ZODB/

ZODB-Dev mailing list  -  ZODB-Dev@zope.org
http://mail.zope.org/mailman/listinfo/zodb-dev


Re: [ZODB-Dev] ZODB memory problems (was: processing a Very Large file)

2005-05-29 Thread Andreas Jung



--On 29. Mai 2005 11:29:06 +0200 Christian Theune [EMAIL PROTECTED] wrote:


Am Samstag, den 21.05.2005, 17:38 +0200 schrieb Christian Heimes:

Grab the Zope2 sources and read lib/python/OFS/Image.py. Zope's
OFS.Image.Image class (and also Zope3's implementation) is using a so
called possible large data class (Pdata) that is a subclass of
Persistent.

Pdata is using a simple and genious approach to minimize the memory
usage when storing large binary data in ZODB. The data is read from a
[...]


Actually Pdata has some drawbacks. When the blobsupport branch gets
declared stable (I think it's not gonna happen in 3.4, but nobody told
me otherwise) we'll have really good blob support without this black
magic.



The Pdata approach in general is not bad. I have implemented a CVS-like file
repository lately where we store binary content using a pdata like 
structure.
Our largest files are around (100MB) and the performance and efficiency is 
not bad
although it could be better. The bottleneck is either the ZEO communication 
or just the network.
I reach about 3.5 MB/second while reading such a large file from the ZEO 
server.


-aj


pgpZxItW5QTm4.pgp
Description: PGP signature
___
For more information about ZODB, see the ZODB Wiki:
http://www.zope.org/Wikis/ZODB/

ZODB-Dev mailing list  -  ZODB-Dev@zope.org
http://mail.zope.org/mailman/listinfo/zodb-dev


Re: [ZODB-Dev] ZODB memory problems (was: processing a Very Large file)

2005-05-29 Thread Jeremy Hylton
On 5/29/05, Shane Hathaway [EMAIL PROTECTED] wrote:
  Would a multi thread ZEO server improve anything here? Especially
  with concurrent access?
 
 It's possible.  Although ZEO talks over the network using async sockets,
 it reads files synchronously, so I suspect it will frequently sit around
 doing nothing for 10 ms, waiting for the disk to read data.  If your ZEO
 server has a load of 1.0 or more but low CPU usage, this is likely
 happening.  The easiest way to overcome this is to buy gigabytes of RAM
 for the ZEO server--ideally, enough gigabytes to hold your whole database.

A related problem is that the ZEO cache on the client is on disk, too.
 You may end up waiting for a disk seek to get it off disk on the
client.  If you've got it in memory on the server and if the ZEO
protocol were more efficient, that would be a drag.

 Also, the design of ZEO clients tends to serialize communication with
 the ZEO server, so the throughput between client and server is likely to
 be limited significantly by network latency.  ping is a good tool for
 measuring latency; 1 ms is good and .1 ms is excellent.  There are ways
 to tune the network.  You can also reduce the effects of network latency
 by creating and load balancing a lot of ZEO clients.

It's really too bad that ZEO only allows a single outstanding request.
 Restructuring the protocol to allow multiple simulatenous requests
was on the task list years ago, but the protocol implementation is so
complex I doubt it will get done :-(.  I can't help but think building
on top of an existing message/RPC layer would be profitable.  (What's
twisted's RPC layer?)  Or at least something less difficult to use
than asyncore.

Jeremy
___
For more information about ZODB, see the ZODB Wiki:
http://www.zope.org/Wikis/ZODB/

ZODB-Dev mailing list  -  ZODB-Dev@zope.org
http://mail.zope.org/mailman/listinfo/zodb-dev