Hello Hugi,

"KMMassiveOperation" <--- that's a really great class name.
What I do is to pull GIDs or PKs for all of the objects involved using raw rows. As you say, sometimes the data sets are large enough that I have to further subdivide them on a domain-specific basis, but I won't complicate matters further... In any case, I end up with lots of GIDs or PKs.

Then I batch them up into lots of (for example) 100 or so and farm the workload out over JMS (more recently using JSON-RPC through a "JMS adaptor") so that the processing can run concurrently over a number of instances on a number of hosts. Adding instances increases the concurrency and hence the pressure on the database system.

In the case of writing out CSV or Excel-readable XML files, I push the results from the workers into a "BLOB stream" -- effectively just a series of BLOBs that make up one long piece of contiguous data.

The control and monitoring systems for all this are quite complex, but it does work well and I can do it all in EOF without resorting to SQL. Rough sketches of the main pieces follow.
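To make the first step concrete, the raw-row fetch and the batching come out at something like the sketch below. The "Invoice" entity and its "id" key are made up purely for illustration; substitute whatever you're actually working with.

import com.webobjects.eocontrol.EOEditingContext;
import com.webobjects.eocontrol.EOFetchSpecification;
import com.webobjects.foundation.NSArray;
import com.webobjects.foundation.NSDictionary;
import com.webobjects.foundation.NSMutableArray;
import com.webobjects.foundation.NSRange;

import java.util.ArrayList;
import java.util.List;

public class PrimaryKeyBatcher {

    private static final int BATCH_SIZE = 100;

    // Fetch just the primary keys as raw rows (NSDictionary instances)
    // rather than fully-fledged EOs, so even a huge result set stays
    // cheap to hold in memory.
    public static NSArray fetchAllPrimaryKeys(EOEditingContext ec) {
        EOFetchSpecification fs = new EOFetchSpecification("Invoice", null, null);
        fs.setFetchesRawRows(true);
        fs.setRawRowKeyPaths(new NSArray("id"));

        NSArray rows = ec.objectsWithFetchSpecification(fs);
        NSMutableArray pks = new NSMutableArray();

        for (int i = 0; i < rows.count(); i++) {
            NSDictionary row = (NSDictionary) rows.objectAtIndex(i);
            pks.addObject(row.objectForKey("id"));
        }

        return pks;
    }

    // Split the primary keys into lots of BATCH_SIZE for the workers.
    public static List batch(NSArray pks) {
        List batches = new ArrayList();

        for (int i = 0; i < pks.count(); i += BATCH_SIZE) {
            int length = Math.min(BATCH_SIZE, pks.count() - i);
            batches.add(pks.subarrayWithRange(new NSRange(i, length)));
        }

        return batches;
    }
}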
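Farming the batches out then looks roughly like this with plain JMS. The JNDI names here are invented, and in real life the payload is a JSON-RPC call rather than the bare comma-separated list of PKs I've used to keep the sketch short.

import javax.jms.Connection;
import javax.jms.ConnectionFactory;
import javax.jms.MessageProducer;
import javax.jms.Queue;
import javax.jms.Session;
import javax.jms.TextMessage;
import javax.naming.InitialContext;

public class BatchDispatcher {

    // Send each batch of primary keys to a queue so that any number of
    // worker instances on any number of hosts can consume them
    // concurrently.  The JNDI names are illustrative only.
    public static void dispatch(java.util.List batches) throws Exception {
        InitialContext jndi = new InitialContext();
        ConnectionFactory factory = (ConnectionFactory) jndi.lookup("jms/ConnectionFactory");
        Queue queue = (Queue) jndi.lookup("jms/massiveOperationQueue");

        Connection connection = factory.createConnection();
        try {
            Session session = connection.createSession(false, Session.AUTO_ACKNOWLEDGE);
            MessageProducer producer = session.createProducer(queue);

            for (Object batchObj : batches) {
                com.webobjects.foundation.NSArray batch =
                        (com.webobjects.foundation.NSArray) batchObj;
                // one message per batch of ~100 PKs; a comma-separated
                // list stands in for the JSON-RPC payload here
                TextMessage message =
                        session.createTextMessage(batch.componentsJoinedByString(","));
                producer.send(message);
            }
        } finally {
            connection.close();
        }
    }
}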
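And the "BLOB stream" is conceptually just an entity with a stream identifier, a sequence number and a data column; the reader concatenates the chunks in sequence order to get one contiguous document. Again the entity and attribute names ("BlobChunk" and so on) are made up for the sketch.

import com.webobjects.eoaccess.EOUtilities;
import com.webobjects.eocontrol.EOEditingContext;
import com.webobjects.eocontrol.EOEnterpriseObject;
import com.webobjects.foundation.NSData;

public class BlobStreamWriter {

    private final String streamIdentifier;
    private int nextSequence = 0;

    public BlobStreamWriter(String streamIdentifier) {
        this.streamIdentifier = streamIdentifier;
    }

    // Append one chunk of output (e.g. a run of CSV lines produced by a
    // worker) as a row in a hypothetical "BlobChunk" entity.  Reading the
    // chunks back in sequence order yields one long contiguous file.
    public void appendChunk(EOEditingContext ec, byte[] chunk) {
        EOEnterpriseObject eo = EOUtilities.createAndInsertInstance(ec, "BlobChunk");
        eo.takeValueForKey(streamIdentifier, "streamIdentifier");
        eo.takeValueForKey(Integer.valueOf(nextSequence++), "sequence");
        eo.takeValueForKey(new NSData(chunk), "data");
        ec.saveChanges();
    }
}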
cheers.
Anyway, I would love to hear how other folks are handling huge datasets. I would love feedback on the technique I'm using, and ideas for improvement would be great. Just about the only idea I'm not open to is "just use JDBC" ;-). I've been there and I don't want to be there. That's why I'm using EOF :-).
___
Andrew Lindesay
www.lindesay.co.nz
