Thanks to some patient help from this list, I now have a working Core Data model. One object in the model is basically a glorified vector polygon — an array of "point" structures that contain about a dozen doubles each. I insert a lot of these polygon objects, and often need to draw all of them very quickly.

I originally included the point structures themselves in the model, with a to-many relationship from each polygon to its points, and sorted the points into their proper order in memory when necessary. This was a bit slow, as Core Data seemed to fault each point object individually, resulting in a lot of query overhead. So I switched my polygon objects to have only one attribute: binary data for the points. I thought this would yield a significant speedup, but Core Data faults my objects so often that I now spend even more time unarchiving each polygon's point data than it took to read each point's row in from the database!
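In case it helps, here's a minimal sketch of the kind of archiving I mean (the "Polygon"/"pointData" names and the struct layout are just placeholders, not my real code):

#import <CoreData/CoreData.h>

// Hypothetical point struct: roughly a dozen doubles per point.
typedef struct {
    double values[12];
} PolygonPoint;

// Pack the points into the polygon's (hypothetical) "pointData" binary attribute.
static void SetPolygonPoints(NSManagedObject *polygon,
                             const PolygonPoint *points, NSUInteger count)
{
    NSData *blob = [NSData dataWithBytes:points
                                  length:count * sizeof(PolygonPoint)];
    [polygon setValue:blob forKey:@"pointData"];
}

// Unpack them again; accessing the attribute is what fires the fault.
static NSUInteger GetPolygonPoints(NSManagedObject *polygon,
                                   const PolygonPoint **outPoints)
{
    NSData *blob = [polygon valueForKey:@"pointData"];
    if (outPoints) *outPoints = (const PolygonPoint *)[blob bytes];
    return [blob length] / sizeof(PolygonPoint);
}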

I've thought about caching the result of my "get all polygons" fetch to speed up redrawing, plus further optimizing my archiving code to help initial load and final save speed. If I do this, my thought was to watch the MOCObjectsDidChange notification for inserted/deleted objects and update my cache when necessary. (By my understanding, the updated objects will already be updated at my cached pointers.) This seems a little icky, though. Does Core Data provide a cleaner way of efficiently keeping a fetch result up-to-date?
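The cache maintenance I'm imagining would look roughly like this (sketch only; the polygonCache ivar, moc, and the "Polygon" entity name are placeholders):

// Keep a cached fetch result in sync via the objects-did-change notification.
// Assumes an NSMutableSet *polygonCache ivar and an NSManagedObjectContext *moc.
- (void)startWatchingContext
{
    [[NSNotificationCenter defaultCenter]
        addObserver:self
           selector:@selector(objectsDidChange:)
               name:NSManagedObjectContextObjectsDidChangeNotification
             object:moc];
}

- (void)objectsDidChange:(NSNotification *)note
{
    NSDictionary *info = [note userInfo];
    for (NSManagedObject *object in [info objectForKey:NSInsertedObjectsKey]) {
        if ([[[object entity] name] isEqualToString:@"Polygon"])
            [polygonCache addObject:object];
    }
    for (NSManagedObject *object in [info objectForKey:NSDeletedObjectsKey]) {
        if ([[[object entity] name] isEqualToString:@"Polygon"])
            [polygonCache removeObject:object];
    }
    // Updated polygons should already be visible through the cached pointers,
    // so NSUpdatedObjectsKey needs no handling here.
}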

Another issue with handling the points via a binary data archive is the wasted memory. These polygons would be immutable if Core Data allowed such a thing, so it seems especially wasteful to keep both the unarchived copy of the point array and the persistent data blob in memory. If I model a (prefetched) to-one relationship to the polygon's binary data instead of an attribute, and call [moc refreshObject:binaryData mergeChanges:NO] once it's unarchived, will that make it fairly likely I'll only have my unarchived copy of the point array data in memory?
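Concretely, something like this is what I have in mind (the "pointBlob" relationship and "data" attribute are hypothetical names):

// Sketch: pull the bytes through a to-one "pointBlob" relationship, copy them,
// then re-fault the blob object so its snapshot of the data can be released.
- (NSData *)copyPointDataForPolygon:(NSManagedObject *)polygon
{
    NSManagedObject *blob = [polygon valueForKey:@"pointBlob"];
    NSData *points = [[blob valueForKey:@"data"] copy];

    NSManagedObjectContext *moc = [polygon managedObjectContext];
    [moc refreshObject:blob mergeChanges:NO];   // discard the in-memory snapshot

    return points;   // caller owns the copied data
}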

Or are the above optimizations a bad approach to solving this problem? I'd like to get a little closer to the "terabyte sized database with billions of rows/tables/columns" advertised in the performance section, but it seems that statement assumes I'm fetching only a small fraction of those rows at once. For flat primitive data like this, is it possible to get performance closer to a raw push of the data to/from disk with Core Data?

thanks,
-natevw
