We discovered a bug. We're not reading in the entire blob, but we are reading in far too much data.
Fred has a fix waiting in the rafters. I'll post again when it's been pushed. -- Ikai Lan Developer Programs Engineer, Google App Engine Blogger: http://googleappengine.blogspot.com Reddit: http://www.reddit.com/r/appengine Twitter: http://twitter.com/app_engine On Thu, Nov 4, 2010 at 2:36 AM, Cyrille Vincey <[email protected]> wrote: > Not a lot of interesting stuff to say : > 1. My code is quite as simple as your sample code: the only real difference > is that I create 2 parent/child entities in a row for one given csv line > entry. > 2. My csv file contains 4.3 million lines. > 2. I launched the mapper job with 10 shards. > 3. "worker-attempt-XXX" tasks had 20 retries each in average. > 4. The blobstore bytes read quota (100 Go) got reached within the first 3 > hours. > 5. Est. 10% of the entities where actually created after 24h (with my > previous non-blob-based mapper job, those 4.3 million entities where created > within 1 day) > 6. Log does not reveal anything interesting. > > I am currently running a new test with a 500,000 lines csv file (20 Mb > file). > Performance looks better. To me, blob file size may have an influence on > the mapper performance. > > If you need more details, let me know. > > From: "Ikai Lan (Google)" <[email protected]> > Reply-To: <[email protected]> > Date: Wed, 3 Nov 2010 12:22:10 -0700 > To: <[email protected]> > Subject: Re: [appengine-java] Mapper & Blobstore bytes read limit > > This behavior doesn't seem right. No, the entire blob should not be getting > read. We'll look into this. > > Do you have any more details? Could tasks be getting retried? > > -- > Ikai Lan > Developer Programs Engineer, Google App Engine > Blogger: http://googleappengine.blogspot.com > Reddit: http://www.reddit.com/r/appengine > Twitter: http://twitter.com/app_engine > > > > On Tue, Nov 2, 2010 at 9:42 AM, Cyrille Vincey <[email protected]> wrote: > >> I've been testing Ikai's bulkload mapper (see url below) with a pretty big >> csv file (200 Mb). >> It works great, and I encourage most of you to consider implementing this >> for entity uploads. >> >> Yet, I do face one last issue with an unexpected quota : blobstore bytes >> read. >> This quota cannot be tuned via the billing settings, and it's not clear >> whether it limits the speed of my process or not when it's reached. >> >> >> See ? Yep, it's a lot of bytes read… >> Could someone confirm that the blob csv file is *NOT* fully fetched each >> time the mapper iterates on a new line ? >> >> (ikai's post) >> http://ikaisays.com/2010/08/11/using-the-app-engine-mapper-for-bulk-data-import >> / >> >> -- >> You received this message because you are subscribed to the Google Groups >> "Google App Engine for Java" group. >> To post to this group, send email to >> [email protected]. >> To unsubscribe from this group, send email to >> [email protected]<google-appengine-java%[email protected]> >> . >> For more options, visit this group at >> http://groups.google.com/group/google-appengine-java?hl=en. >> > > -- > You received this message because you are subscribed to the Google Groups > "Google App Engine for Java" group. > To post to this group, send email to > [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/google-appengine-java?hl=en. > > -- > You received this message because you are subscribed to the Google Groups > "Google App Engine for Java" group. > To post to this group, send email to > [email protected]. > To unsubscribe from this group, send email to > [email protected]<google-appengine-java%[email protected]> > . > For more options, visit this group at > http://groups.google.com/group/google-appengine-java?hl=en. > -- You received this message because you are subscribed to the Google Groups "Google App Engine for Java" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/google-appengine-java?hl=en.
<<Capture d¹écran 2010-11-02 à 17 .17.25.png>>
