I want to add some analytics to my app, and I'm trying to gauge whether MapReduce could be fast enough to allow reports to be generated 'on demand'. I know it's not realistic to get an exact answer; I'm just looking for an order of magnitude... 1 second? 10 seconds? 100 seconds? For example, user #1 might want to generate a report on how many times each type of event occurred between date-B and date-C. So something like:
in Events, group-by event_type count where (user_id='1' && date >= date-B && date < date-C)

This would query a single Model with 4 properties: user_id (int), event_type (int), start_date (date/time), end_date (date/time). Let's assume that there are 10,000,000 Event entities in total, of which 1,000,000 have user_id = 1 and 100,000 of those lie between date-B and date-C, spread across 200 different event types. So the output would have 200 rows, each with an integer value.

Let's also assume (just for the sake of example) that we're willing to run however many instances it takes to get the results in ~110% of the quickest possible time.

Would that take 1 second? 10 seconds? 100 seconds? 1000 seconds?

Thanks,
Tom
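
To make the shape of the job concrete, here is a minimal plain-Python sketch of the aggregation described above, written as a map step and a reduce step. It does not use the App Engine MapReduce library; the names `Event`, `map_event`, `reduce_counts` and `count_events_by_type` are hypothetical, and filtering on `start_date` is an assumption, since the query only says "date".

```python
from collections import Counter
from datetime import datetime
from typing import Iterable, Iterator, NamedTuple, Tuple


class Event(NamedTuple):
    """Hypothetical stand-in for the Event model with its 4 properties."""
    user_id: int
    event_type: int
    start_date: datetime
    end_date: datetime


def map_event(event: Event, user_id: int,
              date_from: datetime, date_to: datetime) -> Iterator[Tuple[int, int]]:
    """Map step: emit (event_type, 1) for each event that passes the filter.

    Assumption: the date range applies to start_date, lower bound inclusive
    and upper bound exclusive, as in the query.
    """
    if event.user_id == user_id and date_from <= event.start_date < date_to:
        yield (event.event_type, 1)


def reduce_counts(pairs: Iterable[Tuple[int, int]]) -> Counter:
    """Reduce step: sum the emitted counts per event_type."""
    counts: Counter = Counter()
    for event_type, n in pairs:
        counts[event_type] += n
    return counts


def count_events_by_type(events: Iterable[Event], user_id: int,
                         date_from: datetime, date_to: datetime) -> Counter:
    """Run the map step over every event, then reduce the emitted pairs."""
    pairs = (pair
             for event in events
             for pair in map_event(event, user_id, date_from, date_to))
    return reduce_counts(pairs)
```

In a real MapReduce run the map step would be spread across many instances and the per-type counts combined afterwards; this single-process version only illustrates the logic, not the timing.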
