DC area event: Investing in the Cloud: A Breakfast Conversation

2009-05-11 Thread Jimmy Lin
, and Deepak Singh, representing Amazon Web Services, a major provider of utility computing and cloud infrastructure. The discussion will be moderated by Prof. Jimmy Lin, who leads Maryland's cloud computing efforts in the Google/IBM Academic Cloud Computing Initiative. = Schedule and Logistics 8:00 am – 8

DC area event: Investing in the Cloud: A Breakfast Conversation

2009-04-29 Thread Jimmy Lin
will be moderated by Prof. Jimmy Lin, who leads Maryland's cloud computing efforts in the Google/IBM Academic Cloud Computing Initiative. = Schedule and Logistics 8:00 am – 8:30 am Breakfast and Networking 8:30 am – 9:30 am Plenary session by invited speakers 9:30 am – 10:00 am Panel

Re: Coordination between Mapper tasks

2009-03-28 Thread Jimmy Lin
Hmmm... sounds odd. Given the same memcached servers (config), the hashing should be consistent. FYI, all code for the experiments described in that tech report is in cloud9, the library I use for teaching my courses. Download at: http://www.umiacs.umd.edu/~jimmylin/ Hope this helps! (Let

Re: Using HDFS to serve www requests

2009-03-26 Thread Jimmy Lin
Brian--- Can you share some performance figures for typical workloads with your HDFS/Fuse setup? Obviously, latency is going to be bad but throughput will probably be reasonable... but I'm curious to hear about concrete latency/throughput numbers. And, of course, I'm interested in these

Re: Coordination between Mapper tasks

2009-03-21 Thread Jimmy Lin
Hi Stuart, You might want to look at a memcached solution some students and I worked out for exactly this problem. It's written up in: Jimmy Lin, Anand Bahety, Shravya Konda, and Samantha Mahindrakar. Low-Latency, High-Throughput Access to Static Global Resources within the Hadoop

Re: OT: How to search mailing list archives?

2009-03-08 Thread Jimmy Lin
I've found nabble to be helpful: http://www.nabble.com/Hadoop-core-user-f30590.html -Jimmy Miles Osborne wrote: posts tend to get indexed by Google, so try that Miles 2009/3/8 Stuart White stuart.whi...@gmail.com: This is slightly off-topic, and I realize this question is not specific to

Lazily deserializing Writables

2008-10-02 Thread Jimmy Lin
Hi everyone, I'm wondering if it's possible to lazily deserialize a Writable. That is, when my custom Writable is handed a DataInput from readFields, can I simply hang on to the reference and read from it later? This would be useful if the Writable is a complex data structure that may be

Re: Lazily deserializing Writables

2008-10-02 Thread Jimmy Lin
the serialized version off the wire completely by prepending the size. Then, we can read in the raw bytes and hang on to them for later as we see fit. I would think that leaving the bytes on the DataInput would break things in a very impressive way. -Bryan On Oct 2, 2008, at 2:48 PM, Jimmy Lin wrote

slash in AWS Secret Key, WAS Re: Namenode Exceptions with S3

2008-07-09 Thread Jimmy Lin
I've come across this problem before. My simple solution was to regenerate new keys until I got one without a slash... ;) -Jimmy I have Hadoop 0.17.1 and an AWS Secret Key that contains a slash ('/'). With distcp, I found that using the URL format s3://ID:[EMAIL PROTECTED]/ did not work,

Re: walkthrough of developing first hadoop app from scratch

2008-03-22 Thread Jimmy Lin
Hi Stephen et al., I would take advantage of the Hadoop plug-in for Eclipse to handle the mundane aspects of putting together your job and running it on the cluster. With respect to gentler introductions on application development, you might want to take a look at the following:

Question about key sorting interaction effects

2008-02-08 Thread Jimmy Lin
Hi guys, I was wondering if someone could explain the possible interaction effects between the different methods available to control key sorting. Based on my understanding, there are three separate knobs: - a WritableComparable's compareTo method - registering a WritableComparator