At any large company that makes heavy use of Hadoop, you aren't going to find any concise description of all the ways that hadoop is used.
That said, here is a concise description of some of the ways that hadoop is (was) used at Yahoo: http://www.slideshare.net/ydn/hadoop-yahoo-internet-scale-data-processing On Sun, Feb 27, 2011 at 7:31 PM, Ted Pedersen <[email protected]> wrote: > Thanks for all these great ideas. These are really very helpful. > > What I'm also hoping to find are articles or papers that describe what > particular companies or organizations have done with Hadoop. How does > Facebook use Hadoop for example (that's one of the case studies in the > White book), or how does last.fm use Hadoop (another of the case > studies in the White book). > > One interesting resource is the list of "powered by Hadoop" projects > available here: > > http://wiki.apache.org/hadoop/PoweredBy > > Some of these entries provide links to more detailed discussions of > what an organization is doing, as in the following from Twitter > http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009 > > So any additional descriptions of what specific organizations are > doing with Hadoop (to the extent they are willing to share) would be > really helpful (these sorts of "real world" cases tend to be > particularly motivating). > > Cordially, > Ted > > On Sun, Feb 27, 2011 at 9:23 PM, Simon <[email protected]> wrote: > > I think you can also simulate PageRank Algorithm with hadoop. > > > > Simon - > > > > On Sun, Feb 27, 2011 at 9:20 PM, Lance Norskog <[email protected]> > wrote: > > > >> This is an exercise that will appeal to undergrads: pull the Craiglist > >> personals ads from several cities, and do text classification. Given a > >> training set of all the cities, attempt to classify test ads by city. > >> (If Peter Harrington is out there, I stole this from you.) > >> > >> Lance > >> > >> On Sun, Feb 27, 2011 at 4:55 PM, Ted Dunning <[email protected]> > >> wrote: > >> > Ted, > >> > > >> > Greetings back at you. It has been a while. > >> > > >> > Check out Jimmy Lin and Chris Dyer's book about text processing with > >> > hadoop: > >> > > >> > http://www.umiacs.umd.edu/~jimmylin/book.html > >> > > >> > > >> > On Sun, Feb 27, 2011 at 4:34 PM, Ted Pedersen <[email protected]> > >> wrote: > >> > > >> >> Greetings all, > >> >> > >> >> I'm teaching an undergraduate Computer Science class that is using > >> >> Hadoop quite heavily, and would like to include some case studies at > >> >> various points during this semester. > >> >> > >> >> We are using Tom White's "Hadoop The Definitive Guide" as a text, and > >> >> that includes a very nice chapter of case studies which might even > >> >> provide enough material for my purposes. > >> >> > >> >> But, I wanted to check and see if there were other case studies out > >> >> there that might provide motivating and interesting examples of how > >> >> Hadoop is currently being used. The idea is to find material that > goes > >> >> beyond simply saying "X uses Hadoop" to explaining in more detail how > >> >> and why X are using Hadoop. > >> >> > >> >> Any hints would be very gratefully received. > >> >> > >> >> Cordially, > >> >> Ted > >> >> > >> >> -- > >> >> Ted Pedersen > >> >> http://www.d.umn.edu/~tpederse > >> >> > >> > > >> > >> > >> > >> -- > >> Lance Norskog > >> [email protected] > >> > > > > > > > > -- > > Regards, > > Simon > > > > > > -- > Ted Pedersen > http://www.d.umn.edu/~tpederse >
