Greetings all, Since posting my original request I ran across the following, which is a nice example of what I'd call a case study. Gives a few details at least and is kind of an interesting or creative use of Hadoop...
http://engineering.foursquare.com/2011/02/28/how-we-found-the-rudest-cities-in-the-world-analytics-foursquare/ Enjoy, Ted On Sun, Feb 27, 2011 at 9:31 PM, Ted Pedersen <[email protected]> wrote: > Thanks for all these great ideas. These are really very helpful. > > What I'm also hoping to find are articles or papers that describe what > particular companies or organizations have done with Hadoop. How does > Facebook use Hadoop for example (that's one of the case studies in the > White book), or how does last.fm use Hadoop (another of the case > studies in the White book). > > One interesting resource is the list of "powered by Hadoop" projects > available here: > > http://wiki.apache.org/hadoop/PoweredBy > > Some of these entries provide links to more detailed discussions of > what an organization is doing, as in the following from Twitter > http://www.slideshare.net/kevinweil/hadoop-pig-and-twitter-nosql-east-2009 > > So any additional descriptions of what specific organizations are > doing with Hadoop (to the extent they are willing to share) would be > really helpful (these sorts of "real world" cases tend to be > particularly motivating). > > Cordially, > Ted > > On Sun, Feb 27, 2011 at 9:23 PM, Simon <[email protected]> wrote: >> I think you can also simulate PageRank Algorithm with hadoop. >> >> Simon - >> >> On Sun, Feb 27, 2011 at 9:20 PM, Lance Norskog <[email protected]> wrote: >> >>> This is an exercise that will appeal to undergrads: pull the Craiglist >>> personals ads from several cities, and do text classification. Given a >>> training set of all the cities, attempt to classify test ads by city. >>> (If Peter Harrington is out there, I stole this from you.) >>> >>> Lance >>> >>> On Sun, Feb 27, 2011 at 4:55 PM, Ted Dunning <[email protected]> >>> wrote: >>> > Ted, >>> > >>> > Greetings back at you. It has been a while. >>> > >>> > Check out Jimmy Lin and Chris Dyer's book about text processing with >>> > hadoop: >>> > >>> > http://www.umiacs.umd.edu/~jimmylin/book.html >>> > >>> > >>> > On Sun, Feb 27, 2011 at 4:34 PM, Ted Pedersen <[email protected]> >>> wrote: >>> > >>> >> Greetings all, >>> >> >>> >> I'm teaching an undergraduate Computer Science class that is using >>> >> Hadoop quite heavily, and would like to include some case studies at >>> >> various points during this semester. >>> >> >>> >> We are using Tom White's "Hadoop The Definitive Guide" as a text, and >>> >> that includes a very nice chapter of case studies which might even >>> >> provide enough material for my purposes. >>> >> >>> >> But, I wanted to check and see if there were other case studies out >>> >> there that might provide motivating and interesting examples of how >>> >> Hadoop is currently being used. The idea is to find material that goes >>> >> beyond simply saying "X uses Hadoop" to explaining in more detail how >>> >> and why X are using Hadoop. >>> >> >>> >> Any hints would be very gratefully received. >>> >> >>> >> Cordially, >>> >> Ted >>> >> >>> >> -- >>> >> Ted Pedersen >>> >> http://www.d.umn.edu/~tpederse >>> >> >>> > >>> >>> >>> >>> -- >>> Lance Norskog >>> [email protected] >>> >> >> >> >> -- >> Regards, >> Simon >> > > > > -- > Ted Pedersen > http://www.d.umn.edu/~tpederse > -- Ted Pedersen http://www.d.umn.edu/~tpederse
