Shalom,
On Dec 4 we will host a lecture by Yahoo! at School of Computer
Science at Tel-Aviv University. The day before, we will have a Googel
workshop mainly intend for studnets, but if we will have place, other
can join too.
Hadoop / Mapreduce workshop in Tel-Aviv University
The Tel Aviv University School of CS and Google will host Cloudera on
Dec 3-4 for a 2-day workshop on the design and technologies of
distributed computation for web-scale data processing. On Dec 4, between
17-21 we will have a session from Yahoo! as described below.
The workshop will be led by Cristophe Bisciglia, Founder & Chief
Strategy Officer and Aaron Kimball, Cloudera Founding Engineer.
The first day of the workshop will consist of lectures and a Code lab
with Hadoop and related tools. Hadoop is a public domain software
inspired by Google technologies such as MapReduce and Google File
System. Grad students will have priority in registration - please
specify.
The second day will discuss curriculum design issues for those
considering teaching the material in a full semester course. This
session will be open for relevant people only.
Google encourages academic courses on large scale computing in
universities around the world, including courses given in MIT, Berkeley,
University of Washington, National Taiwan University (NTU), National
Chiao Tung Univ (NCTU) in Taiwan, Tsinghua Univ and Peking Univ in
China.
Please fill the form bellow to register to the workshop.
Thank you.
http://docs.google.com/Doc?id=dgm2jx8k_8chgc59gc
Hadoop at Yahoo! State of technology today and future development.
Talk Abstract
As the most visited site on the Internet Yahoo! has to build products
that scale to thousands of servers. Adoption of Behavioral Targeting &
WebAnalytics into the production cycle forced web companies to build
expensive data pipelines with data warehouse and myriad of compute
clusters. Then, grid computing that for a long time strove to find its
place in the industry turn to be the right tool for the job. In 2006
Yahoo! started with open-source project called Hadoop with a goal to
build stable & scalable grid solution that include distributed file
system (HDFS) and Map Reduce framework. Today Y! Grid Technology Team
completes migration of all Y! data driven businesses to Hadoop. The
project quickly gained momentum in open source community attracting
dozens of contributors. Every month we host Hadoop User Group at Yahoo!
that became a meeting place for Hadoop developers and users from
Facebook, IBM, Google, Ebay and many Silicon Valley startups.
In my presentation I will talk about how Web Giants use data,
technologies that were used so far, and how Hadoop helps to streamline
product development and R&D. We will also cover the current state of
Hadoop technology and next year Roadmap.
BIO.
Michael Pilip is Sr. Product Manager in Y! Cloud Computing & Data
Infrastructure division responsible for Hadoop development and Y!
products migration to grid. Before joining Hadoop Michael worked on Y!
analytical data pipeline that brings data from dozens of thousands web
servers around the globe to Data Marts and Behavioral Targeting
applications. Prior to turning to data business Michael was a lead
developer in Y! Games building the biggst on-line games portal in the
World.
Agenda:
How web giants use user-generated data (WebAnalytics, Behavioral
Targeting, Reporting, R&D) Data Pipelines architecture (Data Collection,
ETL, Data Warehousing, Aggregations, DataMarts, AdServing Systems)
Hadoop to the rescue Hadoop at Yahoo! (development and cluster
operations) Hadoop open-source projects and major trends.
Q&A
_______________________________________________
Discussions mailing list
[email protected]
http://hamakor.org.il/cgi-bin/mailman/listinfo/discussions