The document discusses Amazon Elastic MapReduce, which allows users to run Hadoop jobs on Amazon Web Services infrastructure. It describes how MapReduce works, provides an example of analyzing Apache log files using MapReduce, and discusses how to run and debug MapReduce jobs on Amazon Elastic MapReduce. Finally, it mentions how MapReduce jobs can be scheduled regularly using cron jobs to periodically analyze metrics stored in databases.
36. MapReduce
MapReduce
Google
Map
Reduce Map
Reduce
MapReduce C++ Java Python
Wikipedia “MapReduce”
http://ja.wikipedia.org/wiki/MapReduce
Sunday, September 26, 2010
37. cron
•
PV, UU NFS CSV
• DB
→ DB
• PV, UU
Sunday, September 26, 2010