Week 1 - Jan 27th to Feb 2nd
- Course Info
- Introduction
- Data Center Model
- Data Intensive Sciences
- IaaS, PaaS, and SaaS
- Challenges
|
Week 2 - Feb 3rd to Feb 9th
- Computational Clusters
- Term Projects 1
- Term Projects 2
|
Week 3 - Feb 10th to Feb 16th
- Apache Data Analysis Open Stack
- MapReduce
- Hadoop Framework
- Hadoop Tasks
- Fault Tolerance
|
Week 4 - Feb 16th to Feb 23th
- Programming on a Computer Cluster
- How Hadoop Runs on a MapReduce Job
- Literature Review
- Introduction to BLAST
- BLAST Parallelization
- SIMD vs MIMD; SPMD vs MPMD
- Data Locality
- Optimal Data Locality
- Trask Granularity
- Resource Utilization and Speculative Execution
|
Week 5 - Feb 24th to Mar 2nd
- Growth of Virtual Machines
- Virtualization Implementation Levels
- Virtualization Structures/Tools and Mechanisms
- Virtualization of CPU, Memory and I/O Devices
- Virtual Clusters and Resource Mgmt.
- Virtualization for Data Center Automation
|
Week 6 - Mar 3rd to Mar 9th
- MapReduce Refresher
- Google Search Engine 1
- Google Search Engine 2
- Hadoop PageRank
- Discussions and Parallel Thinking
- Hadoop Extensions
|
Week 7 - Mar 10th to Mar 16th
- There is no new lecture for this week. Continue to work on the Hadoop PageRank project from the previous lesson
|
Week 8 - Mar 17th to Mar 23rd
|
Week 9 - Mar 24th to Mar 30th
- RDBMS vs. NoSQL
- NoSQL Characteristics
- BigTable
- HBase
- HBase Coding
|
Week 10 - Mar 31st to Apr 6th
- Applying for FutureGrid Account
- FutureGrid India OpenStack
- Hadoop WordCount on VMs
|
Week 11 - Apr 7th to Apr 13th
- There is no new lecture for this week. Continue to work on the previous project.
|
Week 12 - Apr 14th to Apr 20th
- There is no new lecture for this week. Continue to work on the previous project.
|
Week 13 - Apr 21st to Apr 27th
- MapReduce Models
- Designing for Big Data
- Twister Iterative MapReduce
- Application Performance
- Twister K-means Explained
- Twister K-means Code
|
Week 14 - Apr 28th to May 4th
- Hangout Lab 1
- Hangout Lab 2
- Hangout Lab 3
|