Results

Enrollment and waitlist data for current and upcoming courses refresh every 10 minutes; all other information as of 6:00 AM.


Refine Search Results

SEIS: Software Eng (Grad)

737-03
Big Data Management
 
R 5:45 pm - 9:00 pm
K. Stahl
 
02/01 - 05/21
34/33/0
Lecture
CRN 24616
3 Cr.
Size: 34
Enrolled: 33
Waitlisted: 0
02/01 - 05/21
M T W Th F Sa Su
     

5:45 pm
9:00 pm
Online

     

Subject: Software Eng (Grad) (SEIS)

CRN: 24616

Online: Sync Distributed | Lecture

Online

Kyle Stahl

This course covers the technical concepts of managing vast amount of unstructured, semi-structured and structured data, collectively called "Big Data". Due to the sheer volume of Big Data, traditional approaches to managing databases does not work well for Big data and does not perform as expected. A distributed architecture for both the file system and the operating system is needed. Some of the techniques used in managing Big Data have the origins in the research and the developments that have been going on for decades in the area of parallel processing and distributed database management systems. This course focuses on why big data sets must be distributed and the issues that distribution introduces. The basic concepts on which distributed data sets are handled are discussed first. Once a foundation is defined, software tools that we use to work with big data sets are studied to provide an in-depth analysis of the concepts introduced. Specifically, we will study the issues distributed data design, data fragmentation, data replication, distributed fault tolerance/recovery. We will also study the use of Hadoop, Pig, Hive, and HBase in dealing big data sets and use real life examples of how these open source software are used. Prerequisites:(SEIS 601 or SEIS 603) and SEIS 630. May take concurrently with SEIS 736.

3 Credits


Advanced Search

Day(s) of the Week
Open/Closed Courses