Advanced Topics in Data Management
CIT 57800 / 3 Cr.
This is an advanced data management course. The topics may change each term it is offered. The objective of this course is to cover emerging topics in data management and to explore cutting-edge technologies in data science. “Big data” is an emerging term that describes the large volume and diversity of data generated by different applications every second. Big data calls for new techniques to efficiently store, manage, analyze, and integrate the data. Topics to be discussed include, but are not limited to, emerging data storage and management techniques for large-scale data sets, cloud-based data mining tools for analyzing large-scale data collections, information retrieval over large-scale data collections, and related data security and privacy issues. The class will also focus on researching, evaluating, and designing data management infrastructure for real-world application domains such as health care, online marketing, and social network analysis.
- Available Online: No
- Credit by Exam: No
- Laptop Required: Yes
Prerequisites/Co-requisites:
TECH 50700 and CIT 52600, plus basic knowledge of computing architecture and programming in Java
Software
- Amazon AWS, Google Cloud Platform, Microsoft Azure
Outcomes
CIT Student Outcomes
(i) Demonstrate a solid understanding of the fundamentals and concepts of data management
(ii) Apply existing advanced data management tools for effective and scalable data management
(iii) Design state-of-the-art infrastructure for data management to meet an organization’s needs
(iv) Analyze and evaluate existing data management platforms and techniques for a specific data management need
(v) Design data flows to store and integrate data from multiple data sources
(vi) Compare and assess existing data management tools or software and recommend the appropriate ones based on the organization’s resources and plans
Topics
- Cloud computing concepts and characteristics, and data privacy and security issues
- Introduction to major Cloud computing platforms, such as Amazon AWS and Microsoft Azure, and their applications
- Relational Database Management Systems (RDBMS) in the Cloud
- Data analysis workflows
- Scalable data management in the Cloud using NoSQL
- Distributed data processing and storage in the Cloud
- Data analytics tools and their applications in the Cloud
- Data visualization tools and their applications in the Cloud
- Mining of massive datasets in the Cloud
- Data Warehouse in the Cloud
- Building Advanced Applications in the Cloud
- Research Trends in Big data analysis using the Cloud