Clive Humby, a British mathematician, coined the term “Data is the new oil. It’s valuable, but if unrefined it cannot really be used. It has to be changed into gas, plastic, chemicals, etc., to create a valuable entity that drives profitable activity”. This course is designed to familiarize you with data engineering concepts, ecosystem, lifecycle, processes, and tools. It includes data, data repositories, data integration platforms, data pipelines, different types of languages, and BI and Reporting tools.
In this class we will follow the textbook below. This course is designed towards students who want to learn and pursue data analysis, database administration and data engineering. We will address some use cases from web resources including DataBricks and you will learn:
- the different types of data structures, file formats,and sources of data.
- Extract, transform and load (ETL) processes
- gathering data, wrangling data, querying and analyzing data, performance tuning, security, and compliance to governance regulations.
Delivery Method: Fully in-person
Prerequisites: Students should have basic computer skills; general familiarity of how computing works; basic Networking and computer hardware; and be comfortable working in either Linux, Unix, Windows, or MacOs. Interested students should contact Katie Montovan (firstname.lastname@example.org) by May 9, 2022 for registration.
Course Level: 4000-level
T/F 2:10PM - 4:00PM (Full-term)
Maximum Enrollment: 20
Course Frequency: Every 2-3 years
Categories: All courses , Computer Science , Fully In-Person