Research
First semester

Dataops

Objectives

This class aims to introduce to the DataOps skills set and philosophy. DataOps isa new concept that appeared around 2018. Because companies started to treat data as a product we needed to approach data in a more agile way. In order to simplify data usage for all, engineers need to treat data like an asset. This is DataOps.

During this class students will see and create self-service data platform through concepts like Infrastructure as Code B IaC — Terraform, Ansible), CI/CD and Software Engineering abstraction.

This lecture will also help you navigate through the whole data tools ecosystem and will give you the decision framework to choose a tool over another.

Course outline

– Why DataOps, MLOps and DevOps?

– AWS Cloud platform, a reminder

– Infrastructure as Code

– CI/CD concepts

– Docker 101 and how it changed applications deployment

Prerequisites

Familiar with Python and Unix, Cloud (AWS) basic concepts: S3, EC2, networking, RDS-git