Second semester

Spark Introduction

Objectives

- General architecture of local and distributed computing
- Producing simple statistical analyses with Spark
- Stream processing

Course outline

- MapReduce
- Spark ML
- Pipelines
- Streaming, event-time processing

Prerequisites

- Knowledge of Big Data, Cloud computing
- Python, SQL