Machine-Learning
- Teacher(s)
- Sébastien DA VEIGA, Clément ELVIRA, Fabien NAVARRO, Thomas ROUCH
- Course type
- STATISTICS
- Correspondant
- Sébastien DA VEIGA
- Number of ECTS
- 3.5
- Course code
- 3AML001-3A-EI-SBio
- Distribution of courses
-
Heures de cours : 21
- Language of teaching
- French
- Evaluation methods
- Examen final
Objectives
This course focuses on supervised learning methods for regression and classification. Starting with basic algorithms such as ordinary least squares, we will cover regularization methods (essential in large-scale learning contexts), nonparametric decision rules such as support vector machines (SVMs), and more broadly kernel methods in self-reproducing kernel Hilbert spaces. Next, we will study ensemble techniques, with reminders on bagging and random forests, with a particular focus on boosting (XGBoost). Finally, we will cover some aspects of variable selection.rnThroughout the course, we will emphasize methodological and algorithmic aspects, while providing an overview of the underlying theoretical foundations. Practical sessions will allow students to apply the methods to datasets using Python. The course will alternate between lectures and practical sessions.rnrnTranslated with DeepL.com (free version)
Course outline
Supervised learning; Regression; Classification; Empirical risk minimization; Model evaluation; Cross validation; Model complexity; Large scale optimization; Stochastic gradient descent; Regularization; RIDGE and LASSO; Support Vector Machine; Kernel trick; Ensemble methods (Random forest, Aggregation, Boosting); Feature selection.
Prerequisites
Fundamentals of supervised learning, linear regression, logistic regression, fundamentals of optimization, Python programming