2026.2

Instructor


Humberto Sandmann	humbertors@insper.edu.br

Schedule


Lecture	-	-h00 -h00
Lecture	-	-h00 -h00
Office Hours	-	-h00 -h30

Final Grade

\[ \text{Final} = \left\{\begin{array}{lll} \text{Individual} \geq 5 \bigwedge \text{Team} \geq 5 & \implies & \displaystyle \frac{ \text{Individual} + \text{Team} } {2} \\ \\ \text{Otherwise} & \implies & \min\left(\text{Individual}, \text{Team}\right) \end{array}\right. \]

Syllabus 2026.2

Module 1

Foundations

Concepts & AI
Data
Preprocessing
Neural Networks
Perceptron
MLP
Optimization
Regularization
Metrics

Module 2

Deep Architectures

DL Layers
CNNs
Attention NEW
Transformers NEW
Transfer Learning NEW
LLMs NEW

Module 3

Generative Models

Overview
VAE
GAN
CLIP
Stable Diffusion
Flow Matching
Diffusion Transformers NEW
AR Generation NEW

Course Description

This course provides a comprehensive introduction to Artificial Neural Networks and Deep Learning, using modern frameworks (primarily PyTorch). Topics span mathematical foundations, core architectures (MLPs, CNNs, Transformers), attention mechanisms, generative models (GANs, VAEs, Diffusion, Flow Matching), Diffusion Transformers, and Large Language Models. Equal emphasis is placed on theoretical understanding and practical application.

Learning Objectives

By the end of this course, students will be able to:

Understand Fundamentals: explain gradient descent, backpropagation, activation functions, and regularization.
Master Key Architectures: describe and motivate MLPs, CNNs, Transformers, and LLMs.
Implement with PyTorch: train and debug deep learning models.
Evaluate Performance: apply appropriate metrics and regularization/optimization techniques.
Work with Generative Models: understand and apply GANs, VAEs, Diffusion, and Flow Matching.
Apply Transfer Learning: fine-tune pre-trained models using PEFT techniques (LoRA, QLoRA).
Understand LLMs: grasp architecture, training (RLHF, DPO), and applications of Large Language Models.
Critically Evaluate Research: read and assess current deep learning papers.

Bibliography

Core:

Fleuret, F. (2023). The Little Book of Deep Learning.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press.

Supplementary:

Nielsen, M. A. (2019). Neural Networks and Deep Learning.
Zhang, A. et al. (2024). Dive into Deep Learning.
Vaswani, A. et al. (2017). Attention Is All You Need.
Brown, T. et al. (2020). Language Models are Few-Shot Learners.