Designing and Understanding Deep Neural Networks
Foundation models (FMs), including GPT-4, DALL·E 3, and Sora, have revolutionized several domains such as computer vision, language technology, and robotics. These deep learning (DL) models, trained on extensive and diverse datasets, are versatile across numerous downstream tasks, underpinning cutting-edge systems with their remarkable generative and few-shot learning capabilities. This comprehensive course offers students foundational knowledge in modeling and system design of FMs, supplemented by practical experience. The curriculum incorporates covers design principles, best practices, and optimization strategies. The course concludes with a four-week project, enabling students to devise a DL-based research project or application, addressing an issue of their interest.
Calendar
Basics
- Feb 17
- Lecture 1
Introduction
- HW 1 out
- Feb 24
- Lecture 2
ML Review
- Mar 3
- Lecture 3
Neural Networks
- Mar 10
- Lecture 4
Optimization
Models
- Mar 17
- Lecture 5
Building Blocks
- HW 2 out
- Mar 24
- Lecture 6
ConvNets
- HW 1 due
- Mar 31
- Lecture 7
RNNs
- Apr 7
- Lecture 8
Transformers
- Project Proposal due
Applications
- Apr 14
- Lecture 9
Sequence to Sequence Models
- HW 3 out
- Apr 21
- Lecture 10
Distribution Shift
- HW 2 due
- Apr 28
- Lecture 11
Robustness
- May 5
- Lecture 12
Generative Models
- May 12
- Lecture 13
Self-Supervised Learning
- HW 4 out
- May 19
- Lecture 14
Foundation Models
- HW 3 due
Projects
- May 26
- Lecture 15
Project Presentation
- June 2
- Lecture 16
Project Presentation
- June 16
- Project Report due
- HW 4 due