Designing and Understanding Deep Neural Networks

Foundation models (FMs), including GPT-4, DALL·E 3, and Sora, have revolutionized several domains such as computer vision, language technology, and robotics. These deep learning (DL) models, trained on extensive and diverse datasets, are versatile across numerous downstream tasks, underpinning cutting-edge systems with their remarkable generative and few-shot learning capabilities. This comprehensive course offers students foundational knowledge in modeling and system design of FMs, supplemented by practical experience. The curriculum incorporates covers design principles, best practices, and optimization strategies. The course concludes with a four-week project, enabling students to devise a DL-based research project or application, addressing an issue of their interest.

Calendar

Basics

Feb 17: Lecture 1 Introduction; HW 1 out
Feb 24: Lecture 2 ML Review
Mar 3: Lecture 3 Neural Networks
Mar 10: Lecture 4 Optimization

Models

Mar 17: Lecture 5 Building Blocks; HW 2 out
Mar 24: Lecture 6 ConvNets; HW 1 due
Mar 31: Lecture 7 RNNs
Apr 7: Lecture 8 Transformers; Project Proposal due

Applications

Apr 14: Lecture 9 Sequence to Sequence Models; HW 3 out
Apr 21: Lecture 10 Distribution Shift; HW 2 due
Apr 27: Lecture 11 Robustness
Apr 28: Lecture 12 Generative Models
May 12: Lecture 13 Self-Supervised Learning; HW 4 out
May 19: Lecture 14 Foundation Models; HW 3 due

Projects

May 26: Lecture 15 Project Presentation
June 2: Lecture 16 Project Presentation
June 16: Project Report due; HW 4 due