Multimodal AI Teaching AI to see, speak, and understand across text, images, video, and audio simultaneously. Topics (7) Beginner Document Understanding Multimodal Fundamentals Video Understanding Intermediate Audio & Speech Models Contrastive Learning Text-to-Video Vision Language Models