Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Find the Best Multimodal AI Course for Your Goals

  • Status: New · Free Trial

    University of Colorado Boulder

    Computer Vision

    Skills you'll gain: Computer Vision, Image Analysis, Computer Graphics, Visualization (Computer Graphics), Keras (Neural Network Library), Deep Learning, Generative Model Architectures, Artificial Intelligence and Machine Learning (AI/ML), Computer Science, Data Science, Tensorflow, Artificial Intelligence, Data Ethics, Applied Machine Learning, Data Processing, Unsupervised Learning, Statistical Methods, Linear Algebra, Supervised Learning, Probability Distribution

    Build toward a degree

    Rating: 4.3 out of 5 stars · 12 reviews

    Intermediate · Specialization · 1 - 3 Months

  • Status: New · Free Trial

    Pearson

    Programming Generative AI

    Skills you'll gain: Generative AI, Large Language Modeling, PyTorch (Machine Learning Library), Generative Model Architectures, Multimodal Prompts, Image Analysis, Computer Vision, Artificial Neural Networks, Natural Language Processing, Deep Learning, Prompt Engineering, Image Quality, Text Mining, Data Manipulation, Unsupervised Learning, Performance Tuning

    Intermediate · Specialization · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Building Multimodal Search and RAG

    Skills you'll gain: Multimodal Prompts, LLM Application, Large Language Modeling, Generative AI, Image Analysis, Applied Machine Learning, Unsupervised Learning, Unstructured Data

    Rating: 4.5 out of 5 stars · 34 reviews

    Intermediate · Project · Less Than 2 Hours

  • Status: New · Free Trial

    IBM

    Building AI Agents and Agentic Workflows

    Skills you'll gain: LangChain, Tool Calling, LangGraph, LLM Application, Agentic systems, Generative AI Agents, Responsible AI, Artificial Intelligence and Machine Learning (AI/ML), Generative AI, Application Design, Prompt Engineering, Large Language Modeling, Collaborative Software, Software Design Patterns, System Design and Implementation, Software Development, Python Programming, Application Development, Real Time Data, Data Science

    Rating: 4.8 out of 5 stars · 77 reviews

    Intermediate · Specialization · 1 - 3 Months

  • Status: New · Free Trial

    University of Colorado Boulder

    Modern AI Models for Vision and Multimodal Understanding

    Skills you'll gain: Generative Model Architectures, Artificial Intelligence and Machine Learning (AI/ML), Unsupervised Learning, Linear Algebra, Supervised Learning

    Build toward a degree

    Advanced · Course · 1 - 4 Weeks

  • Status: New · Free Trial

    IBM

    Build Multimodal Generative AI Applications

    Skills you'll gain: Multimodal Prompts, LLM Application, OpenAI, Prompt Engineering, Web Applications, Flask (Web Framework), Application Deployment, Web Development, Software Development

    Rating: 4.7 out of 5 stars · 23 reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial

    Microsoft

    Generative AI for Sales Professionals

    Skills you'll gain: Microsoft Copilot, Forecasting, Sales Strategy, Sales Presentation, Customer Analysis, Sales Pipelines, Sales Enablement, Data Cleansing, Sales Management, Time Series Analysis and Forecasting, Responsible AI, Sales, Customer Relationship Management (CRM) Software, Taking Meeting Minutes, Microsoft Teams, Email Automation, Customer Insights, Data Quality, Meeting Facilitation, Customer Data Management

    Rating: 4.5 out of 5 stars · 15 reviews

    Beginner · Specialization · 1 - 3 Months

  • Status: New · Free Trial

    IBM

    IBM RAG and Agentic AI

    Skills you'll gain: Prompt Engineering, LangChain, Tool Calling, LangGraph, Agentic systems, Multimodal Prompts, Generative AI, LLM Application, Generative AI Agents, Responsible AI, OpenAI, Artificial Intelligence and Machine Learning (AI/ML), Application Design, Application Development, Large Language Modeling, UI Components, Semantic Web, Data Storage Technologies, Databases, Software Development

    Rating: 4.6 out of 5 stars · 325 reviews

    Advanced · Professional Certificate · 3 - 6 Months

  • Status: New · Preview

    AI CERTs

    AI For All

    Skills you'll gain: Responsible AI, AI Product Strategy, ChatGPT, Artificial Intelligence and Machine Learning (AI/ML), Business Process Automation, Technology Strategies, AI Personalization, Emerging Technologies, Business Strategy, Ethical Standards And Conduct, Innovation, Business Planning, Computer Science, Workforce Development, Organizational Strategy, Creative Problem-Solving, Natural Language Processing, Analysis

    Beginner · Course · 1 - 3 Months

  • Status: New · Free Trial

    Microsoft

    AI-powered Customer Intelligence with Microsoft Copilot

    Skills you'll gain: Microsoft Copilot, Prompt Engineering, Customer Insights, Sales Strategy, Customer Analysis, Competitive Analysis, Sales Pipelines, Microsoft 365, Persona Development, Data Cleansing, Sales Management, Data Quality, Sales, Customer Relationship Management (CRM) Software, Anomaly Detection, Data Ethics, Generative AI, Marketing Analytics, Marketing Design, Marketing Automation

    Rating: 4.6 out of 5 stars · 61 reviews

    Beginner · Specialization · 3 - 6 Months

  • Status: Free Trial

    Vanderbilt University

    Agentic AI and AI Agents for Leaders

    Skills you'll gain: Prompt Engineering, ChatGPT, Generative AI Agents, Prompt Patterns, Generative AI, Workflow Management, Agentic systems, LLM Application, Productivity, OpenAI, Artificial Intelligence, AI Personalization, Business Process Automation, AI Product Strategy, Personalized Service, Large Language Modeling, Automation, Responsible AI, Artificial Intelligence and Machine Learning (AI/ML), Expense Management

    Rating: 4.8 out of 5 stars · 7.9K reviews

    Beginner · Specialization · 1 - 3 Months

  • Status: Free Trial

    Vanderbilt University

    Generative AI Automation

    Skills you'll gain: Prompt Engineering, ChatGPT, Prompt Patterns, Ideation, Verification And Validation, Data Presentation, LLM Application, Productivity, OpenAI, Generative AI, Document Management, Responsible AI, Data Synthesis, Image Analysis, Data Capture, Large Language Modeling, Data Analysis, Organizational Skills, Risk Management Framework, Artificial Intelligence

    Rating: 4.8 out of 5 stars · 7.8K reviews

    Beginner · Specialization · 3 - 6 Months

Searches related to multimodal AI

build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
build a diy multimodal question answering system with vertex ai

In summary, here are 10 of our most popular multimodal AI courses:

  • Computer Vision: University of Colorado Boulder
  • Programming Generative AI: Pearson
  • Building Multimodal Search and RAG: DeepLearning.AI
  • Building AI Agents and Agentic Workflows: IBM
  • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
  • Build Multimodal Generative AI Applications: IBM
  • Generative AI for Sales Professionals: Microsoft
  • IBM RAG and Agentic AI: IBM
  • AI For All: AI CERTs
  • AI-powered Customer Intelligence with Microsoft Copilot: Microsoft

Frequently Asked Questions about Multimodal AI

The Multimodal AI courses below are popular starting points on Coursera.

  • Building Multimodal Search and RAG: DeepLearning.AI
  • Modern AI Models for Vision and Multimodal Understanding: University of Colorado Boulder
  • Build Multimodal Generative AI Applications: IBM

Yes, you can start learning Multimodal AI on Coursera for free by accessing the first module of many courses at no cost. This includes video lessons, readings, and even graded assignments, plus Coursera Coach support when available. If you want to keep learning, earn a certificate, or unlock the full course, you can upgrade or apply for financial aid.

The specific skills and knowledge you will gain depend on the course you enroll in. Common skills include designing multimodal models; combining text, images, audio, and video; building multimodal applications; and applying them to chatbots, search, and creative tools.

This FAQ content has been made available for informational purposes only. Learners are advised to conduct additional research to ensure that courses and other credentials pursued meet their personal, professional, and financial goals.

© 2025 Coursera Inc. All rights reserved.