• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
Log In
Join for Free
Coursera
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Skip to search results

Filter by

Subject
Required
 *

Language
Required
 *

The language used throughout the course, in both instruction and assessments.

Learning Product
Required
 *

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn career credentials while taking courses that count towards your Master’s degree.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.
Earn a university-issued career credential in a flexible, interactive format.

Level
Required
 *

Duration
Required
 *

Skills
Required
 *

Subtitles
Required
 *

Educator
Required
 *

Find the Best Multimodal AI Course for Your Goals

  • Status: Free Trial
    Free Trial
    V

    Vanderbilt University

    Agentic AI and AI Agents for Leaders

    Skills you'll gain: Prompt Engineering, ChatGPT, Generative AI Agents, Prompt Patterns, Generative AI, Workflow Management, Agentic systems, LLM Application, Productivity, OpenAI, Artificial Intelligence, AI Personalization, Business Process Automation, AI Product Strategy, Personalized Service, Large Language Modeling, Automation, Responsible AI, Artificial Intelligence and Machine Learning (AI/ML), Expense Management

    4.8
    Rating, 4.8 out of 5 stars
    ·
    7.9K reviews

    Beginner · Specialization · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    C

    Coursera

    Generative AI and Large Language Models

    Skills you'll gain: Generative AI, Generative Model Architectures, Large Language Modeling, LLM Application, OpenAI, Multimodal Prompts, Responsible AI, Prompt Engineering, PyTorch (Machine Learning Library), Natural Language Processing, Image Analysis, Application Deployment

    Intermediate · Course · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    E

    Edureka

    Generative AI for Automation

    Skills you'll gain: Prompt Patterns, Generative AI Agents, Business Process Automation, Make.com, Large Language Modeling, Automation, ChatGPT, Microsoft Power Automate/Flow, LLM Application, LangChain, Responsible AI, Workflow Management, OpenAI, Tool Calling, No-Code Development, Multimodal Prompts, Slack (Software), Process Optimization, Application Programming Interface (API), Decision Support Systems

    4.4
    Rating, 4.4 out of 5 stars
    ·
    7 reviews

    Beginner · Specialization · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Pearson

    Programming Generative AI: Unit 3

    Skills you'll gain: Multimodal Prompts, Generative AI, Generative Model Architectures, Image Analysis, Prompt Engineering, Image Quality, Computer Vision, Deep Learning, Natural Language Processing, Performance Tuning

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    I

    IBM

    Generative AI for Executives and Business Leaders

    Skills you'll gain: Prompt Engineering, Responsible AI, AI Product Strategy, Risk Mitigation, Generative AI, Risk Analysis, Feasibility Studies, Data Ethics, Brainstorming, Generative AI Agents, Business Priorities, Return On Investment, Data Strategy, Business Leadership, Goal Setting, Artificial Intelligence, Business Solutions, Business Strategy, Scalability, Business Transformation

    4.7
    Rating, 4.7 out of 5 stars
    ·
    577 reviews

    Intermediate · Specialization · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    C

    Codio

    Multimodal Generative AI: Vision, Speech, and Assistants

    Skills you'll gain: OpenAI, Image Analysis, Generative AI, ChatGPT, LLM Application, Multimodal Prompts, Tool Calling, Application Programming Interface (API), Large Language Modeling, Generative AI Agents, Artificial Intelligence, Natural Language Processing, Computer Vision

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    S

    Starweaver

    Executive AI Leadership Mastery

    Skills you'll gain: Responsible AI, Technology Roadmaps, Organizational Change, Stakeholder Engagement, Google Gemini, Anthropic Claude, Business Strategy, Strategic Leadership, Business Leadership, ChatGPT, Leadership, Business Transformation, Change Management, Content Strategy, Corporate Communications, Digital Media Strategy, Non-Verbal Communication, Verbal Communication Skills, Communication Strategies, Communication

    4
    Rating, 4 out of 5 stars
    ·
    8 reviews

    Intermediate · Specialization · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    A

    AI CERTs

    AI For All

    Skills you'll gain: Responsible AI, AI Product Strategy, ChatGPT, Artificial Intelligence and Machine Learning (AI/ML), Business Process Automation, Technology Strategies, AI Personalization, Emerging Technologies, Business Strategy, Ethical Standards And Conduct, Innovation, Business Planning, Computer Science, Workforce Development, Organizational Strategy, Creative Problem-Solving, Natural Language Processing, Analysis

    Beginner · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    V

    Vanderbilt University

    Agentic AI and AI Agents: A Primer for Leaders

    Skills you'll gain: Generative AI Agents, Workflow Management, Agentic systems, Artificial Intelligence, Business Process Automation, Generative AI, AI Product Strategy, Automation, Prompt Engineering, Technology Strategies, Decision Support Systems, Emerging Technologies, Responsible AI

    4.7
    Rating, 4.7 out of 5 stars
    ·
    793 reviews

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    I

    IBM

    Agentic AI with LangGraph, CrewAI, AutoGen and BeeAI

    Skills you'll gain: Agentic systems, Generative AI Agents, LLM Application, Application Design, Tool Calling, Large Language Modeling, Software Design Patterns, Data Validation

    4.9
    Rating, 4.9 out of 5 stars
    ·
    31 reviews

    Intermediate · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    V

    Vanderbilt University

    Generative AI Assistants

    Skills you'll gain: Prompt Engineering, ChatGPT, Prompt Patterns, Generative AI, Ideation, Verification And Validation, LLM Application, Productivity, OpenAI, AI Personalization, Responsible AI, Personalized Service, Large Language Modeling, Artificial Intelligence, Risk Management Framework, Artificial Intelligence and Machine Learning (AI/ML), Expense Management, Creative Thinking, Productivity Software, Creative Problem-Solving

    4.8
    Rating, 4.8 out of 5 stars
    ·
    7.7K reviews

    Beginner · Specialization · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    M

    Microsoft

    Foundations of AI and Machine Learning

    Skills you'll gain: Data Management, Artificial Intelligence and Machine Learning (AI/ML), Infrastructure Architecture, MLOps (Machine Learning Operations), Application Deployment, Data Processing, Data Cleansing, Artificial Intelligence, Data Security, Application Frameworks, PyTorch (Machine Learning Library), Machine Learning, Tensorflow, Applied Machine Learning, Data Pipelines, Scalability

    4.5
    Rating, 4.5 out of 5 stars
    ·
    175 reviews

    Intermediate · Course · 1 - 3 Months

Searches related to multimodal ai

build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
build a diy multimodal question answering system with vertex ai
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
1234…228

In summary, here are 10 of our most popular multimodal ai courses

  • Agentic AI and AI Agents for Leaders: Vanderbilt University
  • Generative AI and Large Language Models: Coursera
  • Generative AI for Automation: Edureka
  • Programming Generative AI: Unit 3: Pearson
  • Generative AI for Executives and Business Leaders: IBM
  • Multimodal Generative AI: Vision, Speech, and Assistants: Codio
  • Executive AI Leadership Mastery: Starweaver
  • AI For All: AI CERTs
  • Agentic AI and AI Agents: A Primer for Leaders: Vanderbilt University
  • Agentic AI with LangGraph, CrewAI, AutoGen and BeeAI: IBM

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Manage Cookie Preferences
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2025 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok