
Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Find the Best Multimodal AI Course for Your Goals

  • Status: New · Free Trial

    Packt

    Advanced Web Exploits, Python Scripting & Network Attacks

    Skills you'll gain: Penetration Testing, OSI Models, Exploitation techniques, Open Web Application Security Project (OWASP), TCP/IP, Network Protocols, Network Security, Vulnerability Scanning, Cybersecurity, Prompt Engineering, Large Language Modeling, Scripting, Python Programming, SQL

    Intermediate · Course · 1 - 3 Months

  • Status: New · Free Trial

    Packt

    Blender Foundations and Asset Management

    Skills you'll gain: 3D Assets, Computer Graphics, Visualization (Computer Graphics), Virtual Environment, File Management, Generative AI

    Intermediate · Course · 1 - 3 Months

  • Status: Preview

    H2O.ai

    H2O GPTe Learning Path

    Skills you'll gain: LLM Application, Large Language Modeling, AI Product Strategy, Generative AI, Web Applications, Artificial Intelligence, Prompt Engineering, Agentic systems, Information Architecture, Application Programming Interface (API), Automation, Data Analysis

    Intermediate · Course · 1 - 3 Months

  • Status: Preview

    Google Cloud

    Introduction to Image Generation - Français

    Skills you'll gain: Generative AI, Generative Model Architectures, Google Cloud Platform, Image Analysis, Applied Machine Learning, Unsupervised Learning

    Beginner · Course · 1 - 4 Weeks

  • Status: Free

    DeepLearning.AI

    Retrieval Optimization: Tokenization to Vector Quantization

    Skills you'll gain: Text Mining, Large Language Modeling, Performance Tuning, Generative AI

    Beginner · Project · Less Than 2 Hours

  • Status: New · Preview

    Whizlabs

    Getting Started with Amazon Bedrock

    Skills you'll gain: Amazon Bedrock, Responsible AI, AWS SageMaker, Amazon Web Services, Generative AI, Agentic systems, Generative AI Agents, Amazon CloudWatch, Amazon S3, Data Ethics, Artificial Intelligence, AWS Identity and Access Management (IAM), Prompt Engineering, Automation, Systems Integration, Scalability

    Intermediate · Course · 1 - 4 Weeks

  • Status: Preview

    Google Cloud

    Introduction to Image Generation - 简体中文

    Skills you'll gain: Generative AI, Generative Model Architectures, Image Analysis, Google Cloud Platform, Unsupervised Learning

    Beginner · Course · 1 - 4 Weeks

  • Status: New · Preview

    Universidades Anáhuac

    Inteligencia artificial generativa para alumnos

    Skills you'll gain: AI Product Strategy, Generative AI, ChatGPT, Prompt Engineering, Artificial Intelligence, Responsible AI, Data Ethics

    Beginner · Course · 1 - 4 Weeks

  • Status: Free Trial

    Google Cloud

    Feature Engineering 日本語版

    Skills you'll gain: Feature Engineering, Tensorflow, Data Processing, Data Transformation, Keras (Neural Network Library), Statistical Machine Learning, Applied Machine Learning, Machine Learning, Data Cleansing, Data Modeling

    Rating: 4.5 out of 5 stars · 10 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: New · Free Trial

    Dell

    Dell Technologies IT Sales Representative

    Skills you'll gain: Closing (Sales), Sustainable Business, Sales, Customer Relationship Building, Sales Strategy, Persuasive Communication, Emotional Intelligence, Influencing, Job Analysis, Data Centers, Customer Insights, Interviewing Skills, Sales Presentations, Public Speaking, Sales Prospecting, Negotiation, Account Management, Customer Relationship Management, Lead Generation, Digital Content

    Beginner · Professional Certificate · 3 - 6 Months

  • Status: Free Trial

    Google Cloud

    Gemini for Security Engineers

    Skills you'll gain: Google Gemini, Google Cloud Platform, Cloud Computing, Cloud Security, Generative AI, Vulnerability Assessments, Security Controls, Vulnerability Management, Security Engineering, Threat Detection

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview

    Universidad de los Andes

    Introducción al deep learning contemporáneo

    Skills you'll gain: Deep Learning, Generative Model Architectures, Image Analysis, Artificial Neural Networks, Artificial Intelligence and Machine Learning (AI/ML), Computer Vision, Network Architecture, Natural Language Processing, Machine Learning Algorithms, Time Series Analysis and Forecasting

    Build toward a degree

    Rating: 4.6 out of 5 stars · 8 reviews

    Beginner · Course · 1 - 4 Weeks


In summary, here are 10 of our most popular multimodal AI courses:

  • Advanced Web Exploits, Python Scripting & Network Attacks: Packt
  • Blender Foundations and Asset Management: Packt
  • H2O GPTe Learning Path: H2O.ai
  • Introduction to Image Generation - Français: Google Cloud
  • Retrieval Optimization: Tokenization to Vector Quantization: DeepLearning.AI
  • Getting Started with Amazon Bedrock: Whizlabs
  • Introduction to Image Generation - 简体中文: Google Cloud
  • Inteligencia artificial generativa para alumnos: Universidades Anáhuac
  • Feature Engineering 日本語版: Google Cloud
  • Dell Technologies IT Sales Representative: Dell


© 2025 Coursera Inc. All rights reserved.