• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
Log In
Join for Free
Coursera
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Skip to search results

Filter by

Subject
Required
 *

Language
Required
 *

The language used throughout the course, in both instruction and assessments.

Learning Product
Required
 *

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn career credentials while taking courses that count towards your Master’s degree.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.
Earn a university-issued career credential in a flexible, interactive format.

Level
Required
 *

Duration
Required
 *

Skills
Required
 *

Subtitles
Required
 *

Educator
Required
 *

Find the Best Multimodal AI Course for Your Goals

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Packt

    Blender Foundations and Asset Management

    Skills you'll gain: 3D Assets, Computer Graphics, Visualization (Computer Graphics), Virtual Environment, File Management, Generative AI

    Intermediate · Course · 1 - 3 Months

  • Status: Preview
    Preview
    G

    Google Cloud

    Introduction to Image Generation - 简体中文

    Skills you'll gain: Generative AI, Generative Model Architectures, Image Analysis, Google Cloud Platform, Unsupervised Learning

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    A

    Alibaba Cloud Academy

    Alibaba Cloud Native Solutions and Container Service

    Skills you'll gain: Cloud-Native Computing, Cloud Applications, Kubernetes, DevOps, Cloud Platforms, Containerization, Cloud Infrastructure, Cloud Security, Serverless Computing, Scalability, Artificial Intelligence and Machine Learning (AI/ML), Data Migration

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    Intro to TensorFlow 日本語版

    Skills you'll gain: Tensorflow, Keras (Neural Network Library), Google Cloud Platform, Jupyter, Data Pipelines, Deep Learning, Feature Engineering, Applied Machine Learning, Data Import/Export, Scalability

    3.8
    Rating, 3.8 out of 5 stars
    ·
    12 reviews

    Intermediate · Course · 1 - 3 Months

  • Status: Preview
    Preview
    D

    DeepLearning.AI

    Structurer des projets d’apprentissage automatique

    Skills you'll gain: Artificial Intelligence and Machine Learning (AI/ML), Applied Machine Learning, Debugging, Deep Learning, Machine Learning, MLOps (Machine Learning Operations), Data Science, Artificial Neural Networks, Test Case, Analysis, Algorithms, Performance Testing, Performance Tuning

    Beginner · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    Feature Engineering en Français

    Skills you'll gain: Feature Engineering, Tensorflow, MLOps (Machine Learning Operations), Dataflow, Data Processing, Dimensionality Reduction, Data Pipelines, Keras (Neural Network Library), Data Transformation, Applied Machine Learning, Data Modeling, Machine Learning

    Intermediate · Course · 1 - 3 Months

  • G

    Google Cloud

    Data Analysis with the FraudFinder Workshop

    Skills you'll gain: Fraud detection, Feature Engineering, Exploratory Data Analysis, Real Time Data, Applied Machine Learning, MLOps (Machine Learning Operations), Application Deployment, Jupyter, Data Analysis, System Monitoring, Data Warehousing, Machine Learning

    Beginner · Project · Less Than 2 Hours

  • Status: Preview
    Preview
    C

    Coursera Instructor Network

    OpenCL Programming

    Skills you'll gain: Field-Programmable Gate Array (FPGA), Scalability, Performance Tuning, C++ (Programming Language), Embedded Software, Computer Architecture, Cross Platform Development, Hardware Architecture, Application Development, C (Programming Language), Program Development, Application Performance Management

    Beginner · Course · 1 - 4 Weeks

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    Gemini for Security Engineers

    Skills you'll gain: Google Gemini, Google Cloud Platform, Cloud Computing, Cloud Security, Generative AI, Vulnerability Assessments, Security Controls, Vulnerability Management, Security Engineering, Threat Detection

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    G

    Google Cloud

    Introduction to Large Language Models - בעברית

    Skills you'll gain: Large Language Modeling, LLM Application, Natural Language Processing, Prompt Engineering, Google Cloud Platform, Generative AI

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Packt

    Advanced Web Exploits, Python Scripting & Network Attacks

    Skills you'll gain: Penetration Testing, OSI Models, Exploitation techniques, Open Web Application Security Project (OWASP), TCP/IP, Network Protocols, Network Security, Vulnerability Scanning, Cybersecurity, Prompt Engineering, Large Language Modeling, Scripting, Python Programming, SQL

    Intermediate · Course · 1 - 3 Months

  • Status: Free Trial
    Free Trial
    U

    Universidad Austral

    NLP System Architecture and Dev-Ops

    Skills you'll gain: Natural Language Processing, MLOps (Machine Learning Operations), Application Lifecycle Management, Systems Architecture, Application Development, Algorithms, Software Architecture, Artificial Intelligence and Machine Learning (AI/ML), Tensorflow, Software Development Life Cycle, Machine Learning

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal ai

build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
build a diy multimodal question answering system with vertex ai
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
1…201202203…231

In summary, here are 10 of our most popular multimodal ai courses

  • Blender Foundations and Asset Management: Packt
  • Introduction to Image Generation - 简体中文: Google Cloud
  • Alibaba Cloud Native Solutions and Container Service : Alibaba Cloud Academy
  • Intro to TensorFlow 日本語版: Google Cloud
  • Structurer des projets d’apprentissage automatique: DeepLearning.AI
  • Feature Engineering en Français: Google Cloud
  • Data Analysis with the FraudFinder Workshop: Google Cloud
  • OpenCL Programming: Coursera Instructor Network
  • Gemini for Security Engineers: Google Cloud
  • Introduction to Large Language Models - בעברית: Google Cloud

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Manage Cookie Preferences
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2025 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok