• For Individuals
  • For Businesses
  • For Universities
  • For Governments
Coursera
Log In
Join for Free
Coursera
  • Browse
  • Multimodal Ai

Multimodal AI Courses

Multimodal AI courses can help you learn how to build and apply models that work with text, images, audio, and video together, and use them for real-world applications like chatbots, search, and creative tools.

Skip to search results

Filter by

Subject
Required
 *

Language
Required
 *

The language used throughout the course, in both instruction and assessments.

Learning Product
Required
 *

Build job-relevant skills in under 2 hours with hands-on tutorials.
Learn from top instructors with graded assignments, videos, and discussion forums.
Learn a new tool or skill in an interactive, hands-on environment.
Get in-depth knowledge of a subject by completing a series of courses and projects.
Earn career credentials from industry leaders that demonstrate your expertise.
Earn career credentials while taking courses that count towards your Master’s degree.
Earn your Bachelor’s or Master’s degree online for a fraction of the cost of in-person learning.
Earn a university-issued career credential in a flexible, interactive format.

Level
Required
 *

Duration
Required
 *

Skills
Required
 *

Subtitles
Required
 *

Educator
Required
 *

Find the Best Multimodal AI Course for Your Goals

  • G

    Google Cloud

    Configuring Vector Search in Spanner

    Skills you'll gain: Google Cloud Platform, Data Import/Export, Database Management, Databases, Generative AI, Data Storage Technologies, Artificial Intelligence

    Beginner · Project · Less Than 2 Hours

  • Status: New
    New
    G

    Google Cloud

    Agentes de IA generativa: transforme sua organização

    Skills you'll gain: Generative AI Agents, Generative AI, AI Product Strategy, Google Cloud Platform, LLM Application, Customer experience improvement, Artificial Intelligence, Innovation, Prompt Engineering, Organizational Strategy, Technology Strategies, Tool Calling

    Beginner · Course · 1 - 3 Months

  • Status: New
    New
    Status: Free Trial
    Free Trial
    P

    Packt

    Natural Language Processing - Transformers with Hugging Face

    Skills you'll gain: Large Language Modeling, Text Mining, Semantic Web, Generative AI, PyTorch (Machine Learning Library), Python Programming, Applied Machine Learning, Unsupervised Learning

    Intermediate · Course · 1 - 4 Weeks

  • P

    Packt

    Introduction to FinTech Using R

    Skills you'll gain: Shiny (R Package), FinTech, Financial Market, Financial Planning, Financial Forecasting, Statistical Programming, Asset Management, Financial Analysis, Artificial Intelligence, Web Applications, Portfolio Management, Predictive Modeling, Time Series Analysis and Forecasting, Algorithms

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Preview
    Preview
    M

    Microsoft

    Microsoft FluêncIA

    Skills you'll gain: Generative AI, Artificial Intelligence, AI Product Strategy, Responsible AI, Natural Language Processing, Ethical Standards And Conduct, Web Analytics and SEO

    Beginner · Course · 1 - 3 Months

  • Status: Free
    Free
    C

    Coursera Project Network

    Coding With Cody Sourcegraph: Optimise Open Source Code

    Skills you'll gain: Debugging, Development Environment, Open Source Technology, Integrated Development Environments, Computer Programming Tools, Software Development, Software Development Tools, Artificial Intelligence, Generative AI

    Intermediate · Guided Project · Less Than 2 Hours

  • G

    Google Cloud

    Analyze Customer Reviews with Gemini Using Python Notebooks

    Skills you'll gain: Google Gemini, Google Cloud Platform, Cloud Management, Applied Machine Learning, Big Data, Jupyter, Cloud Applications, LLM Application, Text Mining, Statistical Reporting, Machine Learning, SQL

    Intermediate · Project · Less Than 2 Hours

  • Status: Free Trial
    Free Trial
    G

    Google Cloud

    Görüntü Üretmeye Giriş

    Skills you'll gain: Generative AI, Generative Model Architectures, Google Cloud Platform, Prompt Engineering, Image Analysis

    Beginner · Course · 1 - 4 Weeks

  • Status: Preview
    Preview
    G

    Google Cloud

    Introduction to Gemini for Google Workspace - 简体中文

    Skills you'll gain: Google Gemini, Responsible AI, Google Workspace, Generative AI, Productivity Software, Gmail, Google Docs

    Beginner · Course · 1 - 4 Weeks

  • Status: New
    New
    Status: Free Trial
    Free Trial
    U

    University of Washington

    Guided Website Development Project

    Skills you'll gain: Web Content Accessibility Guidelines, Browser Compatibility, User Story, HTML and CSS, GitHub, Microsoft Copilot, Web Development, Web Design and Development, Web Design, Application Deployment, Responsive Web Design, Git (Version Control System), Front-End Web Development, Development Testing, Debugging, User Requirements Documents, Semantic Web

    Beginner · Course · 1 - 4 Weeks

  • G

    Google Cloud

    Feature Engineering - 한국어

    Skills you'll gain: Feature Engineering, Dataflow, Data Pipelines, Tensorflow, Data Processing, Data Transformation, Keras (Neural Network Library), Dimensionality Reduction, Machine Learning, Real Time Data, Scalability, Statistical Methods

    Intermediate · Course · 1 - 3 Months

  • Status: New
    New
    Status: Preview
    Preview
    S

    Simplilearn

    Fundamental of Reinforcement Training

    Skills you'll gain: Reinforcement Learning, Artificial Intelligence and Machine Learning (AI/ML), Artificial Intelligence, Applied Machine Learning, Machine Learning, Supervised Learning, Unsupervised Learning, Markov Model

    Beginner · Course · 1 - 4 Weeks

Searches related to multimodal ai

build multimodal generative ai applications
multimodal generative ai: vision, speech, and assistants
modern ai models for vision and multimodal understanding
introduction to vertex ai embeddings: text and multimodal
multimodal rag with gpt – build smarter search & ai systems
build a diy multimodal question answering system with vertex ai
multimodal retrieval augmented generation (rag) using the vertex ai gemini api
1…208209210…230

In summary, here are 10 of our most popular multimodal ai courses

  • Configuring Vector Search in Spanner: Google Cloud
  • Agentes de IA generativa: transforme sua organização: Google Cloud
  • Natural Language Processing - Transformers with Hugging Face: Packt
  • Introduction to FinTech Using R: Packt
  • Microsoft FluêncIA: Microsoft
  • Coding With Cody Sourcegraph: Optimise Open Source Code: Coursera Project Network
  • Analyze Customer Reviews with Gemini Using Python Notebooks: Google Cloud
  • Görüntü Üretmeye Giriş: Google Cloud
  • Introduction to Gemini for Google Workspace - 简体中文: Google Cloud
  • Guided Website Development Project: University of Washington

Other topics to explore

Arts and Humanities
338 courses
Business
1095 courses
Computer Science
668 courses
Data Science
425 courses
Information Technology
145 courses
Health
471 courses
Math and Logic
70 courses
Personal Development
137 courses
Physical Science and Engineering
413 courses
Social Sciences
401 courses
Language Learning
150 courses

Coursera Footer

Skills

  • Artificial Intelligence (AI)
  • Cybersecurity
  • Data Analytics
  • Digital Marketing
  • English Speaking
  • Generative AI (GenAI)
  • Microsoft Excel
  • Microsoft Power BI
  • Project Management
  • Python

Certificates & Programs

  • Google Cybersecurity Certificate
  • Google Data Analytics Certificate
  • Google IT Support Certificate
  • Google Project Management Certificate
  • Google UX Design Certificate
  • IBM Data Analyst Certificate
  • IBM Data Science Certificate
  • Machine Learning Certificate
  • Microsoft Power BI Data Analyst Certificate
  • UI / UX Design Certificate

Industries & Careers

  • Business
  • Computer Science
  • Data Science
  • Education & Teaching
  • Engineering
  • Finance
  • Healthcare
  • Human Resources (HR)
  • Information Technology (IT)
  • Marketing

Career Resources

  • Career Aptitude Test
  • Examples of Strengths and Weaknesses for Job Interviews
  • High-Income Skills to Learn
  • How Does Cryptocurrency Work?
  • How to Highlight Duplicates in Google Sheets
  • How to Learn Artificial Intelligence
  • Popular Cybersecurity Certifications
  • Preparing for the PMP Certification
  • Signs You Will Get the Job After an Interview
  • What Is Artificial Intelligence?

Coursera

  • About
  • What We Offer
  • Leadership
  • Careers
  • Catalog
  • Coursera Plus
  • Professional Certificates
  • MasterTrack® Certificates
  • Degrees
  • For Enterprise
  • For Government
  • For Campus
  • Become a Partner
  • Social Impact
  • Free Courses
  • Share your Coursera learning story

Community

  • Learners
  • Partners
  • Beta Testers
  • Blog
  • The Coursera Podcast
  • Tech Blog

More

  • Press
  • Investors
  • Terms
  • Privacy
  • Help
  • Accessibility
  • Contact
  • Articles
  • Directory
  • Affiliates
  • Modern Slavery Statement
  • Manage Cookie Preferences
Learn Anywhere
Download on the App Store
Get it on Google Play
Logo of Certified B Corporation
© 2025 Coursera Inc. All rights reserved.
  • Coursera Facebook
  • Coursera Linkedin
  • Coursera Twitter
  • Coursera YouTube
  • Coursera Instagram
  • Coursera TikTok