
Images, audio, and video are everywhere in modern orgs but most data pipelines weren't built for any of them.
We just launched a new short course with Snowflake on building multimodal data pipelines.
You’ll build systems that:
- Convert images and audio into structured text (OCR, ASR)
- Generate timestamped descriptions from video with Vision Language Models
- Retrieve across slides, audio, and video with a multimodal RAG pipeline
Taught by Gilberto Hernandez.
Enroll in "Building Multimodal Data Pipelines:" hubs.la/Q04d0QzW0
English
