Have you ever imagined an AI that isn't just smart at "reading" text, but also excels at "seeing" and interpreting images simultaneously? That is the core power of Vision Language Models (VLM).
AI Talent Factory (AITF) is back with another Expert Lecture session that will thoroughly dissect VLM technology alongside the expert himself, Prof. Drs. Ec. Ir. Riyanarto Sarno, M.Sc Ph.D. (Head of the Intelligent Information Management Laboratory, ITS).
In this session, participants won't just be introduced to the surface-level concepts, but will learn exactly how VLM bridges the two main pillars of AI: Computer Vision and Natural Language Processing (NLP). We will explore how these models are trained to process multi-modal inputs, empowering AI to analyze visuals and language in an integrated manner for various real-world case studies.