Episodes

Find all the episodes!

EP105 - WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words
·3 mins
EP104 - LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
·3 mins
EP103 - Generating Fine-Grained Human Motions Using ChatGPT-Refined Descriptions
·2 mins
EP102 - StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D
·2 mins
EP101 - Axiomatic Preference Modeling for Longform Question Answering
·2 mins
EP100 - Training Chain-of-Thought via Latent-Variable Inference
·3 mins
EP99 - Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models
·2 mins
EP98 - Voyager: An Open-Ended Embodied Agent with Large Language Models
·3 mins
EP97 - VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence
·2 mins
EP96 - DeepCache: Accelerating Diffusion Models for Free
·3 mins