Demuxed 2025: The media processing pipelines behind AI

Abstract In this talk we will be talking about the complex media processing pipelines behind media AI models (translations, lipsyncing, text to video, etc). AI Models are very picky on media specs (resolutions, frame rates, sample rates, colorspace, etc), also most of the times the AI tools that you see that, for instance, generates aContinue reading “Demuxed 2025: The media processing pipelines behind AI”

Buffer free Lounge: The media processing pipelines behind AI

Abstract Short interview where Allen Helton, Ecosystem Engineer at Momento, sits down with Jordi Cenzano, Software Engineer at Meta. Jordi previews his session on Meta’s AI video restyle, a feature that allows users to transform existing video content with AI-powered presets that change backgrounds, outfits, lighting, or even turning people into statues or animals. YoutubeContinue reading “Buffer free Lounge: The media processing pipelines behind AI”

Buffer free Lounge: The media processing pipelines behind AI

Abstract Short interview where Allen Helton, Ecosystem Engineer at Momento, sits down with Jordi Cenzano, Software Engineer at Meta. Jordi previews his session on Meta’s AI video restyle, a feature that allows users to transform existing video content with AI-powered presets that change backgrounds, outfits, lighting, or even turning people into statues or animals. YoutubeContinue reading “Buffer free Lounge: The media processing pipelines behind AI”

STSWE25: Scaling AI translations at Meta

Abstract We will focus on the challenges we faced from a media processing / scaling point of view, such as: inference latency and scheduling, voice isolation, media timing/alignment, alternate tracks delivery, instrumentation, model evaluation, etc. Youtube link Streaming tech Sweden

Video@Scale 2024: Scaling AI translations at Meta

Abstract In this talk we will show how we implemented a media processing pipeline to perform (autodub / lipsync) media inference at Meta scale. We will focus on the challenges we faced from a media processing / scaling point of view, such as: inference latency and scheduling, voice isolation, media timing/alignment, alternate tracks delivery, instrumentation,Continue reading “Video@Scale 2024: Scaling AI translations at Meta”

Demuxed 2023: MOQ

Abstract Media over QUIC (MOQ) is an IETF proposal for a media transport protocol that involves several big players of online video space (Meta, Youtube, Cisco, Akamai, etc). Ideally MOQ can accommodate most the online media use cases we see today: To start proving some of the novel ideas of the MOQ working group weContinue reading “Demuxed 2023: MOQ”

Sydney Video Nov 2022: RTMP GoAway (at Meta)

Abstract At Meta we implemented RTMP Go Away, this is a new mechanism that allows the live RTMP server to send a signal to the client indicating that it needs to reconnect. This allows the client to create a new connection at a logical media boundary, incurring zero data loss in your live streams. ThisContinue reading “Sydney Video Nov 2022: RTMP GoAway (at Meta)”