Friday, May 3, 2024 12:15pm to 1pm
About this Event
2317 SPEEDWAY , Austin, Texas 78712
https://ifml.institute/events/generating-video-reflecting-two-year-odyssey #TexasAIAtlas Wang, Associate Professor, The University of Texas at Austin will recount the developmental trajectory of video generation models at Picsart AI Research over the past two years—a journey that has taken us from initial baselines to the frontiers of ultra-long video streaming and storytelling. Our inaugural project Text2Video-Zero, presented at ICCV 2023, marked a milestone as the first training-free video generator to leverage pre-trained Stable Diffusion models, serving as a versatile foundation for subsequent works and earning widespread acclaim. Building on this success, our team ventured into creating of the first open-source video generator capable of producing ultra-long sequences. Our new model, StreamingT2V, reliably generates up to 1200 frames—equating to a video duration of 2 minutes—with potential for scaling to even more prolonged timeframes. Concluding the talk, I will share personal insights and reflections gleaned from this intensive R&D period, while highlighting the untapped possibilities for the future video generation models.
0 people are interested in this event
User Activity
No recent activity