STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper โข 2501.02976 โข Published 5 days ago โข 44
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption Paper โข 2412.09283 โข Published about 1 month ago โข 19
StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors Paper โข 2412.11586 โข Published 26 days ago โข 11
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement Paper โข 2411.06558 โข Published Nov 10, 2024 โข 34
OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation Paper โข 2407.02371 โข Published Jul 2, 2024 โข 51
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network Paper โข 2406.18284 โข Published Jun 26, 2024 โข 19