Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published 26 days ago • 52
PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Paper • 2106.06406 • Published Jun 11, 2021
MuPT: A Generative Symbolic Music Pretrained Transformer Paper • 2404.06393 • Published Apr 9, 2024 • 15
D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models Paper • 2406.01375 • Published Jun 3, 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation Paper • 2405.15758 • Published May 24, 2024 • 1
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Paper • 2406.18009 • Published Jun 26, 2024 • 20
EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms Paper • 2406.14228 • Published Jun 20, 2024 • 1
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement Paper • 2406.08096 • Published Jun 12, 2024
PromptTTS: Controllable Text-to-Speech with Text Descriptions Paper • 2211.12171 • Published Nov 22, 2022
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS Paper • 2406.18009 • Published Jun 26, 2024 • 20
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Paper • 2406.05370 • Published Jun 8, 2024 • 15
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training Paper • 2403.00758 • Published Mar 1, 2024 • 2
TaskBench: Benchmarking Large Language Models for Task Automation Paper • 2311.18760 • Published Nov 30, 2023 • 2
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction Paper • 2401.06201 • Published Jan 11, 2024 • 2
CoMoSVC: Consistency Model-based Singing Voice Conversion Paper • 2401.01792 • Published Jan 3, 2024 • 8
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models Paper • 2304.00830 • Published Apr 3, 2023 • 2
DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021 Paper • 2110.12612 • Published Oct 25, 2021 • 1
Accuracy Prediction with Non-neural Model for Neural Architecture Search Paper • 2007.04785 • Published Jul 9, 2020 • 1