Music Generation

| Publish Date | Title | Authors | PDF | Code |
|---|---|---|---|---|
| 2025-06-26 | SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture | Kehan Sui et al. | 2506.21478v1 | null |
| 2025-06-26 | Localization-Based Beam Focusing in Near-Field Communications | Nima Mozaffarikhosravi et al. | 2506.21325v1 | null |
| 2025-06-26 | Exploring Adapter Design Tradeoffs for Low Resource Music Generation | Atharva Mehta et al. | 2506.21298v1 | null |
| 2025-06-26 | A Hierarchical Deep Learning Approach for Minority Instrument Detection | Dylan Sechet et al. | 2506.21167v1 | null |
| 2025-06-24 | Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation | Jun Wang et al. | 2506.19774v1 | null |
| 2025-06-24 | A Robust Method for Pitch Tracking in the Frequency Following Response using Harmonic Amplitude Summation Filterbank | Sajad Sadeghkhani et al. | 2506.19253v1 | null |
| 2025-06-23 | A Fourier Explanation of AI-music Artifacts | Darius Afchar et al. | 2506.19108v1 | null |
| 2025-06-23 | Benchmarking Music Generation Models and Metrics via Human Preference Studies | Florian Grötschla et al. | 2506.19085v1 | null |
| 2025-06-23 | LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR | Guang Yang et al. | 2506.19065v1 | null |
| 2025-06-23 | Let Your Video Listen to Your Music! | Xinyu Zhang et al. | 2506.18881v1 | null |
| 2025-06-23 | USAD: Universal Speech and Audio Representation via Distillation | Heng-Jui Chang et al. | 2506.18843v1 | null |
| 2025-06-23 | An Audio-centric Multi-task Learning Framework for Streaming Ads Targeting on Spotify | Shivam Verma et al. | 2506.18735v1 | null |
| 2025-06-23 | MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners | Fang-Duo Tsai et al. | 2506.18729v2 | null |
| 2025-06-23 | DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling | Anindita Ghosh et al. | 2506.18680v1 | null |
| 2025-06-23 | TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography | Yuqin Dai et al. | 2506.18671v3 | null |
| 2025-06-23 | Object-aware Sound Source Localization via Audio-Visual Scene Understanding | Sung Jin Um et al. | 2506.18557v2 | null |
| 2025-06-23 | AI-Generated Song Detection via Lyrics Transcripts | Markus Frohmann et al. | 2506.18488v1 | null |
| 2025-06-23 | Large-Scale Training Data Attribution for Music Generative Models via Unlearning | Woosung Choi et al. | 2506.18312v1 | null |
| 2025-06-22 | Two Sonification Methods for the MindCube | Fangzheng Liu et al. | 2506.18196v1 | null |
| 2025-06-22 | AI Harmonizer: Expanding Vocal Expression with a Generative Neurosymbolic Music AI System | Lancelot Blanchard et al. | 2506.18143v1 | null |
| 2025-06-22 | GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models | Julien Guinot et al. | 2506.17886v2 | null |
| 2025-06-21 | CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning | Angelos-Nikolaos Kanatas et al. | 2506.17818v1 | null |
| 2025-06-21 | SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding | Julien Guinot et al. | 2506.17815v1 | null |
| 2025-06-21 | Machine Learning-Based Near-Field Localization in Mixed LoS/NLoS Scenarios | Parisa Ramezani et al. | 2506.17810v1 | null |
| 2025-06-21 | Algebraic Structures in Microtonal Music | Veronica Flynn et al. | 2506.17778v1 | null |
| 2025-06-21 | A novel fast short-time root music method for vibration monitoring of high-speed spindles | Huiguang Zhang et al. | 2506.17600v1 | null |
| 2025-06-20 | Episode-specific Fine-tuning for Metric-based Few-shot Learners with Optimization-based Training | Xuanyu Zhuang et al. | 2506.17499v1 | null |
| 2025-06-20 | From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training | Mingyang Yao et al. | 2506.17497v1 | link |
| 2025-06-20 | Universal Music Representations? Evaluating Foundation Models on World Music Corpora | Charilaos Papaioannou et al. | 2506.17055v1 | link |
| 2025-06-20 | ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors | Junghyun Koo et al. | 2506.16889v2 | link |