2025-06-26 |
SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture |
Kehan Sui et.al. |
2506.21478v1 |
null |
2025-06-26 |
Localization-Based Beam Focusing in Near-Field Communications |
Nima Mozaffarikhosravi et.al. |
2506.21325v1 |
null |
2025-06-26 |
Exploring Adapter Design Tradeoffs for Low Resource Music Generation |
Atharva Mehta et.al. |
2506.21298v1 |
null |
2025-06-26 |
A Hierarchical Deep Learning Approach for Minority Instrument Detection |
Dylan Sechet et.al. |
2506.21167v1 |
null |
2025-06-24 |
Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation |
Jun Wang et.al. |
2506.19774v1 |
null |
2025-06-24 |
A Robust Method for Pitch Tracking in the Frequency Following Response using Harmonic Amplitude Summation Filterbank |
Sajad Sadeghkhani et.al. |
2506.19253v1 |
null |
2025-06-23 |
A Fourier Explanation of AI-music Artifacts |
Darius Afchar et.al. |
2506.19108v1 |
null |
2025-06-23 |
Benchmarking Music Generation Models and Metrics via Human Preference Studies |
Florian Grötschla et.al. |
2506.19085v1 |
null |
2025-06-23 |
LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR |
Guang Yang et.al. |
2506.19065v1 |
null |
2025-06-23 |
Let Your Video Listen to Your Music! |
Xinyu Zhang et.al. |
2506.18881v1 |
null |
2025-06-23 |
USAD: Universal Speech and Audio Representation via Distillation |
Heng-Jui Chang et.al. |
2506.18843v1 |
null |
2025-06-23 |
An Audio-centric Multi-task Learning Framework for Streaming Ads Targeting on Spotify |
Shivam Verma et.al. |
2506.18735v1 |
null |
2025-06-23 |
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners |
Fang-Duo Tsai et.al. |
2506.18729v2 |
null |
2025-06-23 |
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling |
Anindita Ghosh et.al. |
2506.18680v1 |
null |
2025-06-23 |
TCDiff++: An End-to-end Trajectory-Controllable Diffusion Model for Harmonious Music-Driven Group Choreography |
Yuqin Dai et.al. |
2506.18671v3 |
null |
2025-06-23 |
Object-aware Sound Source Localization via Audio-Visual Scene Understanding |
Sung Jin Um et.al. |
2506.18557v2 |
null |
2025-06-23 |
AI-Generated Song Detection via Lyrics Transcripts |
Markus Frohmann et.al. |
2506.18488v1 |
null |
2025-06-23 |
Large-Scale Training Data Attribution for Music Generative Models via Unlearning |
Woosung Choi et.al. |
2506.18312v1 |
null |
2025-06-22 |
Two Sonification Methods for the MindCube |
Fangzheng Liu et.al. |
2506.18196v1 |
null |
2025-06-22 |
AI Harmonizer: Expanding Vocal Expression with a Generative Neurosymbolic Music AI System |
Lancelot Blanchard et.al. |
2506.18143v1 |
null |
2025-06-22 |
GD-Retriever: Controllable Generative Text-Music Retrieval with Diffusion Models |
Julien Guinot et.al. |
2506.17886v2 |
null |
2025-06-21 |
CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning |
Angelos-Nikolaos Kanatas et.al. |
2506.17818v1 |
null |
2025-06-21 |
SLAP: Siamese Language-Audio Pretraining Without Negative Samples for Music Understanding |
Julien Guinot et.al. |
2506.17815v1 |
null |
2025-06-21 |
Machine Learning-Based Near-Field Localization in Mixed LoS/NLoS Scenarios |
Parisa Ramezani et.al. |
2506.17810v1 |
null |
2025-06-21 |
Algebraic Structures in Microtonal Music |
Veronica Flynn et.al. |
2506.17778v1 |
null |
2025-06-21 |
A novel fast short-time root music method for vibration monitoring of high-speed spindles |
Huiguang Zhang et.al. |
2506.17600v1 |
null |
2025-06-20 |
Episode-specific Fine-tuning for Metric-based Few-shot Learners with Optimization-based Training |
Xuanyu Zhuang et.al. |
2506.17499v1 |
null |
2025-06-20 |
From Generality to Mastery: Composer-Style Symbolic Music Generation via Large-Scale Pre-training |
Mingyang Yao et.al. |
2506.17497v1 |
link |
2025-06-20 |
Universal Music Representations? Evaluating Foundation Models on World Music Corpora |
Charilaos Papaioannou et.al. |
2506.17055v1 |
link |
2025-06-20 |
ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors |
Junghyun Koo et.al. |
2506.16889v2 |
link |