2025-06-26 | SiM3D: Single-instance Multiview Multimodal and Multisetup 3D Anomaly Detection Benchmark | Alex Costanzino et al. | 2506.21549v1 | null
2025-06-26 | HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation | Xinzhuo Li et al. | 2506.21546v1 | null
2025-06-26 | DeOcc-1-to-3: 3D De-Occlusion from a Single Image via Self-Supervised Multi-View Diffusion | Yansong Qu et al. | 2506.21544v1 | null
2025-06-26 | StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning | Chuxin Wang et al. | 2506.21541v1 | null
2025-06-26 | WorldVLA: Towards Autoregressive Action World Model | Jun Cen et al. | 2506.21539v1 | null
2025-06-26 | Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval | Hani Alomari et al. | 2506.21538v1 | null
2025-06-26 | Exploring the Design Space of 3D MLLMs for CT Report Generation | Mohammed Baharoon et al. | 2506.21535v1 | null
2025-06-26 | The Kaleidoscope Survey: Strong Gravitational Lensing in Galaxy Clusters with Radial Arcs | Catherine Cerny et al. | 2506.21531v1 | null
2025-06-26 | Revealing electron-lattice decoupling by Peltier thermometry and nanoscale thermal imaging in graphene | Saurabh Kumar Srivastav et al. | 2506.21523v1 | null
2025-06-26 | G$^{2}$D: Boosting Multimodal Learning with Gradient-Guided Distillation | Mohammed Rakib et al. | 2506.21514v1 | null
2025-06-26 | Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration | Jiahe Chen et al. | 2506.21509v1 | null
2025-06-26 | Process mining-driven modeling and simulation to enhance fault diagnosis in cyber-physical systems | Francesco Vitale et al. | 2506.21502v1 | null
2025-06-26 | Devising a solution to the problems of Cancer awareness in Telangana | Priyanka Avhad et al. | 2506.21500v1 | null
2025-06-26 | Lightweight Physics-Informed Zero-Shot Ultrasound Plane Wave Denoising | Hojat Asgariandehkordi et al. | 2506.21499v1 | null
2025-06-26 | Evolution of boundedly rational learning in games | Marta C. Couto et al. | 2506.21498v1 | null
2025-06-26 | Towards Reliable Detection of Empty Space: Conditional Marked Point Processes for Object Detection | Tobias J. Riedlinger et al. | 2506.21486v1 | null
2025-06-26 | TITAN: Query-Token based Domain Adaptive Adversarial Learning | Tajamul Ashraf et al. | 2506.21484v1 | null
2025-06-26 | An equation-based batch distillation simulation to evaluate the effect of multiplicities in thermodynamic activity coefficients | Jennifer Werner et al. | 2506.21483v1 | null
2025-06-26 | SmoothSinger: A Conditional Diffusion Model for Singing Voice Synthesis with Multi-Resolution Architecture | Kehan Sui et al. | 2506.21478v1 | null
2025-06-26 | NLO QCD effects on angular observables in $e^-p \to e^-(\nu_e)Hj$ in presence of non-standard $HVV$ couplings | Biswajit Das et al. | 2506.21472v1 | null
2025-06-26 | TopK Language Models | Ryosuke Takahashi et al. | 2506.21468v1 | null
2025-06-26 | Optimising 4th-Order Runge-Kutta Methods: A Dynamic Heuristic Approach for Efficiency and Low Storage | Gavin Lee Goodship et al. | 2506.21465v1 | null
2025-06-26 | Aligning Spoken Dialogue Models from User Interactions | Anne Wu et al. | 2506.21463v1 | null
2025-06-26 | Wild refitting for black box prediction | Martin J. Wainwright et al. | 2506.21460v1 | null
2025-06-26 | Spatial Mental Modeling from Limited Views | Baiqiao Yin et al. | 2506.21458v1 | null
2025-06-26 | A Comprehensive Dataset for Underground Miner Detection in Diverse Scenario | Cyrus Addy et al. | 2506.21451v1 | null
2025-06-26 | ThinkSound: Chain-of-Thought Reasoning in Multimodal Large Language Models for Audio Generation and Editing | Huadai Liu et al. | 2506.21448v1 | null
2025-06-26 | Controllable 3D Placement of Objects with Scene-Aware Diffusion Models | Mohamed Omran et al. | 2506.21446v1 | null
2025-06-26 | Benchmarking Deep Learning and Vision Foundation Models for Atypical vs. Normal Mitosis Classification with Cross-Dataset Evaluation | Sweta Banerjee et al. | 2506.21444v1 | null
2025-06-26 | Learnable Adaptive Time-Frequency Representation via Differentiable Short-Time Fourier Transform | Maxime Leiber et al. | 2506.21440v1 | null