Wijngaard, G., Formisano, E., Esposito, M., & Dumontier, M. (2025). Audio-Language Datasets of Scenes and Events: A Survey. IEEE Access, 13, 20328–20360. https://doi.org/10.1109/ACCESS.2025.3534621