Hi, I'm Guillem
Research Engineer
Research Engineer at BMAT. PhD from the Music Technology Group and former intern at Deezer Research.
Contact me
About Me
My introduction
I'm a Research Engineer at BMAT working on music identification and metadata reconciliation. I'm currently studying audio identification technologies and how to increase their robustness in noisy environments. Although I now focus on Music Information Retrieval tasks, my research interests go beyond that, as my previous work shows. I'm also a maintainer of Mirdata and Soundata, two open-source libraries for downloading, loading, and working with music and sound datasets.
Skills
My datasheet
Coding
Programming languages and tools
Python
DL libraries
Django
Bash
HTML, CSS, JS
Software Stack
Tech components that I use regularly
Docker
Git
AWS (ECR, EC2, S3)
Languages
Languages I can communicate with
Catalan
Native
Spanish
Native
English
Proficient
Selected Publications
A selection of my publications. You can find the full list in my scholar profile.
G. Cortès-Sebastià (2025). "Music identification with audio fingerprinting: an industrial perspective". PhD Thesis. [link]
G. Cortès-Sebastià, B. Martin, E. Molina, X. Serra, R. Hennequin (2025). "PeakNetFP: Peak-based Neural Audio Fingerprinting Robust to Extreme Time Stretching". ISMIR 2025 [preprint, GitHub]
G. Cortès-Sebastià, M. Miron, E. Molina, A. Ciurana, X. Serra (2025). "Enhanced television broadcast monitoring with source separation-assisted audio fingerprinting: A case study". Multimedia Tools and Applications [article, GitHub]
R. O. Araz, G. Cortès-Sebastià, E. Molina, J. Serrà, X. Serra, Y. Mitsufuji, D. Bogdanov (2025). "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification". ISMIR 2025 [preprint, GitHub]
M. Fuentes, G. Plaja-Roglans, G. Cortès-Sebastià, T. Khandelwal, M. Miron, X. Serra, J. Bello, & J. Salamon. (2024). Soundata: Reproducible use of audio datasets. Journal of Open Source Software, 9(98), 6634. [paper, GitHub]
G. Cortès, A. Ciurana, E. Molina, M. Miron, O. Meyers, J. Six, X. Serra. (2022). "BAF: An Audio Fingerprinting Dataset for Broadcast Monitoring", In Proceedings of the 23rd International Society for Music Information Retrieval Conference, pp. 908–916. ISMIR 2022. Bengaluru, 4-8 December, 2022. [paper, GitHub]
G. Cortès. (2020). "Towards Robust End-to-End Speech Translation". MSc Thesis. [link]
J. Luque, G. Cortès, C. Segura, A. Maravilla, J. Esteban, J. Fabregat. (2018). "End-to-End Photoplethysmography (PPG) Based Biometric Authentication by Using Convolutional Neural Networks", 26th European Signal Processing Conference (EUSIPCO), 538-542. 2018. [paper]
G. Cortès, R. Blaauboer, J. Munday. (2018). Chirp Challenge winner. Abbey Road Hackathon. [press note]
Path
My personal journey
PhD in Audio Fingerprinting
MTG - UPF
2020 - 2025
MSc in Advanced Telecommunication Technologies
ETSETB - UPC · BarcelonaTech
2018 - 2020
BSc in Telecommunications Technologies and Services Engineering
ETSETB - UPC · BarcelonaTech
2013 - 2018
Professional Grade in Classical Guitar
Escola de Música Creu Alta
2007 - 2019
Research Engineer
R&D - BMAT
2025 - Now
Research PhD Internship
Deezer
2023 - 2024 (6 months)
Research Engineer (PhD Candidate)
R&D - BMAT
2020 - 2025
Research Assistant
School of Informatics - UEDIN
2020
Software Developer
Charts - BMAT
2018 - 2020
Research Assistant
Telefónica I+D
2017 - 2018
Laboratory Technician
GCEM Lab
2014 - 2015