Audio Processing and Digital Acoustics

The audio processing group at LCAV performs research and education on various topics related to capturing, processing, coding, and rendering of acoustic signals with special focus on 3D-audio. We try to develop expertise in every aspect of this broad field, going from foundations of signal processing, through the physics of wave phenomena, all the way to the human auditory perception. The research is carried out in cooperation with partners from art, industry, and science.

Over the years, we have worked on a broad range of topics that includes:

Directional sound capture and playback (beamforming)
Room equalization and acoustic echo control
Room acoustics simulation
Virtual acoustics/auralization
Automatic multichannel format conversion (upmix)
Sound perception and spatial hearing
Sound field reproduction
Spatial audio coding
Spatial sampling and coding of sound fields

You can also consult our archives for a for a more detailed description of past projects.

Currently, LCAV focuses on various aspects of location-aware audio signal processing. We crafted this term to succinctly cover both typical and highly atypical problems where the terms sound and localization happen to coexist: from vanilla sound source localization with microphone arrays, through more unconventional simultaneous localization of sound sources and microphones and mapping of a room (the infamous acoustic SLAM), to finally localizing concurrent sound sources using a single, albeit unconventional microphone.

Recent LCAV publications in this area:

DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging

M. M. J-A. Simeoni; S. Kashani; P. Hurley; M. Vetterli

2019. Thirty-third Conference on Neural Information Processing Systems (NeurIPS), Vancouver, British Columbia, Canada, December 9-14, 2019.

Detailed record

Structure from sound with incomplete data

M. Krekovic; G. Baechler; I. Dokmanic; M. Vetterli

2018. 43rd International Conference on Acoustics, Speech and Signal Processing, Calgary, Alberta, Canada, April 15–20, 2018.

Detailed record

Acoustic DoA Estimation by One Unsophisticated Sensor

D. El Badawy; I. Dokmanic; M. Vetterli

2017. 13th International Conference on Latent Variable Analysis and Signal Separation, Grenoble, France, February 21-23, 2017. p. 89 – 98. DOI : 10.1007/978-3-319-53547-0_9.

Detailed record

View at publisher

Omnidirectional bats, point-to-plane distances, and the price of uniqueness

M. Krekovic; I. Dokmanic; M. Vetterli

2017. 42nd International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, March 5-9, 2017. p. 3261 – 3265. DOI : 10.1109/ICASSP.2017.7952759.

Detailed record

View at publisher

Hardware And Software For Reproducible Research In Audio Array Signal Processing

E. Bezzam; R. Scheibler; J. Azcarreta; H. Pan; M. Simeoni et al.

2017. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LANew Orleans, LA, USA, MAR 05-09, 20175-9 March, 2017. p. 6591 – 6592. DOI : 10.1109/ICASSP.2017.8005297.

Detailed record

View at publisher

FRIDA: FRI-Based DOA Estimation For Arbitrary Array Layouts

H. Pan; R. Scheibler; E. F. Bezzam; I. Dokmanic; M. Vetterli

2017. ICASSP 2017, New Orleans, USA, March 5-9, 2017. p. 3186 – 3190. DOI : 10.1109/ICASSP.2017.7952744.

Detailed record

View at publisher

From Acoustic Room Reconstruction to SLAM

I. Dokmanic; L. Daudet; M. Vetterli

2016. 41st International Conference on Acoustics, Speech, and Signal Processing, Shanghai, China, March 20-25, 2016. p. 6345 – 6349. DOI : 10.1109/ICASSP.2016.7472898.

Detailed record

View at publisher

Look, no beacons! Optimal all-in-one EchoSLAM

M. Krekovic; I. Dokmanic; M. Vetterli

2016. 50th Asilomar Conference on Signals, Systems, and Computers, Asilomar, Pacific Grove, CA, November 6-9, 2016.

Detailed record

EchoSLAM: Simultaneous Localization and Mapping with Acoustic Echoes

M. Krekovic; I. Dokmanic; M. Vetterli

2016. 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20-25 March 2016. p. 11 – 15. DOI : 10.1109/ICASSP.2016.7471627.

Detailed record

View at publisher

Raking echoes in the time domain

R. Scheibler; I. Dokmanic; M. Vetterli

2015. ICASSP 2015 – 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Queensland, Australia, 19-24 April 2015. p. 554 – 558. DOI : 10.1109/ICASSP.2015.7178030.

Detailed record

View at publisher

Raking the Cocktail Party

I. Dokmanic; R. Scheibler; M. Vetterli

IEEE Journal of Selected Topics in Signal Processing. 2015. Vol. 9, num. 5, p. 825 – 836. DOI : 10.1109/JSTSP.2015.2415761.

Detailed record

View at publisher

How to Localize Ten Microphones in One Fingersnap

I. Dokmanic; L. Daudet; M. Vetterli

2014. 22nd European Signal Processing Conference, Lisbon, Portugal, September 1-5, 2014. p. 2275 – 2279.

Detailed record

Digital acoustics: processing wave fields in space and time using DSP tools

F. Pinto; M. Kolundzija; M. Vetterli

APSIPA Transactions on Signal and Information Processing. 2014. Vol. 3, num. e18, p. 1 – 21. DOI : 10.1017/ATSIP.2014.13.

Detailed record

View at publisher

Acoustic Echoes Reveal Room Shape

I. Dokmanic; R. Parhizkar; A. Walther; Y. M. Lu; M. Vetterli

Proceedings Of The National Academy Of Sciences Of The United States Of America (PNAS). 2013. Vol. 110, num. 30, p. 12186 – 12191. DOI : 10.1073/pnas.1221464110.

Detailed record

View at publisher

Multi-channel low-frequency room equalization using perceptually motivated constrained optimization

M. Kolundzija; C. Faller; M. Vetterli

2012. IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan, March 25-30, 2012. p. 533 – 536. DOI : 10.1109/ICASSP.2012.6287934.

Detailed record

View at publisher

Reproducing Sound Fields Using MIMO Acoustic Channel Inversion

M. Kolundzija; C. Faller; M. Vetterli

Journal of the Audio Engineering Society. 2011. Vol. 59, num. 10, p. 721 – 734.

Detailed record

Spatiotemporal Gradient Analysis of Differential Microphone Arrays

M. Kolundzija; C. Faller; M. Vetterli

Journal of the Audio Engineering Society. 2011. Vol. 52, num. 1/2, p. 20 – 28.

Detailed record

Design of a Compact Cylindrical Loudspeaker Array for Spatial Sound Reproduction

M. Kolundzija; C. Faller; M. Vetterli

2011. AES 130th Convention, May 13-16, 2011.

Detailed record

LCAV-APDA