Toward interpretable polyphonic sound event detection with attention maps based on local prototypes

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.12008/29961

Full metadata record

DC Field	Value	Language
dc.contributor.author	Zinemanas, Pablo	-
dc.contributor.author	Rocamora, Martín	-
dc.contributor.author	Fonseca, Eduardo	-
dc.contributor.author	Font, Frederic	-
dc.contributor.author	Serra, Xavier	-
dc.date.accessioned	2021-10-25T17:05:00Z	-
dc.date.available	2021-10-25T17:05:00Z	-
dc.date.issued	2021	-
dc.identifier.citation	Zinemanas, P., Rocamora, M., Fonseca, E. et al. Toward interpretable polyphonic sound event detection with attention maps based on local prototypes [online]. In: 6th Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2021, Barcelona, Spain, 15-19 Nov. 2021, pp. 50-54.	en
dc.identifier.uri	http://dcase.community/workshop2021/proceedings	-
dc.identifier.uri	http://dcase.community/workshop2021/	-
dc.identifier.uri	http://dcase.community/documents/workshop2021/proceedings/DCASE2021Workshop_Zinemanas_22.pdf	-
dc.identifier.uri	https://hdl.handle.net/20.500.12008/29961	-
dc.description.abstract	Understanding the reasons behind the predictions of deep neural networks is a pressing concern, as it can be critical in several application scenarios. In this work, we present a novel interpretable model for polyphonic sound event detection. It tackles one of the limitations of our previous work, i.e. the difficulty of handling a multi-label setting properly. The proposed architecture incorporates a prototype layer and an attention mechanism. The network learns a set of local prototypes in the latent space, each representing a patch in the input representation. It also learns attention maps for positioning the local prototypes and reconstructing the latent space. The predictions are then based solely on the attention maps. Thus, the explanations provided are the attention maps and the corresponding local prototypes. Moreover, one can reconstruct the prototypes to the audio domain for inspection. The results obtained in urban sound event detection are comparable to those of two opaque baselines, with fewer parameters and the added benefit of interpretability.	en
dc.format.extent	5 p.	es
dc.format.mimetype	application/pdf	es
dc.language.iso	en	es
dc.publisher	Universitat Pompeu Fabra	en
dc.relation.ispartof	6th Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2021, Barcelona, Spain, 15-19 nov. 2021, pp. 50-54.	es
dc.rights	Works deposited in the Repository are governed by the Ordinance on Intellectual Property Rights of the Universidad de la República (Res. No. 91 of C.D.C. of 8/III/1994 – D.O. 7/IV/1994) and by the Ordinance of the Open Repository of the Universidad de la República (Res. No. 16 of C.D.C. of 07/10/2014)	es
dc.subject	Interpretability	en
dc.subject	Sound event detection	en
dc.subject	Prototypes	en
dc.title	Toward interpretable polyphonic sound event detection with attention maps based on local prototypes	en
dc.type	Conference paper	es
dc.contributor.filiacion	Zinemanas Pablo, Universitat Pompeu Fabra, Barcelona, Spain	-
dc.contributor.filiacion	Rocamora Martín, Universidad de la República (Uruguay). Facultad de Ingeniería.	-
dc.contributor.filiacion	Fonseca Eduardo, Universitat Pompeu Fabra, Barcelona, Spain	-
dc.contributor.filiacion	Font Frederic, Universitat Pompeu Fabra, Barcelona, Spain	-
dc.contributor.filiacion	Serra Xavier, Universitat Pompeu Fabra, Barcelona, Spain	-
dc.rights.licence	Creative Commons Attribution - NonCommercial - NoDerivatives License (CC BY-NC-ND 4.0)	es
udelar.academic.department	Procesamiento de Señales	-
udelar.investigation.group	Procesamiento de Audio	-
Appears in collections:	Academic and scientific publications - Instituto de Ingeniería Eléctrica

Files in this item:

File	Description	Size	Format
ZRFFS21.pdf	Published version	723.73 kB	Adobe PDF

This item is licensed under a Creative Commons license.