english Icono del idioma   español Icono del idioma  

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/20.500.12008/31397 Cómo citar
Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.contributor.authorFuentes, Magdalena-
dc.contributor.authorSteers, Bea-
dc.contributor.authorZinemanas, Pablo-
dc.contributor.authorRocamora, Martín-
dc.contributor.authorBondi, Luca-
dc.contributor.authorWilkins, Julia-
dc.contributor.authorShi, Qianyi-
dc.contributor.authorHou, Yao-
dc.contributor.authorDas, Samarjit-
dc.contributor.authorSerra, Xavier-
dc.contributor.authorBello, Juan Pablo-
dc.date.accessioned2022-05-03T12:01:35Z-
dc.date.available2022-05-03T12:01:35Z-
dc.date.issued2022-
dc.identifier.citationFuentes, M., Steers, B., Zinemanas, P. y otros. Urban sound & sight : Dataset and benchmark for audio-visual urban scene understanding [en línea]. EN: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore, 23-27 may, pp 141-145. Piscataway, NJ : IEEE, 2022. DOI 10.1109/ICASSP43922.2022.9747644es
dc.identifier.urihttps://ieeexplore.ieee.org/document/9747644-
dc.identifier.urihttps://hdl.handle.net/20.500.12008/31397-
dc.description.abstractAutomatic audio-visual urban traffic understanding is a growing area of research with many potential applications of value to industry, academia, and the public sector. Yet, the lack of well-curated resources for training and evaluating models to research in this area hinders their development. To address this we present a curated audio-visual dataset, Urban Sound & Sight (Urbansas), developed for investigating the detection and localization of sounding vehicles in the wild. Urbansas consists of 12 hours of unlabeled data along with 3 hours of manually annotated data, including bounding boxes with classes and unique id of vehicles, and strong audio labels featuring vehicle types and indicating off-screen sounds. We discuss the challenges presented by the dataset and how to use its annotations for the localization of vehicles in the wild through audio models.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenes
dc.publisherIEEEes
dc.relation.ispartofICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore, 23-27 may 2022, pp. 141-145.es
dc.rightsLas obras depositadas en el Repositorio se rigen por la Ordenanza de los Derechos de la Propiedad Intelectual de la Universidad de la República.(Res. Nº 91 de C.D.C. de 8/III/1994 – D.O. 7/IV/1994) y por la Ordenanza del Repositorio Abierto de la Universidad de la República (Res. Nº 16 de C.D.C. de 07/10/2014)es
dc.subjectLocation awarenesses
dc.subjectTraininges
dc.subjectIndustrieses
dc.subjectAnnotationses
dc.subjectConferenceses
dc.subjectSignal processinges
dc.subjectBenchmark testinges
dc.subjectAudio-visuales
dc.subjectUrban researches
dc.subjectTraffices
dc.subjectDatasetes
dc.titleUrban sound & sight : Dataset and benchmark for audio-visual urban scene understandinges
dc.typePonenciaes
dc.contributor.filiacionFuentes Magdalena, New York University, New York, NY-
dc.contributor.filiacionSteers Bea, New York University, New York, NY-
dc.contributor.filiacionZinemanas Pablo, Universitat Pompeu Fabra, Barcelona, Spain-
dc.contributor.filiacionRocamora Martín, Universidad de la República (Uruguay). Facultad de Ingeniería.-
dc.contributor.filiacionBondi Luca, Bosch Research, Pittsburgh, PA, USA-
dc.contributor.filiacionWilkins Julia, New York University, New York, NY-
dc.contributor.filiacionShi Qianyi, New York University, New York, NY-
dc.contributor.filiacionHou Yao, New York University, New York, NY-
dc.contributor.filiacionDas Samarjit, Bosch Research, Pittsburgh, PA, USA-
dc.contributor.filiacionSerra Xavier, Universitat Pompeu Fabra, Barcelona, Spain-
dc.contributor.filiacionBello Juan Pablo, New York University, New York, NY-
dc.rights.licenceLicencia Creative Commons Atribución - No Comercial - Sin Derivadas (CC - By-NC-ND 4.0)es
dc.identifier.doi10.1109/ICASSP43922.2022.9747644-
Aparece en las colecciones: Publicaciones académicas y científicas - Instituto de Ingeniería Eléctrica

Ficheros en este ítem:
Fichero Descripción Tamaño Formato   
FSZRBWSHDSB22.pdfCamera-Ready5,55 MBAdobe PDFVisualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons