english Icono del idioma   español Icono del idioma  

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/20.500.12008/40734 Cómo citar
Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.contributor.advisorCancela, Pablo-
dc.contributor.advisorCapdehourat, Germán-
dc.contributor.authorRíos, Braulio-
dc.date.accessioned2023-10-19T12:14:21Z-
dc.date.available2023-10-19T12:14:21Z-
dc.date.issued2023-
dc.identifier.citationRíos, B. Audio-based classroom activity detection for primary school lessons [en línea]. Tesis de maestría. Montevideo : Udelar. FI., 2023.es
dc.identifier.issn1688-2806-
dc.identifier.urihttps://hdl.handle.net/20.500.12008/40734-
dc.description.abstractClassroom Activity Detection (CAD) is a challenging task, especially for primary school lessons, where student participation is fragmented, short, and often concurrent with teacher speech and background noise. This thesis proposes and evaluates three CAD models: two based on supervised audio classification (trained on a proprietary dataset that was annotated for this work), and one based on unsupervised diarization. These models are assessed through the visualization of the estimated label density, rather than typical CAD segment visualizations. This approach proves to be more effective in dealing with the highly fragmented segments observed in this specific use case. The main metric to compare these models is the correlation coefficient between estimated and ground-truth label densities. The density and correlation are used to evaluate the accuracy of the models in capturing the temporal distribution of the different classroom activities. Complimentary to that, another metric that is also used is the error in the total time estimated for each label (e.g., estimated Teacher Talking Time or TTT). The supervised models, based on an LSTM neural network and a decision tree classifier, achieve similar classification performance, outperforming the unsupervised diarization pipeline. Even a small amount of training data is enough for the supervised models to achieve the performance of the diarization system, and they generalize well to previously unseen voices. The unsupervised diarization model does not require training data for this particular task, but its performance is not as good as the supervised models to detect the teacher’s voice. Additionally, it cannot distinguish properly between the labels “single student” and “group work”. Overall, the supervised CAD models proposed in this thesis demonstrate promising results for primary school lessons, even with limited training data. These models could be used to develop valuable tools to support classroom observation and evaluation.es
dc.description.sponsorshipBeca de Maestría ANIIes
dc.format.extent100 p.es
dc.format.mimetypeapplication/pdfes
dc.language.isoenes
dc.publisherUdelar.FI.es
dc.rightsLas obras depositadas en el Repositorio se rigen por la Ordenanza de los Derechos de la Propiedad Intelectual de la Universidad de la República.(Res. Nº 91 de C.D.C. de 8/III/1994 – D.O. 7/IV/1994) y por la Ordenanza del Repositorio Abierto de la Universidad de la República (Res. Nº 16 de C.D.C. de 07/10/2014)es
dc.subjectClassroom activity detectiones
dc.subjectClassroom monitoringes
dc.subjectDiarizationes
dc.subjectAudio classificationes
dc.subjectCeibales
dc.subjectEdteches
dc.subjectEducational technologyes
dc.subjectPrimary school educationes
dc.subjectLSTMes
dc.subjectSpeech processinges
dc.subjectMachine learninges
dc.subjectSupervised learninges
dc.subjectUnsupervised learninges
dc.subjectAudio processinges
dc.titleAudio-based classroom activity detection for primary school lessonses
dc.typeTesis de maestríaes
dc.contributor.filiacionRíos Braulio, Universidad de la República (Uruguay). Facultad de Ingeniería.-
thesis.degree.grantorUniversidad de la República (Uruguay). Facultad de Ingeniería.es
thesis.degree.nameMagíster en Ciencia de Datos y Aprendizaje Automáticoes
dc.rights.licenceLicencia Creative Commons Atribución - No Comercial - Sin Derivadas (CC - By-NC-ND 4.0)es
Aparece en las colecciones: Tesis de Posgrado - Facultad de Ingeniería

Ficheros en este ítem:
Fichero Descripción Tamaño Formato   
Ri23.pdfTesis de maestría7,3 MBAdobe PDFVisualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons