english Icono del idioma   español Icono del idioma  

Por favor, use este identificador para citar o enlazar este ítem: https://hdl.handle.net/20.500.12008/29755 Cómo citar
Registro completo de metadatos
Campo DC Valor Lengua/Idioma
dc.contributor.authorGiménez, Eduardo-
dc.contributor.authorEtcheverry, Lorena-
dc.contributor.authorOlmedo, Federico-
dc.contributor.authorBuil Aranda, Carlos-
dc.contributor.authorToro, Matías-
dc.contributor.authorPastorini, Marcos-
dc.coverage.spatialUruguay.es
dc.date.accessioned2021-10-06T16:45:59Z-
dc.date.available2021-10-06T16:45:59Z-
dc.date.issued2021-
dc.identifier.citationGiménez, E., Etcheverry, L., Olmedo, F. y otros. LETEO: Scalable anonymization of big data and its application to learning analytics [en línea]. Montevideo : Udelar. FI.,2021.es
dc.identifier.urihttps://hdl.handle.net/20.500.12008/29755-
dc.descriptionANII Fondo sectorial de investigación con datos - 2018es
dc.description.abstractCreated in 2007, Plan Ceibal is an inclusion and equal opportunities plan with the aim of supporting Uruguayan educational policies with technology. Throughout these years, and within the framework of its tasks, Ceibal has an important amount of data related to the use of technology in education, necessary to manage the plan and fulfill the assigned legal tasks. However, the data does not they can be studied without accounting for the problem of de identifying the users of the Plan. To exploit this data, Ceibal has deployed an instance of the Hortonworks Data Platform (HDP), a open source platform for the storage and parallel processing of massive data (big data). HDP offers a wide range of functional components ranging from large file storage (HDFS) to distributed programming of machine learning algorithms (Apache Spark / MLlib). However, as of today there are no solutions for the de-identification of personal code data open and integrated into the Hortonworks ecosystem. On the one hand, the deidentification tools existing data have not been designed so that they can easily scale to large volumes of data, and they also do not offer easy integration mechanisms with HDFS. This forces you to export the data outside of the platform that stores them to be able to anonymize them, with the consequent risk of exposure of confidential information. On the other hand, the few integrated solutions in the Hortonworks ecosystem are owners and the cost of their licenses is very significant. The objective of this project is to promote the use of the enormous amount of educational and technological data that Ceibal possesses, lifting one of the greatest obstacles that exist for that, namely, the preservation of privacy and the protection of the personal data of the beneficiaries of the Plan. To this end, this project seeks to generate anonymization tools that extend the HDP platform. On In particular, it seeks to develop open source modules to integrate into said platform, which implement a set of programmed anonymization techniques and algorithms in a distributed manner using Apache Spark and that can be applied to data sets stored in HDFS files.es
dc.format.extent16 p.es
dc.format.mimetypeapplication/pdfes
dc.language.isoeses
dc.publisherUdelar. FI.es
dc.rightsLas obras depositadas en el Repositorio se rigen por la Ordenanza de los Derechos de la Propiedad Intelectual de la Universidad de la República.(Res. Nº 91 de C.D.C. de 8/III/1994 – D.O. 7/IV/1994) y por la Ordenanza del Repositorio Abierto de la Universidad de la República (Res. Nº 16 de C.D.C. de 07/10/2014)es
dc.subjectAnonymizationes
dc.subjectBig dataes
dc.subjectLearning analyticses
dc.titleLETEO: Scalable anonymization of big data and its application to learning analyticses
dc.typeReporte técnicoes
dc.contributor.filiacionGiménez Eduardo, Information and Communication Technologies for Verticals (ICT4V)-
dc.contributor.filiacionEtcheverry Lorena, Universidad de la República (Uruguay). Facultad de Ingeniería. Instituto de Computación.-
dc.rights.licenceLicencia Creative Commons Atribución - No Comercial - Sin Derivadas (CC - By-NC-ND 4.0)es
Aparece en las colecciones: Reportes Técnicos - Instituto de Computación

Ficheros en este ítem:
Fichero Descripción Tamaño Formato   
GEOBTP21.pdf785,02 kBAdobe PDFVisualizar/Abrir


Este ítem está sujeto a una licencia Creative Commons Licencia Creative Commons Creative Commons