A generalist reinforcement learning agent for compressing multiple convolutional neural networks

dc.audience.educationlevelOtros/Other
dc.contributor.advisorConant ablos, Santiago Enrique
dc.contributor.authorGonzález Sahagún, Gabriel
dc.contributor.catalogeremipsanchez
dc.contributor.committeememberOrtíz Bayliss, José Carlos
dc.contributor.committeememberCruz Duarte, Jorge Mario
dc.contributor.committeememberGutiérrez Rodríguez, Andrés Eduardo
dc.contributor.departmentSchool of Engineering and Sciences
dc.contributor.institutionCampus Monterrey
dc.date.accepted2024-12-11
dc.date.accessioned2025-01-10T18:46:13Z
dc.date.issued2024-12-11
dc.descriptionhttps://orcid.org/0000-0001-6270-3164
dc.description.abstractDeep Learning has achieved state-of-the-art accuracy in multiple fields. A common practice in computer vision is to reuse a pre-trained model for a completely different dataset of the same type of task, a process known as transfer learning, which reduces training time by reusing the filters of the convolutional layers. However, while transfer learning can reduce training time, the model might overestimate the number of parameters needed for the new dataset. As models now achieve near-human performance or better, there is a growing need to reduce their size to facilitate deployment on devices with limited computational resources. Various compression techniques have been proposed to address this issue, but their effectiveness varies depending on hyperparameters. To navigate these options, researchers have worked on automating model compression. Some have proposed using reinforcement learning to teach a deep learning model how to compress another deep learning model. This study compares multiple approaches for automating the compression of convolutional neural networks and proposes a method for training a reinforcement learning agent that works across multiple datasets without the need for transfer learning. The agents were tested using leaveone- out cross-validation, learning to compress a set of LeNet-5 models and testing on another LeNet-5 model with different parameters. The metrics used to evaluate these solutions were accuracy loss and the number of parameters of the compressed model. The agents suggested compression schemes that were on or near the Pareto front for these metrics. Furthermore, the models were compressed by more than 80% with minimal accuracy loss in most cases. The significance of these results is that by escalating this methodology for larger models and datasets, an AI assistant for model compression similar to ChatGPT can be developed, potentially revolutionizing model compression practices and enabling advanced deployments in resource-constrained environments.
dc.description.degreeDoctor of Philosophy in Computer Science
dc.format.mediumTexto
dc.identificator339999
dc.identifier.citationGonzález Sahagún, G (2024). A generalist reinforcement learning agent for compressing multiple convolutional neural networks [Tesis doctoral]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703008
dc.identifier.orcidhttps://orcid.org/0009-0009-7931-4654
dc.identifier.urihttps://hdl.handle.net/11285/703008
dc.language.isoeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationCONAHCYT
dc.relation.isFormatOfacceptedVersion
dc.rightsopenAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::OTRAS ESPECIALIDADES TECNOLÓGICAS::OTRAS
dc.subject.keywordDeep Learning
dc.subject.keywordModel Compressión
dc.subject.keywordModel Compressión
dc.subject.keywordComputer Visión
dc.subject.lcshTechnology
dc.titleA generalist reinforcement learning agent for compressing multiple convolutional neural networks
dc.typeTesis de doctorado

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
GonzálezSahagun_TesisDoctoradopdfa.pdf
Size:
23.58 MB
Format:
Adobe Portable Document Format
Description:
Tesis Doctorado
Loading...
Thumbnail Image
Name:
GonzálezSahagun_ActaGradopdfa.pdf
Size:
221.36 KB
Format:
Adobe Portable Document Format
Description:
Acta de Grado
Loading...
Thumbnail Image
Name:
GonzalezSahagun_CartaAutorizacion.pdf
Size:
193.61 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.28 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2025

Licencia