Hardware-aware neural architecture search for enhancing text generation

dc.audience.educationlevel: Investigadores/Researchers
dc.audience.educationlevel: Otros/Other
dc.contributor.advisor: Sosa Hernández, Víctor Adrián
dc.contributor.author: Sánchez Miranda, Israel
dc.contributor.cataloger: emipsanchez
dc.contributor.committeemember: Castillo Juárez, Esteban
dc.contributor.committeemember: Ortiz Bayliss, José Carlos
dc.contributor.committeemember: Juárez Gambino, Joel Omar
dc.contributor.department: School of Engineering and Sciences
dc.contributor.institution: Campus Estado de México
dc.contributor.mentor: Pescador Rojas, Miriam
dc.date.accepted: 2025-05-30
dc.date.accessioned: 2025-07-01T21:27:09Z
dc.date.issued: 2025-06
dc.description: https://orcid.org/0000-0002-1099-8148
dc.description.abstract: In recent years, neural network optimization has become critical in Natural Language Processing (NLP) tasks. However, manual tuning is time-consuming and heavily influenced by the designer's prior knowledge, which limits the exploration of alternative architecture designs. Consequently, only a narrow subset of neural network architectures is typically considered for tasks such as text generation. Furthermore, neural network tuning requires specialized expertise, posing a barrier for non-experts and hindering broader innovation in the field. This research addresses these challenges by implementing a specialized Hardware-Aware Neural Architecture Search (HW-NAS) methodology, tailored specifically to text generation tasks in resource-constrained environments. The proposed NAS approach leverages a compact, efficient search space encoding key transformer architectural components, and adopts multi-objective optimization to simultaneously maximize text generation quality, measured via the METEOR score, and minimize the parameter count to enhance hardware adaptability. Two evolutionary NAS strategies were explored: a custom Lexicographic Evolutionary Strategy (LexSMS-MODES) and SMS-EMOA, both focused on balancing exploration, exploitation, and computational efficiency. Experimental evaluations were conducted in both unconstrained environments and constrained hardware platforms. The optimized architectures demonstrated consistent improvements over the baseline model across multiple performance measures, including BLEU, ROUGE, and GLEU. Notably, METEOR scores reached values close to 0.72 in unconstrained settings. Although significant performance degradation was observed in constrained environments (approximately 57%–59% reduction in METEOR scores), the discovered models remained competitive with several state-of-the-art lightweight and NAS-based solutions.
Hardware-aware evaluations revealed that NAS-generated models achieved substantial reductions in memory usage, GPU load, and CPU frequency deltas, despite not explicitly optimizing hardware indicators during the search. Statistical tests confirmed the stability of the discovered models across multiple hardware performance metrics. Comparisons against external works showed that while the proposed method successfully produced lightweight and efficient architectures, there remains room for improvement regarding inference latency and hardware adaptation strategies.
dc.description.degree: Master of Science in Computer Science
dc.format.medium: Text
dc.identificator: 330406
dc.identifier.citation: Sánchez Miranda, I. (2025). Hardware-aware neural architecture search for enhancing text generation [Master's thesis]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Retrieved from: https://hdl.handle.net/11285/703797
dc.identifier.cvu: 1317940
dc.identifier.uri: https://hdl.handle.net/11285/703797
dc.language.iso: eng
dc.publisher: Instituto Tecnológico y de Estudios Superiores de Monterrey
dc.relation: Secretaría de Ciencia, Humanidades, Tecnología e Innovación (CONAHCyT)
dc.relation: Instituto Tecnológico y de Estudios Superiores de Monterrey
dc.relation.isFormatOf: acceptedVersion
dc.rights: openAccess
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0
dc.subject.classification: ENGINEERING AND TECHNOLOGY::TECHNOLOGICAL SCIENCES::COMPUTER TECHNOLOGY::ARTIFICIAL INTELLIGENCE
dc.subject.classification: ENGINEERING AND TECHNOLOGY::TECHNOLOGICAL SCIENCES::COMPUTER TECHNOLOGY::COMPUTER ARCHITECTURE
dc.subject.keyword: Neural Architecture Search
dc.subject.keyword: Text generation
dc.subject.keyword: Multi-objective optimization
dc.subject.keyword: Bio-inspired algorithms
dc.subject.lcsh: Technology
dc.subject.lcsh: Science
dc.title: Hardware-aware neural architecture search for enhancing text generation
dc.type: Master's thesis
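
Illustrative note on the abstract: the search is described as a two-objective problem (maximize METEOR, minimize parameter count), with SMS-EMOA as one of the strategies. SMS-EMOA is generally known for discarding the individual that contributes least to the dominated hypervolume. This record does not include the thesis code, so the following is only a minimal sketch of that selection idea under stated assumptions: the function names, the reference point, and the candidate values are all hypothetical and are not taken from the thesis.

```python
from typing import List, Tuple

# One candidate architecture: (METEOR score, parameter count in millions).
# This tuple layout is an assumption made for the sketch.
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is no worse than b on both objectives and better on at least one."""
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    better = a[0] > b[0] or a[1] < b[1]
    return no_worse and better


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


def hypervolume_2d(points: List[Candidate], ref: Candidate) -> float:
    """Area dominated by points, bounded by ref = (lowest METEOR, highest params)."""
    area = 0.0
    best_meteor = ref[0]
    # Sweep from the smallest model upward; a point adds a strip only if it
    # raises the best METEOR seen so far.
    for meteor, params in sorted(points, key=lambda c: (c[1], -c[0])):
        if meteor > best_meteor and params < ref[1]:
            area += (meteor - best_meteor) * (ref[1] - params)
            best_meteor = meteor
    return area


def least_contributor(front: List[Candidate], ref: Candidate) -> Candidate:
    """Return the front member whose removal loses the least hypervolume."""
    total = hypervolume_2d(front, ref)

    def contribution(i: int) -> float:
        rest = front[:i] + front[i + 1:]
        return total - hypervolume_2d(rest, ref)

    return front[min(range(len(front)), key=contribution)]


# Made-up values for illustration only; these are not results from the thesis.
population = [(0.50, 10.0), (0.70, 20.0), (0.60, 25.0), (0.55, 12.0)]
reference = (0.0, 30.0)  # hypothetical worst case: METEOR 0, 30M parameters

front = pareto_front(population)
print("Non-dominated candidates:", front)
print("Candidate a steady-state step would drop:", least_contributor(front, reference))
```

In this toy population, the candidate with higher parameter count and lower METEOR than another is filtered out as dominated, and the remaining trade-off points are ranked by how much dominated area each one adds on its own.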

Files

Original bundle

Name: SanchezMiranda_TesisMaestria_pdfa.pdf
Size: 5.49 MB
Format: Adobe Portable Document Format
Description: Master's thesis

Name: SanchezMiranda_ActaGradoDeclaracionAutoria_pdfa.pdf
Size: 293.87 KB
Format: Adobe Portable Document Format
Description: Degree certificate

Name: SanchezMiranda_CartaAutorizacion_pdf.pdf
Size: 151.22 KB
Format: Adobe Portable Document Format
Description: Authorization letter

License bundle

Name: license.txt
Size: 1.28 KB
Format: Item-specific license agreed upon to submission
