A work on optimizers for binarized neural networks: a second order approach

dc.audience.educationlevelInvestigadores/Researcherses_MX
dc.contributor.advisorGonzalez Mendoza, Miguel
dc.contributor.authorSuárez Ramírez, Cuauhtémoc Daniel
dc.contributor.catalogertolmquevedoes_MX
dc.contributor.committeememberOchoa Ruiz, Gilberto
dc.contributor.committeememberMorales González Quevedo, Annette
dc.contributor.committeememberSanchez Castellanos, Hector Manuel
dc.contributor.departmentSchool of Engineering and Scienceses_MX
dc.contributor.institutionCampus Monterreyes_MX
dc.contributor.mentorChang Fernández, Leonardo
dc.date.accepted2020-11
dc.date.accessioned2022-01-10T23:06:34Z
dc.date.available2022-01-10T23:06:34Z
dc.date.created2020-11-21
dc.date.issued2020-11
dc.descriptionhttps://orcid.org/0000-0001-6451-9109es_MX
dc.description.abstractOptimization of Binarized Neural Networks (BNNs) relies on approximating the real-valued weights with their binarized representations. Current techniques for weight-updating uses the same optimizers as traditional Neural Networks (NNs). There has only been one effort to directly train the BNNs with bit-flips by using a raw first moment estimate of the gradients and comparing it against a threshold for deciding when to flip a weight (Bop). In this thesis, we iteratively improve this approach by drawing parallels to the Adam optimizer with the inclusion of a second raw moment estimate to normalize the average of the gradients before doing the comparison with a threshold (Bop2ndOrder). Additionally, we tested the effect of using a scheduler on the threshold value as an equivalent to a regularizer, along with bias-corrected and not corrected versions of the optimizer. The proposed optimizer was tested using three different architectures with CIFAR-10 and Imagenet2012; in both datasets this proved to converge faster, being more robust to changes of the hyper-parameters, and achieving better accuracies. Moreover, we also proposed a proof of concept Probabilistic Binary Optimizer (PBop) which treats each weight as loaded coins (Bernoulli distribution) proving that, even though the results are not on par with state-of-the-art, the concept is feasible for Image Classification although it requires a deep exploration of the effect of the scaler.es_MX
dc.description.degreeMaster of Science in Computer Sciencees_MX
dc.format.mediumTextoes_MX
dc.identificator7||33||3304||120304es_MX
dc.identifier.citationSuarez Ramirez, C. D. (2020). A work on optimizers for binarized neural networks: a second order approach (Tesis de Maestría). Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/643391es_MX
dc.identifier.cvu504827es_MX
dc.identifier.orcidhttps://orcid.org/0000-0003-3131-1330es_MX
dc.identifier.urihttps://hdl.handle.net/11285/643391
dc.language.isoenges_MX
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterreyes_MX
dc.relation.impreso2020-12-01
dc.relation.isFormatOfversión publicadaes_MX
dc.rightsopenAccesses_MX
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0es_MX
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIALes_MX
dc.subject.keywordBinarizationes_MX
dc.subject.keywordBinarized Neural Networkses_MX
dc.subject.keywordDeep Learninges_MX
dc.subject.keywordOptimizationes_MX
dc.subject.keywordOptimizeres_MX
dc.subject.keywordQuantizationes_MX
dc.subject.lcshTechnologyes_MX
dc.titleA work on optimizers for binarized neural networks: a second order approaches_MX
dc.typeTesis de maestría

Files

Original bundle

Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
CuauhtemocSuarez__MCC_Thesis.pdf
Size:
38.45 MB
Format:
Adobe Portable Document Format
Description:
Loading...
Thumbnail Image
Name:
CartaAutorizacion_CuauhtemocSuarez.pdf
Size:
791.56 KB
Format:
Adobe Portable Document Format
Description:
Authorization letter

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.3 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia