Lights, camera, and domain shift: using superpixels for domain generalization in image segmentation for multimodal endoscopies

dc.audience.educationlevelInvestigadores/Researcherses_MX
dc.contributor.advisorOchoa Ruiz, Gilberto
dc.contributor.authorMartínez García Peña, Rafael
dc.contributor.catalogerpuemcuervo, emipsanchez
dc.contributor.committeememberFalcón Morales, Luis Eduardo
dc.contributor.committeememberGónzales Mendoza, Miguel
dc.contributor.departmentSchool of Engineering and Scienceses_MX
dc.contributor.institutionCampus Monterreyes_MX
dc.contributor.mentorAli, Sharib
dc.date.accepted2023-05-26
dc.date.accessioned2025-03-05T21:13:57Z
dc.date.issued2023-05
dc.description.abstractDeep Learning models have made great advancements in image processing. Their ability to identify key parts of images and provide fast and accurate segmentation has been proven and used in many fields, such as city navigation and object recognition. However, there is one field that is both in need of the extra information that computers can provide and has proven elusive for the goals of robustness and accuracy: Medicine. In the medical field, limitations in the amount of data and in the variation introduced by factors such as differences in instrumentation introduce a grave threat to the accuracy of a model known as domain shift. Domain shift occurs when we train with data that has a set of characteristics that is not wholly representative of the entire set of data a task encompasses. When it is present, models that have no tools to deal with it can observe a degradation to their accuracy to such degree that they can be transformed from usable to useless. To better explore this topic, we discuss two techniques: Domain adaptation, where we find how to make a model better at predicting for specific domain of data inside a task, and Domain generalization, where we find how to make a model better at predicting data for any domain inside a task. In addition, we discuss several image segmentation models that have shown good results for medical tasks: U-Net, Attention U-Net, DeepLab, Efficient U-Net, and EndoUDA. Following this exploration, we propose a solution model based on a domain generalization technique: Patch-based consistency. We use a superpixel generator known as SLIC (Simple Linear Iterative Clustering) to provide low-level, domain-agnostic information to different models in order to encourage our networks to learn more global features. This framework, which we refer to as SUPRA (SUPeRpixel Augmented), is used in tandem with U-Net, Attention U-Net, and Efficient U-Net in an effort to improve results in endoscopies where light modalities are switched: Something commonly seen in lesion detection tasks (particularly in Barrett's Esophagus and Polyp detection). We find that the best of these models, SUPRA-UNet, shows significant qualities that make it a better choice than unaugmented networks for lesion detection: Not only does it provide less noisy and smoother predictions, but it outperforms other networks by over 20% IoU versus the best results (U-Net) in a target domain that presents significant lighting differences from the training set.
dc.description.degreeMaster of Science in Computer Sciencees_MX
dc.format.mediumTextoes_MX
dc.identificator7||33||3314||331499es_MX
dc.identifier.citationMartínez García Peña, R. (2023). Lights, camera, and domain shift: using superpixels for domain generalization in image segmentation for multimodal endoscopies [Tesis maestría]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703268
dc.identifier.cvu1151389es_MX
dc.identifier.urihttps://hdl.handle.net/11285/703268
dc.language.isoenges_MX
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterreyes_MX
dc.relation.isFormatOfacceptedVersiones_MX
dc.rightsopenAccess
dc.rights.urihttps://creativecommons.org/licenses/by-sa/4.0es_MX
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA MÉDICA::OTRASes_MX
dc.subject.keywordDeep learninges_MX
dc.subject.keywordSuperpixelses_MX
dc.subject.keywordU-netes_MX
dc.subject.keywordDomain generalizationes_MX
dc.subject.keywordImage segmentationes_MX
dc.subject.keywordBarrett's esophaguses_MX
dc.subject.keywordComputer visiones_MX
dc.subject.lcshSciencees_MX
dc.titleLights, camera, and domain shift: using superpixels for domain generalization in image segmentation for multimodal endoscopieses_MX
dc.typeTesis de Maestría / master Thesises_MX

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
MartinezGarciaPena_TesisMaestríapdfa.pdf
Size:
37.86 MB
Format:
Adobe Portable Document Format
Description:
Tesis Maestría
Loading...
Thumbnail Image
Name:
MartinezGarciaPena_ActaGradoDeclaracionAutoriapdfa.pdf
Size:
488.72 KB
Format:
Adobe Portable Document Format
Description:
Acta de Grado y Declaració de Autoría
Loading...
Thumbnail Image
Name:
MartinezGarciaPena_CartaAutorizacionpdfa.pdf
Size:
457.6 KB
Format:
Adobe Portable Document Format
Description:
Carta Autorización

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.3 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia