Image captioning for automated grading and understanding of pre-cancerous inflammations in ulcerative colitis on endoscopic images

dc.audience.educationlevelInvestigadores/Researchers
dc.audience.educationlevelEstudiantes/Students
dc.audience.educationlevelOtros/Other
dc.contributor.advisorOchoa Ruiz, Gilberto
dc.contributor.authorValencia Velarde, Flor Helena
dc.contributor.catalogeremimmayorquin
dc.contributor.committeememberHinojosa Cervantes, Salvador Miguel
dc.contributor.committeememberGonzalez Mendoza, Miguel
dc.contributor.departmentSchool of Engineering and Scienceses_MX
dc.contributor.institutionCampus Monterreyes_MX
dc.contributor.mentorAli, Sharib
dc.date.accepted2024-06-12
dc.date.accessioned2025-05-15T01:39:11Z
dc.date.issued2024
dc.descriptionhttps://orcid.org/0000-0002-9896-8727
dc.description.abstractThis thesis presents the development and results of an automated system for grading and understanding ulcerative colitis (UC) through image captioning. UC is a chronic inflammatory disease of the large intestine, characterized by alternating periods of remission and relapse. The conventional method for assessing UC severity involves the Mayo Endoscopic Scoring (MES) system, which depends on the visual evaluation of mucosal characteristics. This method is subjective and can result in considerable variability between different observers. The primary objective of this thesis is to investigate and evaluate contemporary methodologies for developing an image captioning model that can generate MES scores and descriptive captions for mucosal features observed in endoscopic images. This research involved an extensive examination of various convolutional neural networks (CNNs) for visual feature extraction and the implementation of several sequence models for natural language processing (NLP), including Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Recurrent Neural Networks (RNNs). Our system was rigorously evaluated on a dataset consisting of 982 images obtained from both public repositories and proprietary collections. The combination of DenseNet121 for CNN-based feature extraction and 2 layers GRU for sequence generation yielded the best performance, achieving a BLEU-4 score of 0.7352. This high level of similarity between the reference and predicted captions indicates the model’s effectiveness in accurately capturing and describing critical mucosal features necessary for UC grading. While our system performed well in predicting MES-0 to MES-2 categories, it encountered challenges in accurately predicting MES-3 classifications. This discrepancy is likely due to the underrepresentation of severe cases in the training dataset. Despite this limitation, the system’s ability to generate comprehensive descriptions of mucosal features represents a significant advancement in the automated evaluation of UC. The contributions of this thesis include the creation of a dataset for UC captioning task, a detailed analysis of various CNN architectures and sequence models, an extensive evaluation of their performance, and the development of a robust framework for automated UC grading and description generation. Our findings suggest that combining advanced visual feature extraction techniques with sophisticated NLP models can significantly improve the accuracy and reliability of automated medical diagnosis systems. By reducing inter-observer variability and providing a valuable tool for training new clinicians, this automated grading and captioning system has the potential to enhance diagnostic accuracy and clinical decision-making in UC management. This work represents a substantial step forward in the field of endoscopic imaging, underscoring the importance of integrating machine learning techniques in clinical practice. Additionally, by generating detailed descriptions, this approach helps mitigate the “black box” nature of deep learning, offering more transparency and interpretability in automated medical diagnoses.es_MX
dc.description.degreeMaestro en Ciencias Computacionaleses_MX
dc.format.mediumTextoes_MX
dc.identificator7||320503||329999||320101||120304
dc.identifier.citationValencia Velarde, F. H. (2024). Image captioning for automated grading and understanding of pre-cancerous inflammations in ulcerative colitis on endoscopic images. [Tesis maestría] Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703667
dc.identifier.cvu1239025es_MX
dc.identifier.orcidhttps://orcid.org/0009-0001-2676-221X
dc.identifier.urihttps://hdl.handle.net/11285/703667
dc.language.isoenges_MX
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterreyes_MX
dc.relationInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationCONAHCYT
dc.relation.isFormatOfpublishedVersiones_MX
dc.rightsopenAccesses_MX
dc.rights.urihttp://creativecommons.org/licenses/by/4.0es_MX
dc.subject.classificationMEDICINA Y CIENCIAS DE LA SALUD::CIENCIAS MÉDICAS::MEDICINA INTERNA::GASTROENTEROLOGÍA
dc.subject.classificationMEDICINA Y CIENCIAS DE LA SALUD::CIENCIAS MÉDICAS::OTRAS ESPECIALIDADES MÉDICAS::OTRAS
dc.subject.classificationCIENCIAS FÍSICO MATEMÁTICAS Y CIENCIAS DE LA TIERRA::MATEMÁTICAS::CIENCIA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIAL
dc.subject.keywordUlcerative colitis
dc.subject.keywordEndoscopy
dc.subject.keywordImage captioning
dc.subject.lcshScience
dc.subject.lcshMedicine
dc.titleImage captioning for automated grading and understanding of pre-cancerous inflammations in ulcerative colitis on endoscopic images
dc.typeTesis de Maestría / master Thesises_MX

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
ValenciaVelarde_TesisMaestria.pdf
Size:
23.09 MB
Format:
Adobe Portable Document Format
Description:
Tesis Maestría
Loading...
Thumbnail Image
Name:
ValenciaVelarde_CartaAutorizacion.pdf
Size:
138.23 KB
Format:
Adobe Portable Document Format
Description:
Carta Autorización
Loading...
Thumbnail Image
Name:
ValenciaVelarde_FirmasActadeGrado.pdf
Size:
422.57 KB
Format:
Adobe Portable Document Format
Description:
Firmas Acta de Grado

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.3 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia