Deep learning and natural language processing for computer aided diagnosis
| dc.audience.educationlevel | Investigadores/Researchers | |
| dc.audience.educationlevel | Estudiantes/Students | |
| dc.audience.educationlevel | Maestros/Teachers | |
| dc.audience.educationlevel | Otros/Other | |
| dc.contributor.advisor | Tamez Peña, Jose Gerardo | |
| dc.contributor.author | Hussain, Sadam | |
| dc.contributor.cataloger | emipsanchez | |
| dc.contributor.committeemember | Santos Díaz, Alejandro | |
| dc.contributor.committeemember | Martínez Ledesma, Juan Emmanuel | |
| dc.contributor.committeemember | Bron, Esther E. | |
| dc.contributor.committeemember | Mery, Domingo | |
| dc.contributor.department | School of Engineering and Sciences | |
| dc.contributor.institution | Campus Monterrey | |
| dc.date.accepted | 2025-06 | |
| dc.date.accessioned | 2025-07-22T01:35:12Z | |
| dc.date.embargoenddate | 2026-07-21 | |
| dc.date.issued | 2025-06 | |
| dc.description.abstract | Multimodal artificial intelligence (AI) is a cutting-edge technique that integrates diverse modalities, such as imaging and textual data, to enhance classification and regression tasks. This dissertation focuses on the integration, comparison, and evaluation of multimodal AI for breast cancer diagnosis and prognosis. To achieve these objectives, we curated a comprehensive multimodal dataset comprising digital mammograms and corresponding radiological reports. Leveraging this dataset, we introduced and assessed various state-of-the-art (SOTA) multimodal techniques for three key tasks: breast cancer classification, reduction of false-positive biopsies with explainable AI (XAI), and short- term (5-year) risk prediction of breast cancer.In this work, we also introduced a benchmark dataset of radiological reports from breast cancer patients and provided baseline performance evaluations using SOTA machine learning (ML), deep learning (DL), and large language models (LLMs) for BI-RADS category classification. Our approach evaluated the performance of diverse SOTA multimodal architectures, including ResNet, VGG, E!cientNet, MobileNet, and Vision Transformers (ViT). For textual data processing, we employed both general-purpose and domain-specific pretrained LLMs such as BERT, bioGPT, ClinicalBERT, and DeBERTa, which were also integrated into multimodal architectures for enhanced lassification.Notably, our proposed multiview multimodal feature fusion (MMFF) architecture, combining SE-ResNet50 with an artificial neural network (ANN), achieved an AUC of 0.965 for breast cancer classification, significantly outperforming both single-modal and multimodal SOTA architectures. For reducing unnecessary breast biopsies, our multimodal approach achieved an AUC of 0.72, showcasing its clinical utility in minimizing patient burden. Moreover, our ViT and bioGPT-based multimodal architecture achieved an AUC of 0.77 for short-term risk prediction, outperforming the SOTA MIRAI model, which achieved an AUC of 0.59 on our in-house dataset. This work highlights the potential of multimodal AI in advancing breast cancer diagnosis and prognosis, demonstrating its superiority over traditional and unimodal approaches across multiple critical tasks. | |
| dc.description.degree | Doctor of Philosophy in Computer Sciences | |
| dc.format.medium | Texto | |
| dc.identificator | 220212 | |
| dc.identificator | 120320 | |
| dc.identificator | 330413 | |
| dc.identificator | 320111 | |
| dc.identifier.citation | Hussain, Sadam (2025). Deep learning and natural language processing for computer aided diagnosis [Tesis doctoral]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703887 | |
| dc.identifier.uri | https://hdl.handle.net/11285/703887 | |
| dc.language.iso | eng | |
| dc.publisher | Instituto Tecnológico y de Estudios Superiores de Monterrey | |
| dc.relation | Instituto Tecnológico y de Estudios Superiores de Monterrey, Campus Monterrey | |
| dc.relation | CONAHCYT | |
| dc.relation.isFormatOf | acceptedVersion | |
| dc.rights | openAccess | |
| dc.rights.embargoreason | Se solicita el embargo porque algunos capítulos aún no se han publicado y están en proceso. | |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0 | |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIAL | |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::SISTEMAS DE CONTROL MÉDICO | |
| dc.subject.classification | CIENCIAS FÍSICO MATEMÁTICAS Y CIENCIAS DE LA TIERRA::MATEMÁTICAS::CIENCIA DE LOS ORDENADORES::DISPOSITIVOS DE TRANSMISIÓN DE DATOS | |
| dc.subject.classification | MEDICINA Y CIENCIAS DE LA SALUD::CIENCIAS MÉDICAS::CIENCIAS CLÍNICAS::RADIOLOGÍA | |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA ELECTRÓNICA::RAYOS X | |
| dc.subject.keyword | Multimodal Learning | |
| dc.subject.keyword | Breast Cancer Classification | |
| dc.subject.keyword | Computer Aided Diagnosis | |
| dc.subject.keyword | Medical Image Analysis | |
| dc.subject.keyword | BI-RADS Classification | |
| dc.subject.keyword | Radiology Report Analysis | |
| dc.subject.lcsh | Technology | |
| dc.subject.lcsh | Science | |
| dc.title | Deep learning and natural language processing for computer aided diagnosis | |
| dc.type | Tesis de doctorado |
Files
Original bundle
1 - 3 of 3
Loading...
- Name:
- Hussain_TesisDoctorado_pdfa.pdf
- Size:
- 26.74 MB
- Format:
- Adobe Portable Document Format
- Description:
- Tesis Doctorado
Loading...
- Name:
- Hussain_ActaGrado_pdfa.pdf
- Size:
- 996.34 KB
- Format:
- Adobe Portable Document Format
- Description:
- Acta de Grado
Loading...
- Name:
- Hussain_CartaAutorizacion.docx
- Size:
- 45.14 KB
- Format:
- Microsoft Word XML
- Description:
- Carta Autorización
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.28 KB
- Format:
- Item-specific license agreed upon to submission
- Description:

