The role of capitalization and character repetition in identifying depression on social Media: a bilingual approach

dc.audience.educationlevelOtros/Other
dc.contributor.advisorZareei, Mahdi
dc.contributor.authorBurgueño Paz, Luis Humberto
dc.contributor.catalogeremipsanchez
dc.contributor.committeememberRoshan Biswal, Rajesh
dc.contributor.departmentSchool of Engineering and Sciences
dc.contributor.institutionCampus Monterrey
dc.contributor.mentorGarcía Ceja, Enrique Alejandro
dc.date.accepted2024-11-24
dc.date.accessioned2025-01-04T12:22:20Z
dc.date.issued2024-11-24
dc.descriptionhttps://orcid.org/0000-0001-6623-1758
dc.description.abstractDepression is a mental disorder that affects millions of people worldwide, but a significant portion of the affected people don’t receive adequate treatment. There has been an increasing interest from researchers to detect this condition through social media posts in order to prompt for early treatment. However, most of the research has been focused on the Caucasian Western English-speaking population, limiting the applicability of their findings across diverse cultural contexts. While research has shown the use of nonverbal cues to convey sentiment, their role on depression detection remains under-explored. This thesis aims to assess the effect of nonverbal cues, specifically capitalization and character repetition, on depression detection using datasets both in English and Spanish. This effect was explored through three existing datasets. The first dataset included a collection of Reddit posts and comments in the English language and was selected to assess the effect on a dataset coming from one of the most reputable mental health competitions in Natural Language Processing. The second dataset consisted of a collection of Spanish- language messages from Telegram to verify whether findings in the English language would hold for Spanish. The third dataset, also built from Reddit posts, was used to analyze the impact of these features when classifying by depression severity levels rather than binary labels. Four classifiers were used throughout this research: Logistic Regression, Random Forest, Support Vector Machine, and Neural Network. Overall, the impact of capitalization and character repetition for depression detection was found to be minimal. These features had the most effect on English Reddit data with binary labels, while showing limited impact on Spanish data or when classifying by severity levels. Additionally, models using only character repetition outperformed those relying on capitalization features.
dc.description.degreeMaster of Science in Computer Science
dc.format.mediumTexto
dc.identificator339999
dc.identifier.citationBurgueno Paz, L. H. (2024). The role of capitalization and character repetition in identifying depression on social Media: a bilingual approach [Tesis maestría]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/702964
dc.identifier.cvu1276308
dc.identifier.orcidhttps://orcid.org/0009-0005-1531-3872
dc.identifier.urihttps://hdl.handle.net/11285/702964
dc.identifier.urihttps://doi.org/10.60473/ritec.40
dc.language.isoeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relation.isFormatOfacceptedVersion
dc.rightsopenAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::OTRAS ESPECIALIDADES TECNOLÓGICAS::OTRAS
dc.subject.keywordDepression
dc.subject.keywordDetection
dc.subject.keywordSocial media
dc.subject.keywordMental health
dc.subject.keywordMachine learning
dc.subject.lcshTechnology
dc.titleThe role of capitalization and character repetition in identifying depression on social Media: a bilingual approach
dc.typeTesis de Maestría / master Thesis

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
BurguenoPaz_TesisMaestriapdfa.pdf
Size:
1.65 MB
Format:
Adobe Portable Document Format
Description:
Tesis Maestría
Loading...
Thumbnail Image
Name:
BurguenoPaz_ActaGradoDeclaracionAutoriapdfa.pdf
Size:
390.83 KB
Format:
Adobe Portable Document Format
Description:
Acta de Grado y Declaración Autoría
Loading...
Thumbnail Image
Name:
BurguenoPaz_CartaAutorizacionpdfa.pdf
Size:
136.15 KB
Format:
Adobe Portable Document Format
Description:
Carta Autorización

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.28 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia