Ciencias Exactas y Ciencias de la Salud

Permanent URI for this collectionhttps://hdl.handle.net/11285/551039

Pertenecen a esta colección Tesis y Trabajos de grado de las Maestrías correspondientes a las Escuelas de Ingeniería y Ciencias así como a Medicina y Ciencias de la Salud.

Browse

Search Results

Now showing 1 - 4 of 4
  • Tesis de maestría
    Comparing Databases for the Prediction of Student’s Academic Performance using Data Science on the Novel Educational Model Tec21 at Tecnológico de Monterrey
    (Instituto Tecnológico y de Estudios Superiores de Monterrey, 2021-06) Lara Castor, Miguel Andrés; HERNANDEZ GRESS, NEIL; 21847; Hernández Gress, Neil; tolmquevedo, emipsanchez; Batres Prieto, Rafael; Garza Villareal, Sara Elena; Escuela de Ingeniería y Ciencias; Campus Monterrey; Ceballos Cancino, Héctor Gibrán
    Many studies have been made on the prediction of student's academic performance using Data Science. The students with poor academic performance as well as dropout students make a huge impact on the graduation rates, reputation, and finances of an educational institution. These studies take the advantage of the digitization of the admission and academic data of the students and the increasing computational power. However, since August 2019 Tecnologico de Monterrey has been doing it using entrance tests called Initial Evaluations. Unfortunately, the Initial evaluations did not provide useful predictions for the students of the fall semester in 2019. Therefore, this study aimed to compare the Initial Evaluations and the admissions data using Data Science models to predict the student's academic performance. The admission data was composed of five databases: Initial Evaluations, Emotions, Curriculum, Admission Exam and Grades of the first semester. A similar methodology to Cross Industry Standard Process for Data Mining was used to compare the models based on admission data and the models based only Initial Evaluations. A large number of experiments were carried out combining different data of admissions, feature reduction techniques and classification models. The experiments showed that the models based on admission data predicts the student's academic performance with higher accuracy than the models based only on Initial Evaluations. Nevertheless, some variables of the Initial Evaluations were relevant to the models based on admission data. Moreover, the accuracy of the experiments was in the range of the results from the related studies. The results of this study indicates that the Initial Evaluations provide useful information for the prediction of student's academic performance in the domain of Data Science.
  • Tesis de maestría
    Intent discovery from conversational logs to prepare a student admission chatbot for Tecnológico de Monterrey
    (Instituto Tecnológico y de Estudios Superiores de Monterrey, 2021-05) Treviño Lozano, Rolando; Hernández Gress, Neil; tolmquevedo; Alvarado Uribe, Joanna; Castro Sánchez, Noé Alejandro; Escuela de Ingeniería y Ciencias; Campus Monterrey; Ceballos Cancino, Héctor Gibrán
    Online chat services allow companies to serve and attend to their customers to resolve problems or doubts about a specific concept. Lately, conversational bots have been adapting to this domain, allowing a broader attention capacity while easing interactions between users and the company while also easing work for agents, increasing productivity and service quality. To design a chatbot is a time-consuming task as the designer has to provide the core key concepts known as intents that the conversational bot will respond to and provide example sentences and their respective answers. We propose a framework that receives as input data corresponding to conversational transcripts between prospects and agents and transform them through the use of regular expressions into a tabular dataset of the conversations in log format easing their analysis and representation to be converted into a convenient word representation of TF-IDF which serves as input for applying unsupervised machine learning algorithms as Non-Matrix Factorization for Topic Modeling and K-Means for utterance clustering to discover possible intents, which can then be passed on to the design of a knowledge base, which this last step of intent discovery allows an iterative process to process new conversations and identify changes in the intents or the addition of new ones. Results demonstrate that it is possible to cluster the utterances and find clusters that align to a possible intent out of a list of possible intents and such list is subject to change in time for continuously improving intent discovery. A cosine similarity threshold was set at 0.47 to differentiate correctly aligned clusters from those not aligned; 18 intents out of 55 were able to be correctly aligned with an initial intents list, and a total of 35 different intents were able to be captured by the clustering process. No exact similar research was found in the literature, as other works on the domain imply an already curated and labeled dataset to being working on classifying the intents rather than discovering them during the knowledge base design, also they do not take into account the whole process of transforming the raw conversations into a tabular and processed dataset.
  • Tesis de maestría
    Analyzing factors that impact alumni income with a machine learning approach
    (Instituto Tecnológico y de Estudios Superiores de Monterrey, 2021-05) Gómez Cravioto, Daniela Alejandra; Hernández Gress, Neil; puelquio; Ceballos Cancino, Héctor; López Guajardo, Rafael; Ceballos Cancino, Héctor Gibrán; School of Engineering and Sciences; Campus Monterrey; Preciado Arreola, José Luis
    This thesis presents an exploration of different machine-learning algorithms and different approaches for predicting alumni income. The aim is to obtain insights regarding the strongest predictors for income and a ``high" earners class. The study examines the alumni sample data obtained from a survey from Tec de Monterrey, a multi-campus Mexican private university. Survey results encompass 17,898 observations before cleaning and preprocessing and 12,275 observations after this. The dataset includes values for income and a large set of independent variables, including demographic and occupational attributes of the former students and academic attributes from the institution's history. For the problem of income prediction, there have been several attempts in both social science and econometric studies. However, this study investigates whether the accuracy of conventional algorithms in econometric research to predict income can be improved with a data science approach. Furthermore, we present insights obtained with explainable AI techniques. The results show that the Gradient Boosting Model outperformed the parametric models, Linear Regression and Logistic Regression, in predicting the current income of alumni with statistically significant results (p<0.05) in three different approaches: OLS regression, Multi-class Classification, and Binary Classification. The study also identified that for predicting the alum's first income after graduation, the Linear and Logistic Regression models were the most accurate methods, as the non-parametric models did not show a significant improvement. Succinctly, we identified that age, gender, working hours per week, their first income after graduation, and those factors related to their job position and their firm contributed to explaining their income. Simultaneously, post-graduation education and family background had an insignificant contribution to the model. In addition, the results, which showed a gender wage gap indicate that further work is required to enable equality in Mexico.
  • Tesis de maestría
    Financial Habits of Mexican Women using Machine Learning Algorithms
    (Instituto Tecnológico y de Estudios Superiores de Monterrey, 2020-04) Lozano Medina, Jessica Ivonne; Hernández Gress, Neil; RR/tolmquevedo; Ceballos Cancino, Héctor Gibrán; Flores Segovia, Miguel Alejandro; School of Engineering and Sciences; Campus Monterrey; Hervert Escobar, Laura
    This research was conducted under the Master in Computational Science program at Tecnológico de Monterrey. The proposal is a model to assess a profile risk for Mexican women, who require the service of a financial portfolio offered by a financial institution. Typically, women are scored with a lower financial risk than men. However, the understanding of variables and indicators that lead to such results, are not fully understood. Furthermore, the stochastic nature of the data makes it difficult to generate a suitable profile to offer an adequate financial portfolio to the women segment. Therefore, there is a great interest for developing methods that correctly model the behavior, and aid the decision-making process in financial services. Several models in the State-of-art for this type of analysis is done with linear programming and statistical techniques. Therefore, this study will use a benchmark of Machine Learning algorithms, such as Unsupervised and Supervised Learning algorithms, to extract information on four different datasets relevant to the population of interest. The first phase involves applying state-of-the-art techniques on public datasets of the Mexican population, whereas the second phase involves a future research involving a financial institution to create the model for the Women segment. It was found that financial habits of the population are heavily dependent on the region. There also an important group in the population characterized for not possessing an account in a financial institution and also not having emergency funds. In the case of the profiles of women, the most important attributes were their civil status and their participation in the workforce. The largest group of women are housewives, though the second largest group consists of married women who also participate in the workforce.
En caso de no especificar algo distinto, estos materiales son compartidos bajo los siguientes términos: Atribución-No comercial-No derivadas CC BY-NC-ND http://www.creativecommons.mx/#licencias
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2025

Licencia