Architecture for a named entity recognition and relation extraction model using word embeddings variations for building a dynamic skills taxonomy
| dc.audience.educationlevel | Investigadores/Researchers | |
| dc.audience.educationlevel | Estudiantes/Students | |
| dc.audience.educationlevel | Otros/Other | |
| dc.contributor.advisor | Noguez Monroy, Juana Julieta | |
| dc.contributor.author | González Gómez, Luis José | |
| dc.contributor.cataloger | emimmayorquin | |
| dc.contributor.committeemember | González Nucamendi, Andrés | |
| dc.contributor.committeemember | Valverde Rebaza, Jorge Carlos | |
| dc.contributor.committeemember | Benes, Bedrich | |
| dc.contributor.committeemember | Caratozzolo Martelliti, Patricia Olga | |
| dc.contributor.department | School of Engineering and Sciences | es_MX |
| dc.contributor.institution | Campus Ciudad de México | es_MX |
| dc.date.accepted | 2023-12-01 | |
| dc.date.accessioned | 2025-05-15T00:29:47Z | |
| dc.date.issued | 2023-11 | |
| dc.description | https://orcid.org/0000-0002-6000-3452 | |
| dc.description.abstract | In this work, we present an architecture that uses Natural Language Processing (NLP) techniques to extract meaningful insights from unstructured documents and maintain a dynamic taxonomy of skills. NLP methods such as Named Entity Recognition (NER) and Relation Extraction (RE) enable computers to find entities of interest, such as skills and occupations, in unstructured documents related to the jobs industry and to the current and future state of job Knowledge, Skills and Abilities (KSA). A taxonomy of skills seeks to reflect the relations between occupations and the skills, knowledge and abilities required to perform them. It also aims to account for current and future changes in those relations. To do so, a Relation Extraction model is proposed. This model is trained to find relations between entities such as skills and occupations, which it achieves by learning a general understanding of how skills, knowledge, abilities and occupations relate. These skills and occupations form the base for a hierarchically organized structure of concepts, visualized as a taxonomy. Current skills taxonomies are static and often built upon collected data that quickly becomes outdated. Reports from the World Economic Forum (WEF) and the Organisation for Economic Co-operation and Development (OECD) signal mismatches between current KSAs and future requirements due to emerging occupations and re-skilling needs. The architecture presented in this thesis enables a dynamic taxonomy capable of reflecting increasing, declining and mismatched skills in relation to distinct occupations. The results of its application are promising in terms of the models' performance and accuracy. The architecture has also proven effective in providing an end-to-end pipeline covering text collection, pre-processing, natural language processing and final visualization. | es_MX |
| dc.description.degree | Doctor en Ciencias Computacionales (Doctor of Computer Science) | es_MX |
| dc.format.medium | Text | es_MX |
| dc.identificator | 120304 | |
| dc.identifier.citation | González Gómez, L. J. (2023, November). Architecture for a named entity recognition and relation extraction model using word embeddings variations for building a dynamic skills taxonomy [Doctoral thesis]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Retrieved from: https://hdl.handle.net/11285/703665 | |
| dc.identifier.cvu | 453672 | es_MX |
| dc.identifier.orcid | https://orcid.org/0009-0005-8843-0359 | |
| dc.identifier.uri | https://hdl.handle.net/11285/703665 | |
| dc.language.iso | eng | es_MX |
| dc.publisher | Instituto Tecnológico y de Estudios Superiores de Monterrey | es_MX |
| dc.relation | Instituto Tecnológico y de Estudios Superiores de Monterrey | |
| dc.relation | CONAHCYT | |
| dc.relation.isFormatOf | acceptedVersion | es_MX |
| dc.rights | openAccess | es_MX |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0 | es_MX |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIAL | |
| dc.subject.keyword | Natural language processing | |
| dc.subject.keyword | Named entity recognition | |
| dc.subject.keyword | Relation extraction | |
| dc.subject.keyword | Skills taxonomy | |
| dc.subject.keyword | KSA (knowledge, skills, abilities) | |
| dc.subject.lcsh | Science | |
| dc.subject.lcsh | Technology | |
| dc.title | Architecture for a named entity recognition and relation extraction model using word embeddings variations for building a dynamic skills taxonomy | es_MX |
| dc.type | Tesis Doctorado / Doctoral Thesis | es_MX |
Files
Original bundle
- Name: GonzalezGomez_TesisDoctorado.pdf
- Size: 1.67 MB
- Format: Adobe Portable Document Format
- Description: Doctoral thesis

- Name: GonzalezGomez_CartaAutorizacion.pdf
- Size: 71.87 KB
- Format: Adobe Portable Document Format
- Description: Authorization letter

- Name: GonzalezGomez_FirmasActadeGrado.pdf
- Size: 3.48 MB
- Format: Adobe Portable Document Format
- Description: Degree certificate signatures

License bundle
- Name: license.txt
- Size: 1.3 KB
- Format: Item-specific license agreed upon to submission