End-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications
| dc.audience.educationlevel | Investigadores/Researchers | |
| dc.audience.educationlevel | Maestros/Teachers | |
| dc.audience.educationlevel | Estudiantes/Students | |
| dc.audience.educationlevel | Otros/Other | |
| dc.contributor.advisor | Conant Pablos, Santiago Enrique | |
| dc.contributor.author | Salazar Vasquez, Fredy Antonio | |
| dc.contributor.cataloger | emipsanchez | |
| dc.contributor.committeemember | Ortiz bayliss, José Carlos | |
| dc.contributor.department | School of Engineering and Sciences | |
| dc.contributor.institution | Campus Monterrey | |
| dc.date.accepted | 2025-08-13 | |
| dc.date.accessioned | 2025-08-16T13:12:21Z | |
| dc.date.issued | 2025-05-27 | |
| dc.description | https://orcid.org/0000-0001-6270-3164 | |
| dc.description.abstract | In recent years, surveillance systems have played an increasingly prominent role in both public and private settings. These systems monitor activities in real time and provide information to security personnel and authorities. Their constant observation helps prevent incidents and maintain order. Traditional surveillance systems record events but do not fully exploit the valuable information they capture. New technologies allow valuable data to be extracted, turning surveillance into an active tool for security. With the development of tools like object detection, pose estimation, and neural networks, surveillance systems can now interpret the scenes they capture. Rather than simply recording footage, these systems are becoming active participants in security by extracting meaningful information from visual data. Despite these advances, it remains a challenge to identify violent acts using visual information. The main challenge is to analyze the data in a way that identifies risks. Although cameras capture lot of information, traditional systems do not always use them preventively. These systems must predict risky situations by detecting aggressive behavior or suspicious activities early. This work primarily focuses on addressing the development of techniques to improve the detection of violence in surveillance videos by optimizing specific processes such as pedestrian detection, human posture estimation, object tracking, and violent behavior classification. Pedestrian detection is optimized using advanced models like YOLO, enhancing accuracy in high-density environments. Posture estimation is improved through advanced pose detection algorithms that reduce manual intervention. Object tracking is enhanced by implementing Deep SORT to maintain reliable identity tracking across video frames. Violent behavior classification is fine-tuned using a deep neural network architecture based on Gated Recurrent Units (GRU), which captures temporal movement patterns. Video footage from the KranokNV database is processed to identify joint angles of pedestrians, and the VID dataset is used to evaluate system performance. This integrated approach aims to achieve faster, more accurate, and more reliable detection of violent situations, contributing to public safety. Additionally, the evaluation considers spatial and temporal features, such as velocity, acceleration, motion energy, abrupt changes, symmetry, and expansion radius. The processed data was smoothed with the Kalman filter, achieving an accuracy of 99.44%. The results indicate continuous detection capability and improvement in generalization throughout the training process. | |
| dc.description.degree | Master of Science in Computer Science | |
| dc.format.medium | Texto | |
| dc.identificator | 120304 | |
| dc.identificator | 330417 | |
| dc.identifier.citation | Salazar Vasquez, F. A. (2025). End-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications [Tesis maestría]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703992 | |
| dc.identifier.orcid | https://orcid.org/0009-0001-0664-2309 | |
| dc.identifier.uri | https://hdl.handle.net/11285/703992 | |
| dc.language.iso | eng | |
| dc.publisher | Instituto Tecnológico y de Estudios Superiores de Monterrey | |
| dc.relation.isFormatOf | acceptedVersion | |
| dc.rights | openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0 | |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIAL | |
| dc.subject.classification | INGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::SISTEMAS EN TIEMPO REAL | |
| dc.subject.keyword | Violence detection | |
| dc.subject.keyword | Pedestriand detection | |
| dc.subject.keyword | Pose estimation | |
| dc.subject.keyword | Object tracking | |
| dc.subject.keyword | Neural networks | |
| dc.subject.keyword | Gru | |
| dc.subject.keyword | Cnn | |
| dc.subject.lcsh | Technology | |
| dc.title | End-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications | |
| dc.type | Tesis de maestría |
Files
Original bundle
1 - 3 of 3
Loading...
- Name:
- SalazarVasquez_TesisMaestria_pdfa.pdf
- Size:
- 9.77 MB
- Format:
- Adobe Portable Document Format
- Description:
- Tesis Maestría
Loading...
- Name:
- SalazarVasquez_ActaGradoDeclaracionAutoria_pdfa.pdf
- Size:
- 308.43 KB
- Format:
- Adobe Portable Document Format
- Description:
- Acta de Grado y Declaración de Autoría
Loading...
- Name:
- SalazarVasquez_CartaAutorizacion_pdfa.pdf
- Size:
- 69.85 KB
- Format:
- Adobe Portable Document Format
- Description:
- Carta Autorización
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.28 KB
- Format:
- Item-specific license agreed upon to submission
- Description:

