End-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications

dc.audience.educationlevelInvestigadores/Researchers
dc.audience.educationlevelMaestros/Teachers
dc.audience.educationlevelEstudiantes/Students
dc.audience.educationlevelOtros/Other
dc.contributor.advisorConant Pablos, Santiago Enrique
dc.contributor.authorSalazar Vasquez, Fredy Antonio
dc.contributor.catalogeremipsanchez
dc.contributor.committeememberOrtiz bayliss, José Carlos
dc.contributor.departmentSchool of Engineering and Sciences
dc.contributor.institutionCampus Monterrey
dc.date.accepted2025-08-13
dc.date.accessioned2025-08-16T13:12:21Z
dc.date.issued2025-05-27
dc.descriptionhttps://orcid.org/0000-0001-6270-3164
dc.description.abstractIn recent years, surveillance systems have played an increasingly prominent role in both public and private settings. These systems monitor activities in real time and provide information to security personnel and authorities. Their constant observation helps prevent incidents and maintain order. Traditional surveillance systems record events but do not fully exploit the valuable information they capture. New technologies allow valuable data to be extracted, turning surveillance into an active tool for security. With the development of tools like object detection, pose estimation, and neural networks, surveillance systems can now interpret the scenes they capture. Rather than simply recording footage, these systems are becoming active participants in security by extracting meaningful information from visual data. Despite these advances, it remains a challenge to identify violent acts using visual information. The main challenge is to analyze the data in a way that identifies risks. Although cameras capture lot of information, traditional systems do not always use them preventively. These systems must predict risky situations by detecting aggressive behavior or suspicious activities early. This work primarily focuses on addressing the development of techniques to improve the detection of violence in surveillance videos by optimizing specific processes such as pedestrian detection, human posture estimation, object tracking, and violent behavior classification. Pedestrian detection is optimized using advanced models like YOLO, enhancing accuracy in high-density environments. Posture estimation is improved through advanced pose detection algorithms that reduce manual intervention. Object tracking is enhanced by implementing Deep SORT to maintain reliable identity tracking across video frames. Violent behavior classification is fine-tuned using a deep neural network architecture based on Gated Recurrent Units (GRU), which captures temporal movement patterns. Video footage from the KranokNV database is processed to identify joint angles of pedestrians, and the VID dataset is used to evaluate system performance. This integrated approach aims to achieve faster, more accurate, and more reliable detection of violent situations, contributing to public safety. Additionally, the evaluation considers spatial and temporal features, such as velocity, acceleration, motion energy, abrupt changes, symmetry, and expansion radius. The processed data was smoothed with the Kalman filter, achieving an accuracy of 99.44%. The results indicate continuous detection capability and improvement in generalization throughout the training process.
dc.description.degreeMaster of Science in Computer Science
dc.format.mediumTexto
dc.identificator120304
dc.identificator330417
dc.identifier.citationSalazar Vasquez, F. A. (2025). End-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications [Tesis maestría]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/703992
dc.identifier.orcidhttps://orcid.org/0009-0001-0664-2309
dc.identifier.urihttps://hdl.handle.net/11285/703992
dc.language.isoeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relation.isFormatOfacceptedVersion
dc.rightsopenAccess
dc.rights.urihttp://creativecommons.org/licenses/by/4.0
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::INTELIGENCIA ARTIFICIAL
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::SISTEMAS EN TIEMPO REAL
dc.subject.keywordViolence detection
dc.subject.keywordPedestriand detection
dc.subject.keywordPose estimation
dc.subject.keywordObject tracking
dc.subject.keywordNeural networks
dc.subject.keywordGru
dc.subject.keywordCnn
dc.subject.lcshTechnology
dc.titleEnd-to-End Violence Detection Using Pedestrian Detection, Pose Estimation, and Temporal GRUs for Surveillance Applications
dc.typeTesis de maestría

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
SalazarVasquez_TesisMaestria_pdfa.pdf
Size:
9.77 MB
Format:
Adobe Portable Document Format
Description:
Tesis Maestría
Loading...
Thumbnail Image
Name:
SalazarVasquez_ActaGradoDeclaracionAutoria_pdfa.pdf
Size:
308.43 KB
Format:
Adobe Portable Document Format
Description:
Acta de Grado y Declaración de Autoría
Loading...
Thumbnail Image
Name:
SalazarVasquez_CartaAutorizacion_pdfa.pdf
Size:
69.85 KB
Format:
Adobe Portable Document Format
Description:
Carta Autorización

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.28 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2026

Licencia