Environmental monitoring to estimate indoor occupancy levels based on Semi-supervised machine learning and data fusion for building management

dc.audience.educationlevelEmpresas/Companies
dc.contributor.advisorCeballos Cancino, Hector Gibran
dc.contributor.authorMena Martínez, Alma Rosa
dc.contributor.catalogeremipsanchez
dc.contributor.committeememberAlvarado Uribe, Joanna
dc.contributor.committeememberCantu, Francisco J.
dc.contributor.committeememberGarcía, Juan Pablo
dc.contributor.departmentSchool of Engineering and Sciences
dc.contributor.institutionCampus Monterrey
dc.contributor.mentorSchmitt, Jan
dc.date.accepted2024-12-03
dc.date.accessioned2025-01-08T04:04:15Z
dc.date.issued2024-12-03
dc.descriptionhttps://orcid.org/0000-0002-2460-3442
dc.description.abstractOccupancy information is essential for space management, energy efficiency, and in times of the COVID-19 pandemic, for crowd control. Obtaining labeled data is challenging due to hardware limitations, privacy considerations, and the required underlying costs. Furthermore, venues over 200 m2 require data fusion techniques. Therefore, this thesis mainly focuses on exploring the potential of Semi-Supervised Learning (SSL), which only needs a few labeled data and a large amount of unlabeled data, to estimate the occupancy levels in enclosed spaces. This study presents an empirical comparison between Supervised ML and SSL models as well as data fusion techniques in real-life university classrooms and offices (uncontrolled conditions) at the University of the West of England, Bristol, UK, and Tecnologico de Monterrey, Mexico. The data was collected for three weeks at each scenario using an in-house developed Internet of Things (IoT) device that measures air temperature, relative humidity, and atmospheric pressure. The ground truth records were gathered through manual logging of occupancy levels. Datasets’ sizes averaged 2350 entries with only 280 labeled instances per dataset. Support Vector Machine (SVM), Random Forest (RF), and Multi-Layer Perceptron (MLP) were used to define a performance baseline for supervised ML. Self-Training (ST) and Label Propagation (LP) were tested for SSL. In addition, several feature fusion methods were explored, including Chi-squared, ANOVA F-test, Spearman and Kendall’s Tau correlation, Mutual Information, Averages, Recursive Feature Elimination, and Principal Component Analysis. The models were evaluated using Accuracy, Precision, Recall, F1-score, Confusion Matrix, and High - Quality Supervised Baseline. ST achieved superior performance compared to baseline models (SVM, RF, MLP) with a highest average accuracy of 90.96% compared to SVM (86.66%). Furthermore, the data fusion results indicated that the Chi-squared approach for feature fusion outperformed others with an F1-score average of 95% and an accuracy average of 99%. These results demonstrate the effectiveness of SSL for indirect occupancy estimation while reducing the need for extensive data collection and labeling.
dc.description.degreeDoctor of Philosophy in Computer Science
dc.format.mediumTexto
dc.identifier.citationMena Martínez, A. R. (2024). Environmental monitoring to estimate indoor occupancy levels based on Semi-supervised machine learning and data fusion for building management [Tesis doctoral]. Instituto Tecnológico y de Estudios Superiores de Monterrey. Recuperado de: https://hdl.handle.net/11285/702990
dc.identifier.cvu920335
dc.identifier.orcidhttps://orcid.org/0000-0003-4471-4446
dc.identifier.urihttps://hdl.handle.net/11285/702990
dc.language.isoeng
dc.publisherInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationInstituto Tecnológico y de Estudios Superiores de Monterrey
dc.relationCONAHCYT
dc.relation.isFormatOfacceptedVersion
dc.rightsopenAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0
dc.subject.classificationINGENIERÍA Y TECNOLOGÍA::CIENCIAS TECNOLÓGICAS::TECNOLOGÍA DE LOS ORDENADORES::DISEÑO CON AYUDA DE ORDENADOR
dc.subject.keywordIndoor occupancy
dc.subject.keywordSemi-supervised machine learning
dc.subject.keywordSelf-training
dc.subject.keywordData
dc.subject.lcshTechnology
dc.titleEnvironmental monitoring to estimate indoor occupancy levels based on Semi-supervised machine learning and data fusion for building management
dc.typeTesis de doctorado

Files

Original bundle

Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
MenaMartinez_TesisDoctoradopdfa.pdf
Size:
12.89 MB
Format:
Adobe Portable Document Format
Description:
Tesis Doctorado
Loading...
Thumbnail Image
Name:
MenaMartinez_ActaGradoDeclaracionAutoriapdfa.pdf
Size:
493.67 KB
Format:
Adobe Portable Document Format
Description:
Acta de Grado y Declaración de Autoría
Loading...
Thumbnail Image
Name:
MenaMartinez_CartaFirmadapdfa.pdf
Size:
102.09 KB
Format:
Adobe Portable Document Format
Description:
Carta Firmada
Loading...
Thumbnail Image
Name:
MenaMartinez_CartaAutorizacionpdf.pdf
Size:
5.78 MB
Format:
Adobe Portable Document Format
Description:
Carta Autorización

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.28 KB
Format:
Item-specific license agreed upon to submission
Description:
logo

El usuario tiene la obligación de utilizar los servicios y contenidos proporcionados por la Universidad, en particular, los impresos y recursos electrónicos, de conformidad con la legislación vigente y los principios de buena fe y en general usos aceptados, sin contravenir con su realización el orden público, especialmente, en el caso en que, para el adecuado desempeño de su actividad, necesita reproducir, distribuir, comunicar y/o poner a disposición, fragmentos de obras impresas o susceptibles de estar en formato analógico o digital, ya sea en soporte papel o electrónico. Ley 23/2006, de 7 de julio, por la que se modifica el texto revisado de la Ley de Propiedad Intelectual, aprobado

DSpace software copyright © 2002-2025

Licencia