TY - JOUR U1 - Wissenschaftlicher Artikel A1 - Shajalal, Md A1 - Bohlouli, Milad A1 - Das, Hari Prasanna A1 - Boden, Alexander A1 - Stevens, Gunnar T1 - Improved Thermal Comfort Model Leveraging Conditional Tabular GAN Focusing on Feature Selection JF - IEEE Access N2 - The indoor thermal comfort in both homes and workplaces significantly influences the health and productivity of inhabitants. The heating system, controlled by Artificial Intelligence (AI), can automatically calibrate the indoor thermal condition by analyzing various physiological and environmental variables. To ensure a comfortable indoor environment, smart home systems can adjust parameters related to thermal comfort based on accurate predictions of inhabitants’ preferences. Modeling personal thermal comfort preferences poses two significant challenges: the inadequacy of data and its high dimensionality. An adequate amount of data is a prerequisite for training efficient machine learning (ML) models. Additionally, high-dimensional data tends to contain multiple irrelevant and noisy features, which might hinder ML models’ performance. To address these challenges, we propose a framework for predicting personal thermal comfort preferences, combining the conditional tabular generative adversarial network (CTGAN) with multiple feature selection techniques. We first address the data inadequacy challenge by applying CTGAN to generate synthetic data samples, incorporating challenges associated with multimodal distributions and categorical features. Then, multiple feature selection techniques are employed to identify the best possible sets of features. Experimental results based on a wide range of settings on a standard dataset demonstrated state-of-the-art performance in predicting personal thermal comfort preferences. The results also indicated that ML models trained on synthetic data achieved significantly better performance than models trained on real data. Overall, our method, combining CTGAN and feature selection techniques, outperformed existing known related work in thermal comfort prediction in terms of multiple evaluation metrics, including area under the curve (AUC), Cohen’s Kappa, and accuracy. Additionally, we presented a global, model-agnostic explanation of the thermal preference prediction system, providing an avenue for thermal comfort experiment designers to consciously select the data to be collected. KW - Personal thermal comfort KW - generative adversarial network KW - feature selection KW - machine learning KW - data inadequacy Y1 - 2024 UN - https://nbn-resolving.org/urn:nbn:de:hbz:1044-opus-78244 SN - 2169-3536 SS - 2169-3536 U6 - https://doi.org/10.1109/ACCESS.2024.3366453 DO - https://doi.org/10.1109/ACCESS.2024.3366453 VL - 12 SP - 30039 EP - 30053 PB - IEEE ER -