pub H-BRS | 006 Spezielle Computerverfahren

Towards Robust and Interpretable Practical Applications of Automatic Mental State Analysis Using a Dynamic and Hybrid Facial Action Estimation Approach (2020)

Hassan, Teena

This dissertation presents a probabilistic state estimation framework for integrating data-driven machine learning models and a deformable facial shape model in order to estimate continuous-valued intensities of 22 different facial muscle movements, known as Action Units (AU), defined in the Facial Action Coding System (FACS). A practical approach is proposed and validated for integrating class-wise probability scores from machine learning models within a Gaussian state estimation framework. Furthermore, driven mass-spring-damper models are applied for modelling the dynamics of facial muscle movements. Both facial shape and appearance information are used for estimating AU intensities, making it a hybrid approach. Several features are designed and explored to help the probabilistic framework to deal with multiple challenges involved in automatic AU detection. The proposed AU intensity estimation method and its features are evaluated quantitatively and qualitatively using three different datasets containing either spontaneous or acted facial expressions with AU annotations. The proposed method produced temporally smoother estimates that facilitate a fine-grained analysis of facial expressions. It also performed reasonably well, even though it simultaneously estimates intensities of 22 AUs, some of which are subtle in expression or resemble each other closely. The estimated AU intensities tended to the lower range of values, and were often accompanied by a small delay in onset. This shows that the proposed method is conservative. In order to further improve performance, state-of-the-art machine learning approaches for AU detection could be integrated within the proposed probabilistic AU intensity estimation framework.

Towards Designing Privacy-Compliant Social Robots for Use in Private Households: A Use Case Based Identification of Privacy Implications and Potential Technical Measures for Mitigation (2020)

Horstmann, Bjorn ; Diekmann, Niels ; Buschmeier, Hendrik ; Hassan, Teena

Supplementary material for the publication “Towards Designing Privacy-Compliant Social Robots for Use in Private Households: A Use Case Based Identification of Privacy Implications and Potential Technical Measures for Mitigation” (2020)

Horstmann, Björn ; Diekmann, Niels ; Buschmeier, Hendrik ; Hassan, Teena

Machine Learning and End-to-End Deep Learning for Monitoring Driver Distractions From Physiological and Visual Signals (2020)

Gjoreski, Martin ; Gams, Matja Z. ; Lustrek, Mitja ; Genc, Pelin ; Garbas, Jens-U. ; Hassan, Teena

It is only a matter of time until autonomous vehicles become ubiquitous; however, human driving supervision will remain a necessity for decades. To assess the drive's ability to take control over the vehicle in critical scenarios, driver distractions can be monitored using wearable sensors or sensors that are embedded in the vehicle, such as video cameras. The types of driving distractions that can be sensed with various sensors is an open research question that this study attempts to answer. This study compared data from physiological sensors (palm electrodermal activity (pEDA), heart rate and breathing rate) and visual sensors (eye tracking, pupil diameter, nasal EDA (nEDA), emotional activation and facial action units (AUs)) for the detection of four types of distractions. The dataset was collected in a previous driving simulation study. The statistical tests showed that the most informative feature/modality for detecting driver distraction depends on the type of distraction, with emotional activation and AUs being the most promising. The experimental comparison of seven classical machine learning (ML) and seven end-to-end deep learning (DL) methods, which were evaluated on a separate test set of 10 subjects, showed that when classifying windows into distracted or not distracted, the highest F1-score of 79%; was realized by the extreme gradient boosting (XGB) classifier using 60-second windows of AUs as input. When classifying complete driving sessions, XGB's F1-score was 94%. The best-performing DL model was a spectro-temporal ResNet, which realized an F1-score of 75%; when classifying segments and an F1-score of 87%; when classifying complete driving sessions. Finally, this study identified and discussed problems, such as label jitter, scenario overfitting and unsatisfactory generalization performance, that may adversely affect related ML approaches.

Towards an Interaction-Centered and Dynamically Constructed Episodic Memory for Social Robots (2020)

Hassan, Teena ; Kopp, Stefan

Forschungsprojekt beyondSPAI - Sichere Überwachung adaptiver Schutzräume im nahen Wirkungsbereich von Kollaborierenden Industrierobotern u.a. mittels intelligentem NIR-Kamerasystem (2020)

Hammer, Christof ; Jung, Norbert

Kollaborative Industrieroboter werden für produzierende Unternehmen immer kosteneffizienter. Während diese Systeme für den menschlichen Mitarbeiter eine große Hilfe sein können, stellen sie gleichzeitig ein ernstes Gesundheitsrisiko dar, wenn die zwingend notwendigen Sicherheitsmaßnahmen nur unzureichend umgesetzt werden. Herkömmliche Sicherheitseinrichtungen wie Zäune oder Lichtvorhänge bieten einen guten Schutz, aber solch statische Schutzvorrichtungen sind in neuen, hochdynamischen Arbeitsszenarien problematisch. Im Forschungsprojekt BeyondSPAI wurde ein Funktionsmuster eines Multisensorsystems zur Absicherung solcher dynamischer Arbeitsszenarien entworfen, implementiert und im Feld getestet. Kern des Systems ist eine robuste optische Materialklassifikation, die mit Hilfe eines intelligenten InGaAs-Kamerasystems Haut von anderen typischen Werkstückoberflächen (z.B. Holz, Metalle od. Kunststoffe) unterscheiden kann. Diese einzigartige Eigenschaft wird genutzt, um menschliche Mitarbeiter zuverlässig zu erkennen, so dass ein konventioneller Roboter in Folge als personenbewusster Cobot arbeiten kann. Das System ist modular und kann leicht mit weiteren Sensoren verschiedenster Art erweitert werden. Es kann an verschiedene Marken von Industrierobotern angepasst werden und lässt sich schnell an bestehenden Robotersystemen integrieren. Die vier vom System bereitgestellten Sicherheitsausgänge können dazu verwendet werden - abhängig von der durchdrungenen Überwachungszone - entweder eine Warnung auszugeben, die Bewegung des Roboters auf eine sichere Geschwindigkeit zu verlangsamen, oder den Roboter sicher anzuhalten. Sobald alle Zonen wieder als „eindeutig frei von Personen“ identifiziert sind, kann der Roboter wieder beschleunigen, seine ursprüngliche Bewegung wiederaufnehmen und die Arbeit fortsetzen.

Einsatz von Künstlicher Intelligenz im internationalen Spitzensport – Eine Erhebung des Status Quo (2020)

Hammes, Fabian ; Link, Daniel ; Lames, Martin ; Hagg, Alexander ; Asteroth, Alexander ; Pfeiffer, Mark

Anhand eines Literaturreviews sowie Interviews mit ausgewählten Expert/innen aus 7 Ländern wird der aktuelle Stand der KI im internationalen Spitzensport beschrieben. Die Ergebnisse zeigen primär Aktivitäten in den Bereichen Bilderkennung, Signalverarbeitung und Regelerkennung.

Einsatzmöglichkeiten und Transfer von Künstlicher Intelligenz im internationalen Spitzensport – zwischen Small und Big Data (2020)

Hagg, Alexander ; Asteroth, Alexander ; Pfeiffer, Mark ; Hammes, Fabian ; Link, Daniel

Ausgehend von einer Sichtung aktueller Ergebnisse in der KI werden Faktoren identifiziert, die zur erfolgreichen Anwendung führen. Ein Abgleich mit den Gegebenheiten im Spitzen-sport führt zur Identifikation potenzieller Handlungsfelder.

Hey Human, If your Facial Emotions are Uncertain, You Should Use Bayesian Neural Networks! (2020)

Matin, Maryam ; Valdenegro-Toro, Matias

Facial emotion recognition is the task to classify human emotions in face images. It is a difficult task due to high aleatoric uncertainty and visual ambiguity. A large part of the literature aims to show progress by increasing accuracy on this task, but this ignores the inherent uncertainty and ambiguity in the task. In this paper we show that Bayesian Neural Networks, as approximated using MC-Dropout, MC-DropConnect, or an Ensemble, are able to model the aleatoric uncertainty in facial emotion recognition, and produce output probabilities that are closer to what a human expects. We also show that calibration metrics show strange behaviors for this task, due to the multiple classes that can be considered correct, which motivates future work. We believe our work will motivate other researchers to move away from Classical and into Bayesian Neural Networks.

Research Project beyondSPAI - The Safe and Reliable Monitoring of Adaptive Safety Zones in the Proximity of Collaborating Industrial Robots Using an Intelligent InGaAs Camera System (2020)

Hammer, Christof ; Jung, Norbert

FaceHaptics: Robot Arm based Versatile Facial Haptics for Immersive Environments (2020)

Wilberz, Alexander ; Leschtschow, Dominik ; Trepkowski, Christina ; Maiero, Jens ; Kruijff, Ernst ; Riecke, Bernhard

This paper introduces FaceHaptics, a novel haptic display based on a robot arm attached to a head-mounted virtual reality display. It provides localized, multi-directional and movable haptic cues in the form of wind, warmth, moving and single-point touch events and water spray to dedicated parts of the face not covered by the head-mounted display.The easily extensible system, however, can principally mount any type of compact haptic actuator or object. User study 1 showed that users appreciate the directional resolution of cues, and can judge wind direction well, especially when they move their head and wind direction is adjusted dynamically to compensate for head rotations. Study 2 showed that adding FaceHaptics cues to a VR walkthrough can significantly improve user experience, presence, and emotional responses.

Foreword to the Special Section on the Symposium on Virtual and Augmented Reality 2019 (SVR 2019) (2020)

Hinkenjann, André ; Sandor, Christian ; Teixeira, João Marcelo ; Rieder, Rafael

Open Access

006 Spezielle Computerverfahren

Refine

H-BRS Bibliography

Departments, institutes and facilities

Document Type

Year of publication

Language

Has Fulltext

Keywords

12 search hits