pub H-BRS | 006 Special computer methods (Spezielle Computerverfahren)

»Wie fälscht man Fingerabdrücke, Herr Jung?« [How do you forge fingerprints, Mr. Jung?] (2021)

Vection underwater (2022)

Original dataset. Tasks: two self-motion perception tasks (Move-to-Target and Adjust-Target) and a Size/Distance Perception task. Conditions: in-pool/in-lab and supine/upright. Self-reported vection experience.

Using Visual and Auditory Cues to Locate Out-of-View Objects in Head-Mounted Augmented Reality (2021)

Binetti, Nicola ; Wu, Luyan ; Chen, Shiping ; Kruijff, Ernst ; Julier, Simon ; Brumby, Duncan P.

The Influence of Gravity on Perceived Travel Distance in Virtual Reality (2022)

Bury, Nils-Alexander ; Harris, Laurence R. ; Jenkin, Michael ; Allison, Robert S. ; Felsner, Sandra ; Herpers, Rainer

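Telepräsenz-Roboter im häuslichen Lebens- und Pflegearrangement von Personen mit Demenz im ländlichen Raum (RoboLand) [Telepresence robots in the home living and care arrangements of people with dementia in rural areas (RoboLand)] (2021)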

Bleses, Helma M. ; Prassler, Erwin ; Dammert, Matthias ; Steinacker, Anna ; Nagel, Patrick ; Schöbel, Maximilian

Study Results of a skin detecting Safety Sensor on Circular Saws (2010)

Jung, Norbert ; Schwaneberg, Oliver ; Adam, Peter

STonKGs: A Sophisticated Transformer Trained on Biomedical Text and Knowledge Graphs (2021)

Balabin, Helena ; Hoyt, Charles Tapley ; Birkenbihl, Colin ; Gyori, Benjamin M. ; Bachman, John ; Tom Kodamullil, Alpha ; Plöger, Paul G. ; Hofmann-Apitius, Martin ; Domingo-Fernández, Daniel

The majority of biomedical knowledge is stored in structured databases or as unstructured text in scientific publications. This vast amount of information has led to numerous machine learning-based biological applications using either text through natural language processing (NLP) or structured data through knowledge graph embedding models (KGEMs). However, representations based on a single modality are inherently limited. To generate better representations of biological knowledge, we propose STonKGs, a Sophisticated Transformer trained on biomedical text and Knowledge Graphs. This multimodal Transformer uses combined input sequences of structured information from KGs and unstructured text data from biomedical literature to learn joint representations. First, we pre-trained STonKGs on a knowledge base assembled by the Integrated Network and Dynamical Reasoning Assembler (INDRA) consisting of millions of text-triple pairs extracted from biomedical literature by multiple NLP systems. Then, we benchmarked STonKGs against two baseline models trained on either one of the modalities (i.e., text or KG) across eight different classification tasks, each corresponding to a different biological application. Our results demonstrate that STonKGs outperforms both baselines, especially on the more challenging tasks with respect to the number of classes, improving upon the F1-score of the best baseline by up to 0.083. Additionally, our pre-trained model as well as the model architecture can be adapted to various other transfer learning applications. Finally, the source code and pre-trained STonKGs models are available at https://github.com/stonkgs/stonkgs and https://huggingface.co/stonkgs/stonkgs-150k.

Selection Performance and Reliability of Eye and Head Gaze Tracking Under Varying Light Conditions (2024)

Marquardt, Alexander ; Steininger, Melissa ; Trepkowski, Christina ; Weier, Martin ; Kruijff, Ernst

Research Project beyondSPAI - The Safe and Reliable Monitoring of Adaptive Safety Zones in the Proximity of Collaborating Industrial Robots Using an Intelligent InGaAs Camera System (2020)

Hammer, Christof ; Jung, Norbert

Reasoning of Intelligent Virtual Agents with Exceptional Behavior in Rare Scenarios (2023)

Lysek, Alexander ; Seele, Sven ; Herpers, Rainer

Push-buttons with Material Classification based on Spectral Signatures (2010)

Schwaneberg, Oliver ; Steiner, Holger ; Jung, Norbert ; Reinert, Dietmar

In this paper, we introduce an optical sensor system that is integrated into an industrial push-button. When the button is pressed, the sensor classifies the material in contact with it into one of several material categories on the basis of the material's so-called "spectral signature". An approach for a safety sensor system for circular table saws based on the same principle was introduced previously at SIAS-2007. This contactless sensor is able to distinguish reliably between skin, textiles, leather and various other kinds of materials. A typical application for this intelligent push-button is on potentially dangerous machines whose operating instructions either prohibit or require wearing gloves while working at the machine. An example of machines at which gloves are not allowed is the pillar drilling machine, because of the risk of getting caught in the drill chuck and being pulled in by the machine. In many cases this causes very serious hand injuries. Depending on the application's needs, the sensor system integrated into the push-button can be configured flexibly in software to prevent the operator from accidentally starting a machine with or without gloves, which can significantly decrease the risk of severe accidents. Two-hand controls in particular invite manipulation for the sake of easier handling. By equipping both push-buttons of a two-hand control with material classification, the user is forced to operate the controls with bare fingers. This restriction prevents the manipulation of a two-hand control with a simple rod-like device.
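The software-configurable start logic described in this abstract could look roughly like the following. This is only an illustrative sketch, not the authors' implementation: the names `Material`, `GlovePolicy`, `press_allowed` and `two_hand_start_allowed` are hypothetical, and treating textiles and leather as "glove" materials is an assumption made for the example.

```python
from enum import Enum


class Material(Enum):
    # Material categories named in the abstract; OTHER stands for "various other kinds".
    SKIN = "skin"
    TEXTILE = "textile"
    LEATHER = "leather"
    OTHER = "other"


class GlovePolicy(Enum):
    # Hypothetical configuration values for a single push-button.
    GLOVES_PROHIBITED = "gloves_prohibited"  # e.g. pillar drilling machines
    GLOVES_REQUIRED = "gloves_required"


# Assumption for this sketch: textile and leather readings indicate a glove.
GLOVE_MATERIALS = {Material.TEXTILE, Material.LEATHER}


def press_allowed(material: Material, policy: GlovePolicy) -> bool:
    """Return True if a button press with the classified material may start the machine."""
    if policy is GlovePolicy.GLOVES_PROHIBITED:
        return material is Material.SKIN
    return material in GLOVE_MATERIALS


def two_hand_start_allowed(left: Material, right: Material) -> bool:
    """Two-hand control: both buttons must be pressed with bare skin."""
    return left is Material.SKIN and right is Material.SKIN
```

With such a rule, pressing one button of a two-hand control with a rod (classified as `Material.OTHER`) would be rejected, because both buttons must report skin.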

Perception-driven rendering : techniques for the efficient visualization of 3D scenes including view- and gaze-contingent approaches (2019)

Weier, Martin

Computer graphics research strives to synthesize images of high visual realism that are indistinguishable from real visual experiences. While modern image synthesis approaches make it possible to create digital images of astonishing complexity and beauty, processing resources remain a limiting factor. Here, rendering efficiency is a central challenge involving a trade-off between visual fidelity and interactivity. For that reason, there is still a fundamental difference between the perception of the physical world and computer-generated imagery. At the same time, advances in display technologies drive the development of novel display devices. Dynamic range, pixel densities, and refresh rates are constantly increasing. Display systems address a larger part of the visual field by covering a wider field of view, either through their size or in the form of head-mounted devices. Current research prototypes range from stereo and multi-view systems and head-mounted devices with adaptable lenses to retinal projection and lightfield/holographic displays. Computer graphics has to keep pace with these developments, as driving these devices presents us with immense challenges, most of which are currently unsolved. Fortunately, the human visual system has certain limitations, which means that providing the highest possible visual quality is not always necessary. Visual input passes through the eye’s optics, is filtered, and is processed in higher-level structures in the brain. Knowledge of these processes helps to design novel rendering approaches that allow the creation of images at a higher quality and within a reduced time frame. This thesis presents the state-of-the-art research and models that exploit the limitations of perception both to increase visual quality and to reduce workload, a concept we call perception-driven rendering.
This research results in several practical rendering approaches that allow some of the fundamental challenges of computer graphics to be tackled. By using different tracking hardware, display systems, and head-mounted devices, we show the potential of each of the presented systems. The capturing of specific processes of the human visual system can be improved by combining multiple measurements using machine learning techniques. Different sampling, filtering, and reconstruction techniques improve the visual quality of the synthesized images. An in-depth evaluation of the presented systems, including benchmarks, comparative examination with image metrics, and user studies and experiments, demonstrated that the methods introduced are visually superior to or on the same qualitative level as ground truth, whilst having a significantly reduced computational complexity.

Perception-driven Accelerated Rendering (2017)

Weier, Martin ; Stengel, Michael ; Roth, Thorsten ; Didyk, Piotr ; Eisemann, Elmar ; Eisemann, Martin ; Grogorick, Steve ; Hinkenjann, André ; Kruijff, Ernst ; Magnor, Marcus ; Myszkowski, Karol ; Slusallek, Philipp

Advances in computer graphics enable us to create digital images of astonishing complexity and realism. However, processing resources are still a limiting factor. Hence, many costly but desirable aspects of realism are often not accounted for, including global illumination, accurate depth of field and motion blur, spectral effects, etc., especially in real‐time rendering. At the same time, there is a strong trend towards more pixels per display due to larger displays, higher pixel densities or larger fields of view. Further observable trends in current display technology include more bits per pixel (high dynamic range, wider color gamut/fidelity), increasing refresh rates (better motion depiction), and an increasing number of displayed views per pixel (stereo, multi‐view, all the way to holographic or lightfield displays). These developments cause significant unsolved technical challenges due to aspects such as limited compute power and bandwidth. Fortunately, the human visual system has certain limitations, which means that providing the highest possible visual quality is not always necessary. In this report, we present the key research and models that exploit the limitations of perception to tackle visual quality and workload alike. Moreover, we present the open problems and promising future research targeting the question of how we can minimize the effort to compute and display only the necessary pixels while still offering the user a full visual experience.

Object-Based Tree Stump Detection Fusing RGB and Multispectral Data (2023)

Chaturvedi, Pranisha ; Johenneken, Maximilian ; Drak, Ahmad ; Houben, Sebastian ; Asteroth, Alexander

Non-Visual Cues for View Management in Narrow Field of View Augmented Reality Displays (2019)

Marquardt, Alexander ; Trepkowski, Christina ; Eibich, Tom David ; Maiero, Jens ; Kruijff, Ernst

Navigation interfaces for virtual reality and gaming: Theory and practice (2017)

Kruijff, Ernst ; Riecke, Bernhard E.

Multisensory guidance under sensory constraints in augmented reality (2023)

Marquardt, Alexander

This research investigates the efficacy of multisensory cues for locating targets in Augmented Reality (AR). Sensory constraints can impair perception and attention in AR, leading to reduced performance due to factors such as conflicting visual cues or a restricted field of view. To address these limitations, the research proposes head-based multisensory guidance methods that leverage audio-tactile cues to direct users' attention towards target locations. The research findings demonstrate that this approach can effectively reduce the influence of sensory constraints, resulting in improved search performance in AR. Additionally, the thesis discusses the limitations of the proposed methods and provides recommendations for future research.

Mebendazole's Conformational Space and Its Predicted Binding to Human Heat-Shock Protein 90 (2022)

Fiedler, Walter ; Freisleben, Fabian ; Wellbrock, Jasmin ; Kirschner, Karl N.

Low-Cost In-Hand Slippage Detection and Avoidance for Robust Robotic Grasping with Compliant Fingers (2021)

Cervantes, Eduardo ; Schneider, Sven ; Ploeger, Paul G.

Leaning-based interfaces improve ground-based VR locomotion in reach-the-target, follow-the-path, and racing tasks (2023)

Hashemian, Abraham M. ; Adhikari, Ashu ; Kruijff, Ernst ; Heyde, Markus von der ; Riecke, Bernhard E.

Lean into it: Exploring leaning-based motion cueing interfaces for virtual reality movement (2017)

Kitson, Alexandra ; Hashemian, Abraham M. ; Stepanova, Ekaterina R. ; Kruijff, Ernst ; Riecke, Bernhard E.

Introduction to Virtual and Augmented Reality (2022)

Doerner, Ralf ; Broll, Wolfgang ; Jung, Bernhard ; Grimm, Paul ; Göbel, Martin ; Kruse, Rolf

Improving Natural Language Inference in Arabic Using Transformer Models and Linguistically Informed Pre-Training (2024)

Al Deen, Mohammad Majd Saad ; Pielka, Maren ; Hees, Jörn ; Abdou, Bouthaina Soulef ; Sifa, Rafet

Improving Natural Language Inference in Arabic using Transformer Models and Linguistically Informed Pre-Training (2023)

Deen, Mohammad Majd Saad Al ; Pielka, Maren ; Hees, Jörn ; Abdou, Bouthaina Soulef ; Sifa, Rafet

This paper addresses the classification of Arabic text data in the field of Natural Language Processing (NLP), with a particular focus on Natural Language Inference (NLI) and Contradiction Detection (CD). Arabic is considered a resource-poor language, meaning that few data sets are available, which in turn limits the availability of NLP methods. To overcome this limitation, we create a dedicated data set from publicly available resources. Subsequently, transformer-based machine learning models are trained and evaluated. We find that a language-specific model (AraBERT) performs competitively with state-of-the-art multilingual approaches when we apply linguistically informed pre-training methods such as Named Entity Recognition (NER). To our knowledge, this is the first large-scale evaluation for this task in Arabic, as well as the first application of multi-task pre-training in this context.

Hey Human, If your Facial Emotions are Uncertain, You Should Use Bayesian Neural Networks! (2020)

Matin, Maryam ; Valdenegro-Toro, Matias

Facial emotion recognition is the task of classifying human emotions in face images. It is a difficult task due to high aleatoric uncertainty and visual ambiguity. A large part of the literature aims to show progress by increasing accuracy on this task, but this ignores the inherent uncertainty and ambiguity in the task. In this paper we show that Bayesian Neural Networks, as approximated using MC-Dropout, MC-DropConnect, or an Ensemble, are able to model the aleatoric uncertainty in facial emotion recognition and produce output probabilities that are closer to what a human expects. We also show that calibration metrics behave strangely for this task, due to the multiple classes that can be considered correct, which motivates future work. We believe our work will motivate other researchers to move away from Classical and into Bayesian Neural Networks.
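To make the MC-Dropout idea mentioned in this abstract more concrete, here is a minimal sketch that keeps dropout active at prediction time and averages the class probabilities over many stochastic forward passes. It is not the authors' model: the single linear layer, dropout applied to the input features, and the function names (`forward_with_dropout`, `mc_dropout_predict`, `entropy`) are simplifying assumptions for illustration, and any weights passed in are hypothetical.

```python
import math
import random


def softmax(logits):
    # Numerically stable softmax over a list of logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def forward_with_dropout(features, weights, drop_prob, rng):
    # Inverted dropout: zero each feature with probability drop_prob
    # and rescale the surviving features by 1 / (1 - drop_prob).
    keep = 1.0 - drop_prob
    dropped = [x / keep if rng.random() < keep else 0.0 for x in features]
    # One linear layer: one weight row per class (an assumption of this sketch).
    logits = [sum(w * x for w, x in zip(row, dropped)) for row in weights]
    return softmax(logits)


def mc_dropout_predict(features, weights, drop_prob=0.5, passes=100, seed=0):
    # Average the softmax outputs of several stochastic forward passes.
    rng = random.Random(seed)
    mean = [0.0] * len(weights)
    for _ in range(passes):
        probs = forward_with_dropout(features, weights, drop_prob, rng)
        mean = [m + p / passes for m, p in zip(mean, probs)]
    return mean


def entropy(probs):
    # Predictive entropy (in nats) of the averaged distribution.
    return -sum(p * math.log(p) for p in probs if p > 0.0)
```

The averaged distribution can spread probability mass over several plausible classes instead of committing to one, and `entropy` gives a single-number summary of how uncertain the prediction is.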

43 search hits