A Study on the Ambiguity in Human Annotation of German Oral History Interviews for Perceived Emotion Recognition and Sentiment Analysis
- For research in audiovisual interview archives often it is not only of interest what is said but also how. Sentiment analysis and emotion recognition can help capture, categorize and make these different facets searchable. In particular, for oral history archives, such indexing technologies can be of great interest. These technologies can help understand the role of emotions in historical remembering. However, humans often perceive sentiments and emotions ambiguously and subjectively. Moreover, oral history interviews have multi-layered levels of complex, sometimes contradictory, sometimes very subtle facets of emotions. Therefore, the question arises of the chance machines and humans have capturing and assigning these into predefined categories. This paper investigates the ambiguity in human perception of emotions and sentiment in German oral history interviews and the impact on machine learning systems. Our experiments reveal substantial differences in human perception for different emotions. Furthermore, we report from ongoing machine learning experiments with different modalities. We show that the human perceptual ambiguity and other challenges, such as class imbalance and lack of training data, currently limit the opportunities of these technologies for oral history archives. Nonetheless, our work uncovers promising observations and possibilities for further research.
Document Type: | Conference Object |
---|---|
Language: | English |
Author: | Michael Gref, Nike Matthiesen, Sreenivasa Hikkal Venugopala, Shalaka Satheesh, Aswinkumar Vijayananth, Duc Bach Ha, Sven Behnke, Joachim Köhler |
Parent Title (German): | Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), Marseille, 20-25 June 2022 |
First Page: | 2022 |
Last Page: | 2031 |
URN: | urn:nbn:de:hbz:1044-opus-61275 |
URL: | https://aclanthology.org/2022.lrec-1.217/ |
DOI: | https://doi.org/10.48550/arXiv.2201.06868 |
ArXiv Id: | http://arxiv.org/abs/2201.06868 |
Publisher: | European Language Resources Association |
Publishing Institution: | Hochschule Bonn-Rhein-Sieg |
Date of first publication: | 2022/01/18 |
Copyright: | © European Language Resources Association (ELRA), licensed under CC-BY-NC-4.0 |
Funding: | The research project is funded by the German Federal Government Commissioner for Culture and Media. |
Keyword: | ambiguity; annotation; emotion recognition; facial emotion recognition; language; oral history; sentiment analysis; speech emotion recognition |
Dewey Decimal Classification (DDC): | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik |
Entry in this database: | 2022/03/09 |
Licence (German): | Creative Commons - CC BY-NC - Namensnennung - Nicht kommerziell 4.0 International |