
Workflow for Automatic i-Vector based Speaker Identification on German Parliament Speakers

  • To help journalists investigate large audiovisual archives, such as those maintained by news broadcast agencies, the multimedia data must be indexed by text-based search engines. Creating a transcript through automatic speech recognition (ASR) makes the spoken word accessible to text search and enables keyword queries. Still, important contextual information such as the identity of the speaker is not captured. Especially when gathering original footage in the political domain, the identity of the speaker can be the most important query constraint, although the name may not be prominent in the words spoken. It is thus desirable to provide this information explicitly to the search engine. To do so, the archive must be analyzed by automatic speaker identification (SID). While this research topic has seen substantial gains in accuracy and robustness over the last years, it has not yet established itself as a helpful, large-scale tool outside the research community. This thesis sets out to establish a workflow for automatic speaker identification. Its application is to help journalists search speeches given in the German parliament (Bundestag). This is a contribution to the News-Stream 3.0 project, a BMBF-funded research project that addresses the accessibility of various data sources for journalists.

Metadata
Document Type:Master's Thesis
Language:English
Author:Gunnar Åkermark
Number of pages:VI, 53
URL:https://nbn-resolving.org/urn:nbn:de:0011-n-4423808
DOI:https://doi.org/10.24406/publica-fhg-281487
Referee:Gerhard K. Kraetzschmar, Paul Plöger, Daniel Stein
Publisher:Fraunhofer Publica
Granting Institution:Hochschule Bonn-Rhein-Sieg, Fachbereich Informatik
Contributing Corporation:Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme
Date of first publication:2017/04/27
Note:
Project data: Bundesministerium für Bildung und Forschung BMBF
01IS14003; News-Stream 3.0
Real-time analysis and evaluation of heterogeneous news streams using big data technologies
Keyword:Alize; LDA; PLDA; Speaker identification; i-vectors
Dewey Decimal Classification (DDC):0 Computer science, information and general works / 00 Computer science, knowledge and systems / 004 Data processing and computer science
Theses, student research papers:Hochschule Bonn-Rhein-Sieg / Fachbereich Informatik
Entry in this database:2017/06/14