Dr. Timo Baumann

Researcher and instructor in speech processing, in particular incremental processing for responsive spoken dialogue interaction, at Universität Hamburg.
  • mail: baumannÄŦinformatik.uni-hamburg.de
  • skype: timobaumann
  • tel: +49 40 428832360 → my phone is currently disconnected while my office is being renovated. Please send me an e-mail instead.
  • office: F-406, Stellingen
  • consultation hours: please make an appointment by e-mail

Recent News

  • 2016-09: Arne's and my paper on incremental and predictive parsing for language modelling has been accepted at COLING. Yay! We show how incremental parsing can improve LM perplexity on the Billion Word Corpus by ~6% with little computational overhead while still allowing online decoding.
  • 2016-08: I'm giving a talk at WikiCon 2016 about the Spoken Wikipedia Corpus.
  • 2016-07: Marcel Rohde's Bachelor work on navigating the Spoken Wikipedia resulted in a paper at SLPAT. Yay!
  • 2016-05: Kolja Kirsch's Bachelor work not only received the 1st prize at the Informatik EXPO 2015, but was also accepted as a long paper at Mensch und Computer 2016, Germany's most important HCI conference. Congratulations! (There's also a YouTube video.)
  • 2016-04: Christine and I will host a Girls' Day group next week on Spoken Dialogue Systems.
  • 2016-03: I'll be at LREC in May presenting Florian's, Arne's and the Spoken Wikipedia Corpus.
  • 2016-02: I'm teaching the MSc course on speech processing and co-teaching a project on subtitling this summer.
  • 2015-11: Join us at the Science Fair in Hamburg!
  • 2015-11: Casey's, Julian's, David's and my paper that compares incremental performance of ASR systems has been accepted at IWSDS. Yay!
  • 2015-10: I'll be teaching an ESSLLI 2016 introductory course on Incremental Speech and Language Processing for Interactive Systems together with Arne Köhn.
  • older news

Research Interests

Why do we speak how we speak when we just speak to speak? And how can we model systems to do the same?

A (slightly longer) research statement.

Professional Activities

  • I maintain InproTK, the incremental dialogue processing toolkit, which is being used in multiple research labs around the world for building incremental spoken dialogue systems.
  • Together with David Schlangen, I gave an Interspeech tutorial on incremental processing in 2013 which was a great success. Slides can be found here. Exercises from a more recent introduction to InproTK are on Sourceforge.
  • I've organized the Workshop on Architectures for Conversationally Competent Spoken Dialogue Systems 2012 in Hamburg, Germany. The purpose of the workshop was to bring young researchers in the field together to talk about upcoming challenges in developing highly interactive and natural dialogue systems/virtual agents/conversational systems.
  • I've co-organized the Young Researchers' Roundtable on Spoken Dialogue Systems 2012 in Seoul, Korea.
  • I've been reviewing for LREC 2010-2016, IJCNLP 2011, 2015, ACL 2015, AAAI 2012, SigDial 2013-2015, SemDial 2014, EACL SRW 2014, BEA 2014-2015, ICMI 2013-2015, Interspeech 2015, HRI 2015, Computer Speech and Language, LREV, and co-reviewing for SigDial 2012.

Teaching

  • Next winter term 2016
    • Logic Programming Exercises
    • Practical Course ("Praktikum") on Using Speech Processing Software (BSc)
  • Summer term 2016

Previous Teaching

Student Supervision

Currently, I am supervising Natalia Orlova's MSc thesis on the combination of multiple incremental speech recognizers and Marcel Rohde's BSc thesis on a Spoken Wikipedia Browser.

I have previously co-supervised:
  • Valentin Strauss' BSc thesis on incremental post-processing of Google's ASR results
  • Florian Stegen's BSc thesis on long audio alignment for the Spoken Wikipedia
  • Jonathan Werner's BSc thesis on keyword spotting in lecture transcriptions
  • Sven Zimmer's BSc thesis on build tools for scientific software development
  • Anne Rubruck's MSc thesis on decomposing semantic annotations into lexical semantics
  • Kolja Kirsch's BSc thesis on semi-automatic page-turning for piano sheet music
  • Jiyan Jonsdotter's BSc thesis on applying incremental spoken output to navigation systems
  • Engelke Eschner's diploma thesis on NLG for transit schedules
  • Sören Nykamp's BSc thesis on incremental processing in interactive storytelling
  • Johannes Twiefel's MSc thesis on improving Google's ASR using phonetic post-processing techniques given domain knowledge
  • Anita Eisenhaber's BSc thesis on sentiment analysis in social media statements (tweets, etc.)
  • Svenja Neef's BSc thesis on analyzing the incremental properties of Android's ASR and integrating it with InproTK
  • Ole Eichhorn's BSc thesis on incremental speech synthesis integration into the VAVETaM system
  • Rabih Hamadeh's MSc thesis on optimizing incremental ASR hypotheses

I actively encourage students to go abroad! I successfully talked Maike Paetzel into visiting ICT as a summer intern in 2013 to work with David DeVault (which resulted in a publication at LREC 2014), and Arne Köhn visited ICT as a summer intern in 2014 to work with Kenji Sagae on incremental parsing. Sven Mutzl approached me regarding an internship opportunity in Shanghai, which I helped to set up in collaboration with Kai Yu at Shanghai Technical University. Siva Meenakshi Renganathan from Anna University (Chennai) visited our lab as a summer intern in 2014 to work on exploiting Spoken Wikipedia data for speech research.

Under construction: Timo's assessment criteria for theses (and, to a lesser degree, seminar papers). See also: Timo's advice on how to write a good thesis.