- Main focus of the Speech Processing R&D Group:
- Automatic Speech Recognition (ASR) on Swiss Languages and Dialects,
- Language Identification (LID),
- Speaker Identification/Verification (SI/SV),
- Emotion Identification & Sentiment Analysis (SER),
- Text to Speech Synthesis (TTS),
- Data Selection and Active Learning,
- Natural Language Processing & Understanding (NLP/NLU),
- Neural Machine Translation (NMT),
- Swiss German Dialects Text Normalization
- Swiss German Automatic Speech Recognition (in collaboration with Microsoft Research Center in Munich)
- DAHL: Domain Adaptation via Hierarchical Lexicons (In collaboration with Idiap Research Institute and École Polytechnique Fédérale de Lausanne (EPFL))
- Building A High German to Swiss German Phonetic Dictionary (In collaboration with the University of Zurich (UZH))
- Supervising and co-Supervising Master Thesis and PhD Projects
- In collaboration with Idiap Research Institute and École Polytechnique Fédérale de Lausanne (EPFL) - End-to-end Speaker Recognition from Speech, Self-attention for speech emotion recognition, End-to-end Language Recognition from Speech using Bottleneck Features, Data Selection for Speech Recognition Training, Domain Adaptation in End-to-End Speech Recognition, Key-word Search/Spotting using Convolutional Neural Networks, Semi-Supervised Data Acquisition and Explorations
- In collaboration with La Haute école spécialisée de Suisse occidentale de Fribourg (HEFR) - Swiss German Normalization with Neural Networks, Swiss German dialects Text Normalization using Neural Network Machine Translation-based Approaches and Word Embeddings
- In collaboration with University of Zurich - Fairness in AI.