**********************************************************************
*
*
* Einladung
*
*
*
* Informatik-Oberseminar
*
*
*
***********************************************************************
Zeit: Mittwoch, 28 August 2024, 15:30 Uhr
Ort: Raum 9222, E3, Informatikzentrum
Zoom:
https://rwth.zoom-x.de/j/66261562709?pwd=2driMW8j0cRJLiYFWNJRqRp4Lya80Z.1
Meeting-ID: 662 6156 2709
Kenncode: 337848
Referent: Yingbo Gao, M.Sc. (Lehrstuhl Informatik 6)
Thema: Language Modeling and Machine Translation: Improvements
in Training and Modeling
Abstract:
Substantial improvements in language modeling and machine
translation have been achieved since the wide adoption of
artificial neural networks. In this talk, we discuss three
directions related to training and modeling in neural language
modeling and neural machine translation. First, sampling-based
training criteria are investigated in order to speed up the
training of neural language models with large vocabularies.
Second, label smoothing, input smoothing as well as multi-agent
training are studied to improve the generalization of neural
machine translation models. Finally, a language modeling approach
for machine translation is proposed to simplify the architecture
of existing translation models.
Es laden ein: die Dozentinnen und Dozenten der Informatik
-- Stephanie Jansen Faculty of Mathematics, Computer Science and Natural Sciences Chair of Computer Science 6 ML - Machine Learning and Reasoning RWTH Aachen University Theaterstraße 35-39 D-52062 Aachen Tel: +49 241 80-21601 sek@ml.rwth-aachen.de www.hltpr.rwth-aachen.de