**********************************************************************
*
*
*                          Einladung
*
*
*
*                     Informatik-Oberseminar
*
*
*
***********************************************************************

Zeit: Mittwoch, 28 August 2024, 15:30 Uhr

Ort: Raum 9222, E3, Informatikzentrum

Zoom: https://rwth.zoom-x.de/j/66261562709?pwd=2driMW8j0cRJLiYFWNJRqRp4Lya80Z.1
Meeting-ID: 662 6156 2709
Kenncode: 337848


Referent: Yingbo Gao, M.Sc. (Lehrstuhl Informatik 6)

Thema: Language Modeling and Machine Translation: Improvements in Training and Modeling


Abstract: 

Substantial improvements in language modeling and machine translation have been achieved since the wide adoption of artificial neural networks. In this talk, we discuss three directions related to training and modeling in neural language modeling and neural machine translation. First, sampling-based training criteria are investigated in order to speed up the training of neural language models with large vocabularies. Second, label smoothing, input smoothing as well as multi-agent training are studied to improve the generalization of neural machine translation models. Finally, a language modeling approach for machine translation is proposed to simplify the architecture of existing translation models.


Es laden ein: die Dozentinnen und Dozenten der Informatik


-- 
Stephanie Jansen

Faculty of Mathematics, Computer Science and Natural Sciences
Chair of Computer Science 6
ML - Machine Learning and Reasoning
RWTH Aachen University
Theaterstraße 35-39
D-52062 Aachen
Tel: +49 241 80-21601
sek@ml.rwth-aachen.de
www.hltpr.rwth-aachen.de