**********************************************************************
*                                                                    *
*                             Invitation                             *
*                                                                    *
*                       Informatik-Oberseminar                       *
*                                                                    *
**********************************************************************
Time: Wednesday, 28 August 2024, 15:30
Location: Room 9222, E3, Informatikzentrum
Zoom: https://rwth.zoom-x.de/j/66261562709?pwd=2driMW8j0cRJLiYFWNJRqRp4Lya80Z.1
Meeting ID: 662 6156 2709
Passcode: 337848
Speaker: Yingbo Gao, M.Sc. (Lehrstuhl Informatik 6)
Topic: Language Modeling and Machine Translation: Improvements in Training and Modeling
Abstract:
Substantial improvements in language modeling and machine translation have been achieved since the wide adoption of artificial neural networks. In this talk, we discuss three directions related to training and modeling in neural language modeling and neural machine translation. First, sampling-based training criteria are investigated in order to speed up the training of neural language models with large vocabularies. Second, label smoothing, input smoothing as well as multi-agent training are studied to improve the generalization of neural machine translation models. Finally, a language modeling approach for machine translation is proposed to simplify the architecture of existing translation models.
Invited by: the lecturers of the Computer Science department
informatik-vortraege@lists.rwth-aachen.de