A recent arXiv paper, "On the State of the Art of Evaluation in Neural Language Models", by Gábor Melis, Chris Dyer, and Phil Blunsom.

Gábor joins us on the podcast to talk about this work. The paper presents a thorough comparison of vanilla LSTMs and recurrent highway networks on language modeling, showing that when both architectures are given the same hyperparameter tuning budget, LSTMs come out ahead, contrary to prior work claiming that recurrent highway networks perform better. We discuss hyperparameter tuning, training variance, language model evaluation, and other related issues.
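To make the methodological point concrete, here is a toy sketch of an equal-budget hyperparameter comparison. The random-search tuner, the search space, and the `train_and_evaluate` placeholder are illustrative assumptions for this sketch, not the tuner or setup actually used in the paper.

```python
import math
import random

# Illustrative search space; the actual ranges in the paper differ.
SEARCH_SPACE = {
    "learning_rate": (1e-4, 1e-2),      # sampled log-uniformly
    "dropout": (0.0, 0.7),              # sampled uniformly
    "hidden_size": [200, 650, 1500],    # sampled from a discrete set
}

def sample_config(rng):
    """Draw one hyperparameter configuration from the search space."""
    lr_lo, lr_hi = SEARCH_SPACE["learning_rate"]
    return {
        "learning_rate": 10 ** rng.uniform(math.log10(lr_lo), math.log10(lr_hi)),
        "dropout": rng.uniform(*SEARCH_SPACE["dropout"]),
        "hidden_size": rng.choice(SEARCH_SPACE["hidden_size"]),
    }

def train_and_evaluate(architecture, config):
    """Hypothetical placeholder: in a real experiment this would train the given
    architecture with `config` and return validation perplexity (lower is better).
    Here it returns a fake deterministic score so the sketch runs end to end."""
    key = (architecture, tuple(sorted(config.items())))
    return random.Random(hash(key)).uniform(55.0, 90.0)

def tune(architecture, budget, seed=0):
    """Random search with a fixed trial budget; returns the best score found."""
    rng = random.Random(seed)
    return min(train_and_evaluate(architecture, sample_config(rng))
               for _ in range(budget))

# The comparison is only fair if every architecture gets the same budget.
BUDGET = 60
results = {arch: tune(arch, BUDGET) for arch in ("lstm", "rhn")}
print(results)
```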

https://www.semanticscholar.org/paper/On-the-State-of-the-Art-of-Evaluation-in-Neural-La-Melis-Dyer/2397ce306e5d7f3d0492276e357fb1833536b5d8

Books Referenced