Machine Learning at the Flatiron Institute Seminar: Clément Hongler

Date & Time


Location


Title: Arrows of Time for Large Language Models

Abstract: Large Language Models famously predict the next token in a text. What happens if we teach them to predict the previous one instead? It turns out that some subtle differences emerge. I will discuss some empirical and theoretical results about this, and also some (hopefully exciting) consequences and perspectives suggested by our results.

About the Speaker

Clément Hongler has worked on statistical mechanics, quantum field theory, deep learning theory, and a few other things. He enjoys talking with people from various horizons.
