Fast Inference Methods in Natural Language Processing

November 5, 2025

1:15 pm

Campus de Beaulieu, Amphi P, building 12D

Talk by Caio Corro, lecturer and researcher at INSA Rennes, affiliated with the IRISA laboratory and member of the LinkMedia team, as part of the Computer Science department seminar series.

Abstract:
Modern natural language processing models are based on very large neural networks, so inference is usually slow, even on modern GPUs.

In this talk, I will give a brief overview of several research topics that aim to leverage the architecture of modern GPUs to develop fast inference methods.
I will then briefly present two of my recent works on the topic.

References:

  • Bregman Conditional Random Fields: Sequence Labeling with Parallelizable Inference Algorithms (Caio Corro, Mathieu Lacroix, Joseph Le Roux) https://arxiv.org/abs/2506.00732
  • KAD: A Framework for Proxy-based Test-time Alignment with Knapsack Approximation Deferral (Ayoub Hammal, Pierre Zweigenbaum, Caio Corro)


Topic(s)
Education, Research - Knowledge transfer
Contact
Martin Quinson (martinc.quinson@ens-rennes.fr)

Updated on November 4, 2025