CENIA «Large Language Models Usage and Evaluation Patterns» – Instituto Milenio Fundamentos de los Datos

ABSTRACT

Large Language Models (LLMs) have changed the way computers understand and use human language. They’re used in many different areas, and in this talk we’ll look at how people use and evaluate them. We’ll start by looking at the different ways people use LLMs. First, when they are used as general assistants for tasks like writing, summarizing, coding, etc. Then, when they are adapted to address more domain-specific tasks using two approaches: 1) retrieval-assisted generation, and 2) fine-tuning. We’ll also see how LLMs are integrated into software applications, such as when they are invoked by computer code (API calls) or used by autonomous agents to make decisions on their own. On the evaluation side, we’ll talk about a method called MTBench, a multi-turn question set, and Chatbot Arena, a crowdsourced battle platform between LLMs.

EXPOSITOR

Felipe Bravo, Investigador Asociado Cenia. Profesor Asistente DCC, Universidad de Chile.

CUÁNDO Y DONDE

Este evento se llevará a cabo el miércoles 13 diciembre, de manera híbrida a las 16:00 hrs.

Presencialmente, será en el Auditorio Ramón Picarte, DCC, Universidad de Chile. 3er piso, Edificio Norte, Beauchef 851.
Virtualmente, vía ZOOM, el link será compartido por correo y slack ese día.

INSCRIPCIONES:

https://docs.google.com/forms/d/e/1FAIpQLSev6QO4JteuxkLduyxLP6slYYIJ92G55DKaFEFfZ-XD5aENYw/viewform