American Statistical Association
New York City
Metropolitan Area Chapter



ASA Text Analysis Section
NYC Metro Area ASA Chapter
Department of Statistics at Columbia University


Are Pleased to Invite You to a Jointly Hosted In-Person Workshop

STATISTICS AND LARGE LANGUAGE MODELS



Date & Time
Monday, July 24, 2023
1:00 - 5:00 P.M.

Workshop Schedule

Location
Columbia University School of Social Work
1255 Amsterdam Avenue
(corner of Amsterdam and 121st Street)
New York, New York 10027

The event was not recorded.

For Questions
Contact the NYC Metro Area ASA Chapter



STATISTICS AND LARGE LANGUAGE MODELS

WORKSHOP SCHEDULE

MONDAY, JULY 24, 2023
1:00 - 1:45 PM

Language Models: From N-Grams to Transformers to Chatbots

Presentation Slides
https://www.nycasa.org/ASA-LLM-Workshop-Jul-24-2023.pdf

by

Bob Carpenter
Flatiron Institute

Abstract

Using standard statistical notation and nomenclature, I'll present the basic principles of language modeling, starting from the Markovian n-gram language models of Shannon (1948) and connections to compression. Then I will fast forward 70 years and present the modern transformer architecture introduced by Google (2017). I'll conclude with a discussion of the scale at which OpenAI has been able to fit foundational large language models and apply reinforcement learning with human feedback fine tuning to align them with the goals of being helpful, truthful, and harmless.
1:45 - 2:30 PM

Large Language Models Beyond Language

by

Sachit Menon
Department of Computer Science
Columbia University

Abstract

The impact of large language models (LLMs) on any tasks involving the creation of text in response to text has become clear in recent months. What has gone less noticed is the potential impact on areas that are not directly language related, such as computer vision and robotics. I will discuss new paradigms of using LLMs as reasoners, especially via program generation, that stand to bring substantial change to fields that are seemingly unrelated to the natural-language origins of LLMs.
2:30 - 3:00 PM

Coffee Break
3:00 - 3:45 PM

Evaluating the (Moral) Beliefs Encoded in LLMs

by

Claudia Shi
Department of Computer Science
Columbia University

Abstract

This talk focuses on large-scale automatic evaluations of LLMs, covering design principles, evaluation protocols, and a case study analyzing the moral beliefs of 28 LLMs.
3:45 - 4:30 PM

The Evolving World of Natural Language Processing in Healthcare and Life Sciences

Presentation Slides
https://www.nycasa.org/ASA-LLM-Workshop-Jul-24-2023-IQVIA-Slides.pdf

by

Marjan Kamyab
IQVIA NLP
and
Kaitlyn Whyte
IQVIA NLP

Abstract

In this presentation from IQVIA, our speakers will cover the trends in NLP, and how this is impacting the way in which they are leading and delivering complex NLP projects across the healthcare and life sciences industries. As part of the Linguamatics team, a company acquired by IQVIA in 2019, the team has been building and delivering NLP solutions to these markets for over 20 years, and so has a breadth and depth of experience in applying the different techniques that have emerged over that time in the field. This talk will focus on choosing the right task for the job at hand, taking into account three key axes of decision making - cost, convenience and accuracy. This talk will cover the use of different techniques, from statistical and LLM approaches to linguistic and rules-based solutions, including presenting real world case studies from academia and industry.
4:30 - 5:00 PM

Group Discussion

Moderator

David Banks
Duke University

Top of Page


Home Page | Chapter News | Chapter Officers | Chapter Events | Other Events
NYC ASA Chapter Constitution | NYC ASA Chapter By-Laws
ASA National Home Page | ASA Code of Conduct

Contact the Chapter
This page last modified on November 5, 2023.
Copyright © 1998-2023 by New York City Metro Area Chapter of the ASA