Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

Sicilia, Anthony; Kim, Hyunwoo; Chandu, Khyathi Raghavi; Alikhani, Malihe; Hessel, Jack

Computer Science > Computation and Language

arXiv:2402.03284 (cs)

[Submitted on 5 Feb 2024]

Title:Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

Authors:Anthony Sicilia, Hyunwoo Kim, Khyathi Raghavi Chandu, Malihe Alikhani, Jack Hessel

View PDF

Abstract:Effective interlocutors account for the uncertain goals, beliefs, and emotions of others. But even the best human conversationalist cannot perfectly anticipate the trajectory of a dialogue. How well can language models represent inherent uncertainty in conversations? We propose FortUne Dial, an expansion of the long-standing "conversation forecasting" task: instead of just accuracy, evaluation is conducted with uncertainty-aware metrics, effectively enabling abstention on individual instances. We study two ways in which language models potentially represent outcome uncertainty (internally, using scores and directly, using tokens) and propose fine-tuning strategies to improve calibration of both representations. Experiments on eight difficult negotiation corpora demonstrate that our proposed fine-tuning strategies (a traditional supervision strategy and an off-policy reinforcement learning strategy) can calibrate smaller open-source models to compete with pre-trained models 10x their size.

Comments:	2 Figures; 7 Tables; 27 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2402.03284 [cs.CL]
	(or arXiv:2402.03284v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.03284

Submission history

From: Anthony Sicilia [view email]
[v1] Mon, 5 Feb 2024 18:39:47 UTC (1,877 KB)

Computer Science > Computation and Language

Title:Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Deal, or no deal (or who knows)? Forecasting Uncertainty in Conversations using Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators