APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning

Sun, Jiashuo; Zhang, Hang; Lin, Chen; Su, Xiangdong; Gong, Yeyun; Guo, Jian


arXiv:2212.07249 (cs)

[Submitted on 14 Dec 2022 (v1), last revised 12 Mar 2024 (this version, v3)]



Abstract: Long-form numerical reasoning in financial analysis aims to generate a reasoning program that calculates the correct answer to a given question. Previous work followed a retriever-generator framework, in which the retriever selects key facts from a long-form document and the generator produces a reasoning program from the retrieved facts. However, these approaches treated all facts equally, without considering the different contributions of facts with and without numbers. Meanwhile, program consistency was ignored under supervised training, resulting in lower training accuracy and diversity. To solve these problems, we propose APOLLO to improve the long-form numerical reasoning framework. For the retriever, we adopt a number-aware negative sampling strategy that makes the retriever more discriminative on key numerical facts. For the generator, we design consistency-based reinforcement learning and a target program augmentation strategy, both based on the consistency of program execution results. Experimental results on the FinQA and ConvFinQA leaderboards verify the effectiveness of our proposed method, which achieves a new state of the art.
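
The two ideas named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify how negatives are sampled or how consistency is scored, so the weighting scheme (`number_weight`), the tuple-based program format, the helper names, and the sample figures below are all assumptions made for illustration. The sketch shows (1) negative sampling that favours non-gold facts containing digits and (2) a reward that treats two programs as consistent when they execute to the same result.

```python
import random
import re

# Assumption: a fact counts as "numerical" if it contains at least one digit.
NUMBER_RE = re.compile(r"\d")


def has_number(fact):
    return bool(NUMBER_RE.search(fact))


def sample_negatives(facts, gold_indices, k, number_weight=3.0, rng=None):
    """Sample up to k non-gold facts without replacement.

    Facts that contain numbers are number_weight times more likely to be
    drawn (a hypothetical weighting, not taken from the paper).
    """
    rng = rng or random.Random(0)
    pool = [i for i in range(len(facts)) if i not in gold_indices]
    chosen = []
    for _ in range(min(k, len(pool))):
        weights = [number_weight if has_number(facts[i]) else 1.0 for i in pool]
        pick = rng.choices(pool, weights=weights, k=1)[0]
        chosen.append(pick)
        pool.remove(pick)
    return chosen


# Hypothetical program format: a list of (op, arg1, arg2) steps, where an
# argument of the form "#n" refers to the result of step n.
OPS = {
    "add": lambda a, b: a + b,
    "subtract": lambda a, b: a - b,
    "multiply": lambda a, b: a * b,
    "divide": lambda a, b: a / b,
}


def execute(program):
    """Run every step in order and return the result of the last one."""
    results = []
    for op, a, b in program:
        args = [
            results[int(x[1:])] if isinstance(x, str) and x.startswith("#") else float(x)
            for x in (a, b)
        ]
        results.append(OPS[op](*args))
    return results[-1]


def consistency_reward(predicted, gold, tol=1e-6):
    """Return 1.0 if both programs execute to the same value, else 0.0."""
    try:
        return 1.0 if abs(execute(predicted) - execute(gold)) <= tol else 0.0
    except (KeyError, IndexError, ValueError, ZeroDivisionError):
        # Unknown op, bad step reference, malformed step, or division by zero.
        return 0.0


# Made-up example data, for illustration only.
facts = [
    "Revenue was 5829 in 2019.",
    "The company operates globally.",
    "Revenue was 5735 in 2018.",
    "Management expects continued growth.",
    "Operating costs were 1200 in 2019.",
]
negatives = sample_negatives(facts, gold_indices={0, 2}, k=2)

gold = [("subtract", 5829, 5735), ("divide", "#0", 5735)]
variant = [("divide", 5829, 5735), ("subtract", "#0", 1)]
wrong = [("add", 5829, 5735)]
```

Here `variant` is written differently from `gold` but computes the same relative change, so `consistency_reward(variant, gold)` treats it as consistent, while `wrong` is not. Under these assumptions, a program with a different surface form but the same execution result could serve as an additional training target.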

Comments:	Accepted by COLING 2024
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2212.07249 [cs.CL]
	(or arXiv:2212.07249v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.07249

Submission history

From: Gasol Sun
[v1] Wed, 14 Dec 2022 14:34:15 UTC (3,063 KB)
[v2] Mon, 13 Feb 2023 16:06:43 UTC (9,853 KB)
[v3] Tue, 12 Mar 2024 13:30:16 UTC (9,859 KB)
