Into the Unknown: Self-Learning Large Language Models

Ferdinan, Teddy; Kocoń, Jan; Kazienko, Przemysław

Computer Science > Artificial Intelligence

arXiv:2402.09147 (cs)

[Submitted on 14 Feb 2024]

Title:Into the Unknown: Self-Learning Large Language Models

Authors:Teddy Ferdinan, Jan Kocoń, Przemysław Kazienko

View PDF

Abstract:We address the main problem of self-learning LLM: the question of what to learn. We propose a self-learning LLM framework that enables an LLM to independently learn previously unknown knowledge through self-assessment of their own hallucinations. Using the hallucination score, we introduce a new concept of Points in The Unknown (PiUs), along with one extrinsic and three intrinsic methods for automatic PiUs identification. It facilitates the creation of a self-learning loop that focuses exclusively on the knowledge gap in Points in The Unknown, resulting in a reduced hallucination score. We also developed evaluation metrics for gauging an LLM's self-learning capability. Our experiments revealed that 7B-Mistral models that have been finetuned or aligned are capable of self-learning considerably well. Our self-learning concept allows more efficient LLM updates and opens new perspectives for knowledge exchange. It may also increase public trust in AI.

Comments:	14 pages, 13 figures, to be submitted to ACL 2024
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.09147 [cs.AI]
	(or arXiv:2402.09147v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2402.09147

Submission history

From: Teddy Ferdinan [view email]
[v1] Wed, 14 Feb 2024 12:56:58 UTC (11,106 KB)

Computer Science > Artificial Intelligence

Title:Into the Unknown: Self-Learning Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Into the Unknown: Self-Learning Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators