Understanding the Role of Temperature in Diverse Question Generation by GPT-4

ABSTRACT
We conduct a preliminary study of the effect of GPT-4's temperature parameter on the diversity of GPT-4-generated questions. We find that using higher temperature values leads to significantly higher diversity, with different temperatures exposing different types of similarity between generated sets of questions. We also demonstrate that diverse question generation is especially difficult for questions targeting lower levels of Bloom's Taxonomy.
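The temperature parameter rescales the model's next-token distribution before sampling: dividing the logits by the temperature flattens the distribution for values above 1 (more diverse outputs) and sharpens it for values below 1 (more repetitive outputs). The following is a minimal illustrative sketch of temperature-scaled sampling, not code from the paper; the helper name and logit values are hypothetical.

```python
import math
import random

def sample_with_temperature(logits, temperature):
    """Sample a token index from raw logits after temperature scaling.

    Higher temperature flattens the softmax distribution (more diversity);
    lower temperature concentrates mass on the argmax (less diversity).
    """
    scaled = [l / temperature for l in logits]
    # Numerically stable softmax: subtract the max before exponentiating.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Inverse-CDF sampling from the categorical distribution.
    r = random.random()
    acc = 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1
```

At a very low temperature (e.g. 0.01), `sample_with_temperature` effectively returns the argmax every time; at higher temperatures, lower-probability tokens are sampled more often, which is the mechanism behind the diversity effect the study measures.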
REFERENCES
- Benjamin S Bloom, Max D Engelhart, Edward J Furst, Walker H Hill, David R Krathwohl, et al. 1956. Taxonomy of educational objectives, handbook I: the cognitive domain. New York: David McKay Co.
- Paul Denny, Hassan Khosravi, Arto Hellas, Juho Leinonen, and Sami Sarsa. 2023. Can We Trust AI-Generated Educational Content? Comparative Analysis of Human and AI-Generated Learning Resources. arXiv:2306.10509 [cs.HC]
- Jacob Doughty, Zipiao Wan, Anishka Bompelli, Jubahed Qayum, Taozhi Wang, Juran Zhang, Yujia Zheng, Aidan Doyle, Pragnya Sridhar, Arav Agarwal, Christopher Bogart, Eric Keylor, Can Kultur, Jaromir Savelka, and Majd Sakr. 2024. A Comparative Study of AI-Generated (GPT-4) and Human-crafted MCQs in Programming Education. In Proceedings of the 26th Australasian Computing Education Conference (ACE '24). Association for Computing Machinery, New York, NY, USA, 114--123. https://doi.org/10.1145/3636243.3636256
- J. Richard Landis and Gary G. Koch. 1977. The Measurement of Observer Agreement for Categorical Data. Biometrics 33, 1 (1977), 159--174. http://www.jstor.org/stable/2529310
- Stephen MacNeil, Andrew Tran, Arto Hellas, Joanne Kim, Sami Sarsa, Paul Denny, Seth Bernstein, and Juho Leinonen. 2023. Experiences from Using Code Explanations Generated by Large Language Models in a Web Software Development E-Book. In Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 1 (Toronto ON, Canada) (SIGCSE 2023). Association for Computing Machinery, New York, NY, USA, 931--937. https://doi.org/10.1145/3545945.3569785
- Pranjal Dilip Naringrekar, Ildar Akhmetov, and Eleni Stroulia. 2023. Generating CS1 Coding Questions Using OpenAI. In Proceedings of the 25th Western Canadian Conference on Computing Education (Vancouver, BC, Canada) (WCCCE '23). Association for Computing Machinery, New York, NY, USA, Article 11, 2 pages. https://doi.org/10.1145/3593342.3593348