Code Soliloquies for Accurate Calculations in Large Language Models

Shashank Sonkar, Xinghe Chen, Myco Le, Naiming Liu, Debshila Basu Mallick, Richard Baraniuk

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

High-quality conversational datasets are crucial for the successful development of Intelligent Tutoring Systems (ITS) that utilize a Large Language Model (LLM) backend. Synthetic student-teacher dialogues, generated using advanced GPT-4 models, are a common strategy for creating these datasets. However, subjects like physics that entail complex calculations pose a challenge. While GPT-4 presents impressive language processing capabilities, its limitations in fundamental mathematical reasoning curtail its efficacy for such subjects. To tackle this limitation, we introduce in this paper an innovative stateful prompt design. Our design orchestrates a mock conversation where both student and tutorbot roles are simulated by GPT-4. Each student response triggers an internal monologue, or 'code soliloquy' in the GPT-tutorbot, which assesses whether its subsequent response would necessitate calculations. If a calculation is deemed necessary, it scripts the relevant Python code and uses the Python output to construct a response to the student. Our approach notably enhances the quality of synthetic conversation datasets, especially for subjects that are calculation-intensive. The preliminary Subject Matter Expert evaluations reveal that our Higgs model, a fine-tuned LLaMA model, effectively uses Python for computations, which significantly enhances the accuracy and computational reliability of Higgs' responses.

Original languageEnglish (US)
Title of host publicationLAK 2024 Conference Proceedings - 14th International Conference on Learning Analytics and Knowledge
PublisherAssociation for Computing Machinery
Pages828-835
Number of pages8
ISBN (Electronic)9798400716188
DOIs
StatePublished - Mar 18 2024
Event14th International Conference on Learning Analytics and Knowledge, LAK 2024 - Kyoto, Japan
Duration: Mar 18 2024Mar 22 2024

Publication series

NameACM International Conference Proceeding Series

Conference

Conference14th International Conference on Learning Analytics and Knowledge, LAK 2024
Country/TerritoryJapan
CityKyoto
Period3/18/243/22/24

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

Fingerprint

Dive into the research topics of 'Code Soliloquies for Accurate Calculations in Large Language Models'. Together they form a unique fingerprint.

Cite this