How Beginning Programmers and Code LLMs (Mis)read Each Other

Authors

Arjun Guha (Northeastern University and Roblox), Sydney Nguyen (Wellesley College), Hannah McLean Babe (Oberlin College), Yangtian Zi (Northeastern University), Carolyn Jane Anderson (Wellesley College), Molly Q Feldman (Oberlin College)

Venue

CHI 2024

Abstract

Generative AI models, specifically large language models (LLMs), have made strides towards the long-standing goal of text-to-code generation. This progress has invited numerous studies of user interaction. However, less is known about the struggles and strategies of non-experts, for whom each step of the text-to-code problem presents challenges: describing their intent in natural language, evaluating the correctness of generated code, and editing prompts when the generated code is incorrect. This paper presents a large-scale controlled study of how 120 beginning coders across three academic institutions approach writing and editing prompts. A novel experimental design allows us to target specific steps in the text-to-code process and reveals that beginners struggle with writing and editing prompts, even for problems at their skill level and when correctness is automatically determined. Our mixed-methods evaluation provides insight into student processes and perceptions with key implications for non-expert Code LLM use within and outside of education.
