How (and How Not) Do Code Complexity Measures Predict Cognitive Load?

Thorgeirsson, Sverrir; Vahrenhold, Jan

Research article in edited proceedings (conference) | Peer reviewed

Abstract

Background and Context. Code complexity measures have been used to guide the design of various activities within computing education, such as instructional sequencing and assessment. However, empirical evidence for the link of these measures to actual cognitive difficulties remains mixed, with studies suffering from small sample sizes and non-controlled experimental design. Objectives. We sought to investigate how code complexity measures predict the cognitive load of university students when tracing code and whether their predictive power is moderated by computer science achievement. We also compared how these measures stacked up against the comparative judgment from a 15-member expert panel. Methods. We conducted a preregistered laboratory study to investigate the strength of code complexity measures identified in a recent neuroimaging study as predictors of cognitive load. In this controlled study, 𝑁= 551 university students traced a random selection of 24 expert-curated code snippets in their preferred language (Java, Python, or C++), and then reported their cognitive load using two validated measures of cognitive load. We assessed preregistered hierarchical regression models with respect to the predictive strength of the code complexity measures and possible moderation. Findings. Contrary to the findings from the recent neuroimaging study, we could not confirm data-flow complexity to be the strongest predictor of measured cognitive load; instead, the simple source lines of code measure dominated the other code complexity measures. In contrast, expert ratings strongly predicted cognitive load. Implications. In the educational context studied, measuring source lines of code is a simple and effective heuristic for ordering tracing tasks by difficulty and outperforms more sophisticated efforts involving data and control flow. The unexpected finding from our exploratory analyses that easy-to-obtain rankings based on pairwise-comparison sessions involving experts have a much stronger predictive power than static and dynamic measures opens up avenues for follow-up research.

Details about the publication

Editors: Brown, Neil C. C.; Searle, Kristin

Book title: Proceedings of the ACM Conference on International Computing Education Research Vol.1 (ICER 2026 Vol. 1) (Volume 1)

Publisher: ACM Press

Place of publication: New York, NY

Status: accepted / in press (not yet published)

Release year: 2026

Language in which the publication is written: English

Conference: ACM Conference on International Computing Education Research, Uppsala, Sweden

ISBN: 979-8-4007-2203-5

DOI: 10.1145/3765964.3811665

Keywords: Code Complexity; Cognitive Load; Lab Study

Authors from the University of Münster

Vahrenhold, Jan

Professur für Praktische Informatik (Prof. Vahrenhold)

How (and How Not) Do Code Complexity Measures Predict Cognitive Load?

Abstract

Details about the publication

Authors from the University of Münster

Operated by

Top-Links