TY - JOUR
T1 - Results and Insights from Diagnostic Questions
T2 - Demonstration and Competition Track at the 34th Annual Conference on Neural Information Processing Systems, NeurIPS 2020
AU - Wang, Zichao
AU - Lamb, Angus
AU - Saveliev, Evgeny
AU - Cameron, Pashmina
AU - Zaykov, Yordan
AU - Hernández-Lobato, José Miguel
AU - Turner, Richard E.
AU - Baraniuk, Richard G.
AU - Barton, Craig
AU - Jones, Simon Peyton
AU - Woodhead, Simon
AU - Zhang, Cheng
N1 - Funding Information:
We thank the Codalab team for their technical support throughout the competition and all competition participants who contributed. ZW and RGB are supported by NSF grants 1842378 and 1937134 and by ONR grant N0014-20-1-2534.
Publisher Copyright:
© 2021 Z. Wang et al.
PY - 2020
Y1 - 2020
N2 - This competition concerns educational diagnostic questions, which are pedagogically effective, multiple-choice questions (MCQs) whose distractors embody misconceptions. With a large and ever-increasing number of such questions, it becomes overwhelming for teachers to know which questions are the best ones to use for their students. We thus seek to answer the following question: how can we use data on hundreds of millions of answers to MCQs to drive automatic personalized learning in large-scale learning scenarios where manual personalization is infeasible? Success in using MCQ data at scale helps build more intelligent, personalized learning platforms that ultimately improve the quality of education en masse. To this end, we introduce a new, large-scale, real-world dataset and formulate 4 data mining tasks on MCQs that mimic real learning scenarios and target various aspects of the above question in a competition setting at NeurIPS 2020. We report on our NeurIPS competition in which nearly 400 teams submitted approximately 4000 submissions, with encouragingly diverse and effective approaches to each of our tasks.
KW - Active learning
KW - Diagnostic questions
KW - Matrix completion
KW - Missing value prediction
KW - Personalized education
KW - Question analytics
KW - Unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85162617842&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85162617842&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85162617842
SN - 2640-3498
VL - 133
SP - 191
EP - 205
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 6 December 2020 through 12 December 2020
ER -