The performance of Large Language Models (LLMs) on multiple-choice question (MCQ) benchmarks is frequently cited as proof of their medical capabilities. We hypothesized that LLM performance on medical ...
Multiple choice questions are often frowned on as an assessment tool in higher education. But when well constructed, they offer a clear and transparent way of evaluating student progress, as Anthony ...