Fix BigBench multiple-choice crash on mixed-format tasks by Chessing234 · Pull Request #3702 · EleutherAI/lm-evaluation-harness

Chessing234 · 2026-04-13T10:33:01Z

Bug

BigBench multiple-choice tasks crash with ValueError on subtasks that contain a mix of multiple-choice and free-form examples in the same dataset split (e.g. kanji_ascii).

The Jinja template {{multiple_choice_targets.index(targets[0])}} raises ValueError: 'u' is not in list (or similar) when it encounters a row where multiple_choice_targets is an empty list.

Root cause

generate_tasks.py decides whether a subtask qualifies as multiple-choice by checking only the first example. Subtasks like kanji_ascii have multiple-choice examples at the start but free-form examples later (index 188+), where multiple_choice_targets and multiple_choice_scores are both [].

Fix

Add a process_docs filter to both multiple-choice template YAMLs that drops rows with empty multiple_choice_targets before evaluation. This follows the same pattern used by other tasks in the repo (e.g. crows_pairs, bbq).

Three files changed:

New lm_eval/tasks/bigbench/utils.py — one filter function
Modified multiple_choice_template_a_yaml — added process_docs line
Modified multiple_choice_template_b_yaml — added process_docs line

Fixes #3636

Some BigBench subtasks (e.g. kanji_ascii) contain a mix of multiple-choice and free-form examples in the same dataset split. The multiple-choice templates crash with a ValueError when they encounter rows where `multiple_choice_targets` is an empty list. Add a `process_docs` filter that drops rows without multiple-choice targets before evaluation. Fixes EleutherAI#3636 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Chessing234 requested a review from 0xSMT as a code owner April 13, 2026 10:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix BigBench multiple-choice crash on mixed-format tasks#3702

Fix BigBench multiple-choice crash on mixed-format tasks#3702
Chessing234 wants to merge 1 commit intoEleutherAI:mainfrom
Chessing234:fix/bigbench-mc-filter

Chessing234 commented Apr 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Chessing234 commented Apr 13, 2026

Bug

Root cause

Fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant