Round 42

Student Grouping Policy for Grades 6-8

A mid-sized school district is redesigning how students are grouped for core academic subjects (math, English, science) across grades 6-8. Recent data show a widening achievement gap between high- and low-performing students, along with complaints from parents on both ends: some want more rigorous tracks for advanced learners, others worry that grouping by ability entrenches inequities along socioeconomic and racial lines. The district has moderate funding for teacher training and technology, but not enough to do everything at once. Teachers are already stretched thin and vary widely in their comfort with differentiated instruction. The school board wants a single coherent policy to roll out district-wide next year, balancing academic rigor, equity, teacher workload, and logistical feasibility (scheduling, staffing, data systems). Whatever is chosen will need to be defensible to parents, sustainable for staff, and measurable in its effect on both high and low achievers.

Dashboard

Status

DECIDED

Winner

D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction.

Majority

D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction.

Duration

7389 ms

Topic

Options

D Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction.
2.00 of 3 PPV winner
B Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups.
1.00 of 3
A Group students by demonstrated skill level rather than grade level for core subjects, mixing ages so a strong 6th grader might join 7th graders in math while receiving grade-level instruction elsewhere.
0.00 of 3
C Shift to personalized, software-driven learning where each student follows an individualized adaptive pathway at their own pace, with teachers acting mainly as facilitators.
0.00 of 3
E Adopt full ability tracking, placing students into distinct leveled classes (e.g., advanced, on-level, support) for each core subject based on prior performance.
0.00 of 3
F Keep all students in heterogeneous, mixed-ability classrooms and invest heavily in training teachers to differentiate instruction within a single classroom.
0.00 of 3

Checks

Baselines

PPV no gold answer
Majority no gold answer
Comparison PPV matched majority
k samples8
Agents3
Parsed samples 24 / 24
Quality split agent disagreement

PPV

Top Influences

Claude Sonnet 5 · delegate 1.000
GPT 5.4 mini · delegate 1.000
Gemini 3 Flash Preview · delegate 1.000

Samples

Answer Distribution

Claude Sonnet 5 D: 8
GPT 5.4 mini B: 8
Gemini 3 Flash Preview D: 8

PPV

Influences

Label Role Score
Claude Sonnet 5 delegate 1.000
GPT 5.4 mini delegate 1.000
Gemini 3 Flash Preview delegate 1.000

Voting matrix

Constructed Delegation Matrix

To / From Claude Sonnet 5 GPT 5.4 mini Gemini 3 Flash Preview
Claude Sonnet 5 0.0000.0000.000
GPT 5.4 mini 0.0000.0000.000
Gemini 3 Flash Preview 0.0000.0000.000
A 0.0000.0000.000
B 0.0001.0000.000
C 0.0000.0000.000
D 1.0000.0001.000
E 0.0000.0000.000
F 0.0000.0000.000

Embeddings

Sample Embeddings

24 samples
Claude Sonnet 5 sample #0 · D · 1536 dims D Claude Sonnet 5 sample #1 · D · 1536 dims D Claude Sonnet 5 sample #2 · D · 1536 dims D Claude Sonnet 5 sample #3 · D · 1536 dims D Claude Sonnet 5 sample #4 · D · 1536 dims D Claude Sonnet 5 sample #5 · D · 1536 dims D Claude Sonnet 5 sample #6 · D · 1536 dims D Claude Sonnet 5 sample #7 · D · 1536 dims D GPT 5.4 mini sample #0 · B · 1536 dims B GPT 5.4 mini sample #1 · B · 1536 dims B GPT 5.4 mini sample #2 · B · 1536 dims B GPT 5.4 mini sample #3 · B · 1536 dims B GPT 5.4 mini sample #4 · B · 1536 dims B GPT 5.4 mini sample #5 · B · 1536 dims B GPT 5.4 mini sample #6 · B · 1536 dims B GPT 5.4 mini sample #7 · B · 1536 dims B Gemini 3 Flash Preview sample #0 · D · 1536 dims D Gemini 3 Flash Preview sample #1 · D · 1536 dims D Gemini 3 Flash Preview sample #2 · D · 1536 dims D Gemini 3 Flash Preview sample #3 · D · 1536 dims D Gemini 3 Flash Preview sample #4 · D · 1536 dims D Gemini 3 Flash Preview sample #5 · D · 1536 dims D Gemini 3 Flash Preview sample #6 · D · 1536 dims D Gemini 3 Flash Preview sample #7 · D · 1536 dims D
Claude Sonnet 5 GPT 5.4 mini Gemini 3 Flash Preview

Samples

Agent Responses

Claude Sonnet 5 openai_compatible · anthropic/claude-sonnet-5
#0 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances the district’s competing constraints: it preserves heterogeneous classrooms (addressing equity concerns and avoiding entrenched tracking), while still giving targeted instruction to similarly leveled small groups (addressing ...

#1 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) balances feasibility with equity and rigor: it avoids the logistical overhaul, cross-age scheduling conflicts, and equity risks of full tracking (A, E) or age-mixing, while being far more sustainable than personalized software-driven learn...

#2 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances the district's competing constraints: it preserves heterogeneous classrooms (addressing equity concerns about entrenching socioeconomic/racial divides) while still allowing targeted instruction for different ability levels wi...

#3 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances rigor, equity, and feasibility: it keeps classrooms heterogeneous (avoiding tracking's equity concerns) while still allowing targeted small-group instruction for different skill levels within a single teacher's room. It requi...

#4 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances the district’s competing priorities: it preserves heterogeneous classrooms (addressing equity concerns) while still giving targeted instruction to different skill levels, requires moderate rather than massive investment in re...

#5 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances the constraints: it keeps scheduling and staffing simple (no cross-grade logistics or quarterly re-sorting overhead), avoids the equity backlash of full tracking since classrooms remain mixed-ability overall, and only require...

#6 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) balances rigor and equity by keeping classrooms heterogeneous overall—avoiding the stigma and inequity risks of full tracking (E) or cross-grade grouping (A)—while still allowing targeted instruction for different skill levels within a man...

#7 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) best balances feasibility with equity and rigor: it requires far less scheduling/staffing overhaul than cross-grade or full tracking (A, E), avoids the technology burden and loss of teacher-student relationships of software-driven pathways...

GPT 5.4 mini openai_compatible · openai/gpt-5.4-mini
#0 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The policy needs to improve rigor for advanced students while avoiding rigid, long-term tracking that can deepen inequities. Flexible regrouping based on current formative data offers some skill-based differentiation, but still preserves movement between group...

#1 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that improves rigor and support without creating a rigid system that is hard to justify or reverse. Flexible regrouping offers skill-based instruction while still allowing students to move between groups as they grow, which helps ba...

#2 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that improves rigor and support while staying practical for scheduling and staffing. Flexible regrouping uses current assessment data to adapt placement over time, giving advanced students challenge and struggling students support w...

#3 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that improves rigor and equity without creating a permanently tracked system or requiring a major tech overhaul. Flexible regrouping based on frequent formative data lets students move between groups as their performance changes, wh...

#4 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that improves rigor and support without locking students into rigid tracks or requiring a major tech-heavy overhaul. Flexible regrouping offers targeted instruction based on current data, preserves movement between groups, and is ea...

#5 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that improves rigor and equity without requiring a full technology overhaul or creating rigid, hard-to-reverse tracks. Flexible regrouping based on formative data offers targeted instruction, lets students move between groups as the...

#6 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The district needs a policy that is evidence-informed, flexible, and easier to defend on equity grounds than permanent tracking. Flexible regrouping based on formative data balances rigor and movement for students, while avoiding the fixed inequality risks of ...

#7 B · Implement flexible regrouping, where students are re-sorted into skill-based groups for core subjects every quarter based on formative assessment data, allowing movement between groups. 1536 dims

The policy needs to balance rigor, equity, teacher workload, and feasibility while remaining flexible enough to avoid permanently locking students into tracks. Flexible regrouping by quarter uses assessment data to provide targeted instruction, allows movement...

Gemini 3 Flash Preview openai_compatible · google/gemini-3-flash-preview
#0 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

The school board's goal is to balance academic rigor and equity while ensuring the policy is sustainable for staff and defensible to parents. Option (D) achieves this by maintaining diverse classrooms to prevent socioeconomic segregation while allowing teacher...

#1 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) balances equity and rigor by keeping students in diverse social environments while allowing teachers to focus on small groups with similar needs. This approach is more logistically feasible and less demanding on teacher workload than full differenti...

#2 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) balances equity and rigor by keeping students in diverse classrooms while allowing teachers to focus on small groups with similar needs, which is more manageable than full-class differentiation. This model limits the social stigma of rigid tracking ...

#3 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Cluster grouping (D) balances equity and rigor by keeping students in diverse classrooms while allowing teachers to efficiently target instruction to specific skill levels. Unlike full tracking (E) or skill-based grading (A), it prevents social segregation and...

#4 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) balances equity and rigor by keeping students in diverse classrooms while allowing teachers to address specific achievement gaps through targeted small-group instruction. This approach minimizes the logistical complexity and labeling stigma of full ...

#5 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) provides a balanced approach by maintaining heterogenous socialization to ensure equity while allowing teachers to provide targeted, rigorous instruction to high and low achievers in manageable clusters. This model is more logistically feasible and ...

#6 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) balances equity and rigor by allowing high-performing students to work together while preventing the social stratification and scheduling nightmares associated with rigid tracking or age-mixing. This model is more feasible than full differentiation ...

#7 D · Use cluster grouping, keeping classrooms mixed-ability overall but placing small clusters of similarly leveled students together within each room for targeted instruction. 1536 dims

Option (D) balances equity and rigor by keeping students in diverse classrooms while allowing teachers to focus on specific skill needs through small-group clusters. This approach addresses parental concerns about rigor and equity without the scheduling comple...

Timings

Stages

Sampling5419 ms
Embedding1957 ms
Signals13 ms
PPV0 ms

Errors

Agent and Round Errors

No errors recorded.