Round 18
District AI Tutoring Rollout
A mid-sized school district (22 schools, 14,000 students, grades 3-12) has a one-time $2.4M grant to improve math outcomes over three years, after which ongoing funding must come from the regular budget (~$400K/yr sustainable). Test scores have stagnated and teacher shortages have left many classrooms with underqualified substitutes in math. The district must decide how to deploy the grant. Options differ in how directly they intervene in instruction, how much they depend on teacher buy-in, how equitably benefits reach struggling vs. advanced students, and how sustainable they are once grant funding ends. Board members, principals, teachers' union reps, and parent advocates all have different priorities: some want measurable short-term score gains, some want to protect teacher autonomy and jobs, some want equity for under-resourced schools, and some worry about over-reliance on unproven technology or vendor lock-in.
Status: DECIDED
Winner: F · Target the AI tutoring platform only at the lowest-performing quartile of students in the highest-poverty schools, paired with small-group human tutoring for the rest of that cohort.
Majority: F · Target the AI tutoring platform only at the lowest-performing quartile of students in the highest-poverty schools, paired with small-group human tutoring for the rest of that cohort.
Duration: 10051 ms
Influences
| Label | Role | Score |
|---|---|---|
| GPT 5.4 mini | delegate | 1.300 |
| Gemini 3 Flash Preview | delegate | 1.211 |
| Claude Sonnet 5 | delegate | 1.000 |
Voting matrix
Constructed Delegation Matrix
| To / From | Claude Sonnet 5 | GPT 5.4 mini | Gemini 3 Flash Preview |
|---|---|---|---|
| Claude Sonnet 5 | 0.000 | 0.000 | 0.000 |
| GPT 5.4 mini | 0.000 | 0.000 | 0.300 |
| Gemini 3 Flash Preview | 0.000 | 0.211 | 0.000 |
| A | 1.000 | 0.000 | 0.000 |
| B | 0.000 | 0.000 | 0.000 |
| C | 0.000 | 0.000 | 0.000 |
| D | 0.000 | 0.000 | 0.000 |
| E | 0.000 | 0.000 | 0.000 |
| F | 0.000 | 0.789 | 0.700 |
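The matrix above can be read column by column: each "From" column sums to 1.000 and splits one agent's vote between other agents (delegation) and options (direct vote). The sketch below is one possible reading, not the tool's documented method. It assumes an influence score equals 1 plus the weight an agent receives from others, which happens to reproduce the scores in the Influences table, and it assumes delegated weight is passed along repeatedly until it lands on an option. The function names `influence_scores` and `option_totals` are hypothetical.

```python
# Values copied from the Constructed Delegation Matrix.
# Columns are "From" (who gives weight); rows are "To" (who receives it).
AGENTS = ["Claude Sonnet 5", "GPT 5.4 mini", "Gemini 3 Flash Preview"]
OPTIONS = ["A", "B", "C", "D", "E", "F"]

# delegation[src][dst]: share of src's vote handed to another agent.
delegation = {
    "Claude Sonnet 5": {},
    "GPT 5.4 mini": {"Gemini 3 Flash Preview": 0.211},
    "Gemini 3 Flash Preview": {"GPT 5.4 mini": 0.300},
}

# direct[src][option]: share of src's vote cast directly on an option.
direct = {
    "Claude Sonnet 5": {"A": 1.000},
    "GPT 5.4 mini": {"F": 0.789},
    "Gemini 3 Flash Preview": {"F": 0.700},
}


def influence_scores():
    """Assumed rule: own vote (1.0) plus weight received from other agents."""
    scores = {agent: 1.0 for agent in AGENTS}
    for src, targets in delegation.items():
        for dst, share in targets.items():
            scores[dst] += share
    return scores


def option_totals(rounds=50):
    """Pass delegated weight along until almost all of it reaches an option."""
    held = {agent: 1.0 for agent in AGENTS}  # weight each agent currently holds
    totals = {option: 0.0 for option in OPTIONS}
    for _ in range(rounds):
        passed_on = {agent: 0.0 for agent in AGENTS}
        for src, weight in held.items():
            for option, share in direct[src].items():
                totals[option] += weight * share
            for dst, share in delegation[src].items():
                passed_on[dst] += weight * share
        held = passed_on
    return totals


if __name__ == "__main__":
    for agent, score in influence_scores().items():
        print(f"{agent}: {score:.3f}")
    for option, total in option_totals().items():
        print(f"{option}: {total:.3f}")
```

Under these assumptions, all weight from GPT 5.4 mini and Gemini 3 Flash Preview ends up on F and Claude Sonnet 5's weight stays on A, which is consistent with F being reported as the winner.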
Samples
Agent Responses
A phased pilot allows the district to rigorously test AI tutoring's effectiveness, teacher buy-in, and equity impacts on a small scale before committing the full $2.4M district-wide, avoiding the risk of vendor lock-in or wasted funds on an unproven approach. ...
A phased pilot (A) directly addresses the stakeholders' core concern—unproven technology and vendor lock-in—by generating rigorous evidence before committing the full $2.4M district-wide, while still leaving room to redirect funds toward equity, PD, or staffin...
A phased pilot (A) lets the district gather rigorous evidence on effectiveness, teacher buy-in, and equity impact before committing the full $2.4M, avoiding vendor lock-in risk and unsustainable post-grant costs. It balances competing stakeholder concerns—auto...
A phased pilot (A) directly addresses the core tension: it generates rigorous evidence on effectiveness and equity impact before committing the full $2.4M to an unproven district-wide rollout, avoiding vendor lock-in and sustainability risk once grant funds ex...
A phased pilot (A) directly addresses the district's core uncertainties—unproven technology, vendor lock-in, and sustainability after grant funding ends—by generating rigorous evidence before committing to district-wide scale. It also balances competing stakeh...
A phased pilot (Option A) best balances competing priorities by generating rigorous local evidence on efficacy, equity impact, and teacher buy-in before committing the full $2.4M to an unproven district-wide rollout that risks vendor lock-in and unsustainable ...
A phased two-year pilot (A) allows the district to gather rigorous evidence on the AI tutoring platform's effectiveness, equity impact, and teacher buy-in before committing the full one-time grant to a district-wide rollout that must later be sustained on a mu...
A phased pilot (A) directly addresses the tension between wanting measurable gains and avoiding over-reliance on unproven, expensive technology by generating rigorous evidence before committing the full $2.4M district-wide, which matters since only ~$400K/yr i...
The best choice is the option that most directly improves math instruction for the students who need it most, while also being realistic about limited sustainable funding after the grant ends. Option F targets the lowest-performing students in the highest-pove...
The best choice is the option that maximizes likely math gains while fitting the district’s long-term budget and equity goals. A districtwide, heavily vendor-dependent rollout is risky, and a purely staffing-based plan may be hard to sustain after the grant en...
The most balanced choice is the option that directly addresses the shortage of qualified math instruction, targets the students with the greatest need, and is more likely to be sustainable after the grant ends. Option (F) focuses resources on the lowest-perfor...
The best choice is the option that most directly improves math instruction for the students who need it most, while also being realistic about limited ongoing funding and teacher shortages. Targeting the lowest-performing students in the highest-poverty school...
The district needs a plan that addresses the immediate instructional gaps, uses the grant where it can most directly raise math achievement, and remains financially sustainable after three years. Targeting the lowest-performing students in the highest-poverty ...
The district needs a plan that can produce measurable math gains quickly, address shortages, and still be sustainable after the grant ends. Option F targets the students most in need in high-poverty schools, pairing AI with human tutoring for stronger support ...
The district needs a plan that can produce meaningful math gains, especially for students most harmed by shortages and stagnation, while also being feasible to sustain after the grant ends. Option F is the strongest balance: it targets the students and schools...
The best option is the one that balances evidence, equity, scalability, and sustainability under a temporary grant. A phased pilot in volunteer schools allows the district to test whether AI tutoring actually raises math scores before committing the full budge...
Option (F) balances immediate intervention for teacher shortages with long-term fiscal sustainability by focusing resources on the students with the greatest need. This targeted approach addresses equity concerns and allows the district to maintain a high-impa...
Option (F) balances immediate intervention for teacher shortages with long-term financial sustainability by targeting the students with the greatest need rather than thinning resources district-wide. This targeted approach addresses equity concerns, remains wi...
The district faces a combination of acute teacher shortages, a need for equitable outcomes, and a long-term sustainability gap between grant funding and the regular budget. Option (F) addresses these by concentrating high-impact resources on the students with ...
To solve this district-scale challenge, the district must balance fiscal sustainability, the urgency of teacher shortages, and the need for equitable math improvement. Option (B) leverages the grant to address immediate instructional gaps caused by underqualif...
This decision requires balancing immediate student needs, financial sustainability, and the reality of teacher shortages. Option (F) addresses the district's equity goals by targeting resources toward low-performing students in high-poverty schools while combi...
The district faces a combination of a one-time funding surge ($2.4M), a lower long-term budget ($400k), and chronic teacher shortages. Option (B) provides the most immediate, scalable solution to address the lack of qualified teachers across 22 schools while b...
Option (F) balances immediate intervention for teacher-depleted schools with the fiscal reality of the $400K/yr long-term budget by narrowing the scope. It addresses equity by prioritizing high-poverty students and mitigates "unproven technology" concerns by p...
Option (F) balances immediate impact on math outcomes with long-term fiscal sustainability by concentrating resources on students with the highest need rather than over-extending the $400K annual budget. This targeted approach addresses equity concerns, mitiga...