Lead
"I'd like my child in the higher-level group." And also, sincerely: "I want all the children to learn together." Both impulses come from genuine parental concern. And neither intuition — that grouping helps the top, or that it leaves the bottom behind — has a simple answer.
For decades, educational researchers have been investigating exactly this question. Which of those two intuitions is more accurate? Or does it depend on circumstances? When large longitudinal datasets are brought to bear, what do they actually say?
Background: Two Different Systems
"Ability-based class organization" covers two distinct institutional arrangements.
The first is tracking: sorting students into separate classes or programs based on perceived ability, with each track following its own curriculum — common in US and European schools, common in the United States and parts of Europe: students are assigned to fixed courses or tracks by year level, each with its own curriculum. Once placed, movement between tracks is limited.
The second is ability grouping by subject (shūjukudo-betsu shidō), more common in Japan: within a given subject (usually mathematics), students are regrouped for individual units based on assessed proficiency, then returned to their home class. The grouping is not supposed to be permanent.
In Japan, a 2022 MEXT survey found that approximately 65% of elementary schools use within-subject ability grouping in mathematics, and more than 70% of middle schools do so [4]. The practice is familiar to most families, but few know what the evidence says about its effects.
The tracking research from the United States and Europe does not map perfectly onto Japanese within-subject grouping, but it offers the clearest longitudinal evidence on the basic question: what happens when children are sorted?
What Gamoran's Research Showed: The Reinforcement Hypothesis
The most influential work in this field came from Gamoran and Mare (1989), who analyzed data from more than 20,000 high school sophomores and juniors in a nationally representative U.S. sample [1]. Students placed in higher tracks showed gains in achievement; students placed in lower tracks showed a widening gap. The finding — that tracking amplifies existing differences rather than correcting them — was given a name: the reinforcement hypothesis: the idea that ability grouping widens existing gaps between high- and low-achieving students rather than narrowing them.
Gamoran (1992) went further and identified the primary mechanism: not the students' abilities themselves, but the quality of instruction delivered in each track [2]. Upper-track classes received more complex tasks and teachers held higher expectations. Lower-track classes received repetitive exercises aimed at catching up, delivered at a lower level of intellectual demand. The difference in instructional quality was the vehicle through which the gap widened — quietly, over time.
What the Meta-Analyses Show
Terrin and Trivetti (2023) synthesized 50 tracking studies and found that tracking does not raise overall achievement and does produce inequality: lower-track students experienced disadvantages in the range of g = −0.20 to −0.30, while the benefits to upper-track students were smaller by comparison [3]. The effect was not symmetric.
The analysis also pointed to a moderating factor: fixed tracking systems showed more damage to lower-track students than flexible systems with genuine reassignment. Mobility matters.
Slavin's (1990) systematic review reached a similar conclusion: the academic achievement effects of ability grouping are limited, and in the absence of differentiated instruction within each group, the effects are negligible [6].
The Japanese Case: Design Versus Practice
Japan's within-subject ability grouping is structurally different from European or American tracking: it is designed to be unit-by-unit and flexible, not a fixed assignment. MEXT has consistently stated that it should not become "fixed ability-based instruction."
In practice, the gap between design and implementation is real. Groups that are supposed to be reconstituted each unit sometimes remain stable across units; "upper group" and "lower group" become visible identities for the children and their families. The system is designed for fluidity, but fluidity requires active effort to maintain.
From northern Europe, Pekkarinen et al. (2009) analyzed Finnish data following a reform that abolished early tracking and found that abolishing it produced positive effects on the cognitive abilities of lower-achieving students [7]. The institutional design of sorting has long-run consequences for children's development.
Practical Takeaways
Three perspectives worth holding, given what the research shows:
Look at the quality of instruction, not just the group level. Which group a child is placed in matters less than the quality of the tasks offered in that group and the level of teacher expectation. At school observation days or parent-teacher conferences, asking about the content and approach of instruction is more informative than asking which group the child is in.
Don't lower your own expectations when a child is in a lower group. Part of the reinforcement mechanism documented in the research is that group labels lead both teachers and children themselves to calibrate their expectations downward. Maintaining "you can do this" at home provides a counterweight.
Ask about group changes without hesitation. Within-subject ability grouping is designed to be flexible. Feeling that the current group is not the right fit and raising that with the school is consistent with how the system is supposed to function.
Summary
Ability grouping is designed to help high achievers advance — but the research record repeatedly shows it also functions as a mechanism for widening gaps. For students in lower groups, the disadvantage tends to accumulate through differences in instructional quality, quietly, in ways that are hard to see from the outside.
Individual parents cannot change institutional design. But understanding how the mechanism works creates choices — in how you engage at home, in what you ask about at school, and in whether you treat a placement as fixed or open.
References
- Gamoran A, Mare RD. Secondary school tracking and educational inequality: compensation, reinforcement, or neutrality? Am J Sociol. 1989;94(5):1146–1183. doi:10.1086/229114
- Gamoran A. Is ability grouping equitable? Educ Leadersh. 1992;50(2):11–17.
- Terrin E, Trivetti M. The effect of school tracking on student achievement and inequality: a meta-analysis. Rev Educ Res. 2023;93(2):236–274. doi:10.3102/00346543221100850
- Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan. Survey on curriculum organization and implementation at public elementary and middle schools, FY2021. 2022. URL: https://www.mext.go.jp/a_menu/shotou/new-cs/index.htm
- Oakes J. Keeping Track: How Schools Structure Inequality. 2nd ed. New Haven: Yale University Press; 2005.
- Slavin RE. Achievement effects of ability grouping in secondary schools: a best-evidence synthesis. Rev Educ Res. 1990;60(3):471–499. doi:10.3102/00346543060003471
- Pekkarinen T, Uusitalo R, Kerr S. School tracking and development of cognitive skills. J Labor Econ. 2009;27(3):461–485. doi:10.1086/598308