A Bad Start Is Forever. A Good One Isn't.
Every July, fans ask which first-half teams are real. We pooled five full seasons (2021-2025, 150 team-seasons) and tested it out of sample. A team's quality is real (random-split r=0.74), but the first half predicts only about a THIRD of the second -- and almost all of that third is in the basement. A bad first half predicts the second at R2=0.33; a good one at just 0.07; the .500 muddle is noise. 82% of clearly-bad starts stayed buried vs 67% of good ones staying up -- gravity pulls down harder. The signal locks in by ~game 40 (Memorial Day); run differential and the plain record predict it best (fancy stats add nothing); bat speed is noise. Plus: the biggest frauds (2023 Angels 44-37 -> 29-52) and what the rule says about 2026.