What's often quoted in Warren Buffet's Billion dollar NCAA contest are the odds of randomly picking the perfect bracket of teams, but I have to think that Buffet's insurance actuaries actually calculated the odds of randomly selecting the right bracket with a millions of random guesses around a locus of likely guesses informed by analysis like the posted page (and from other stats analysis professionals.. vegas oddsmakers for example...). Otherwise the risk of taking that bet is actually much higher than a fully random pick, yes?
Edit: As it turns out, Warren Buffet, as usual, seems to have done his homework and even adds the interesting angle of "buying out" a possible winner partway through the contest:
Silver's round-by-round projection percentages are included in the HTML source. If you like building rule-specific points projections, here's the data, extracted:
A #16 seed has never beaten a #1 seed in the history of the tournament, yet the percentages of moving on for #1 seeds against #16 seeds vary from 96%-99%.
I wonder if there would be any of this variance (4 percentage points from 100%) if computers, without human input, were used to place the likelihood of moving on.
There was a big variance in the regular season performances of the #1 seeds this year, which is what the computers are picking up on. If anything, I think humans have the opposite effect of what you suspect, as they do things like overrating Wichita St for emotional reasons (undefeated record) while the most famous computer ranking http://www.usatoday.com/sports/ncaab/sagarin/ puts Wichita St as only the 13th best team.
Also, historically #1 seeds were stronger than they are now because top players didn't go to the NBA so quickly.
I would say the exact opposite, really. Players can't jump straight from high school anymore. So you have a larger base of extremely talented freshmen.
Also, I would consider the Pomeroy rankings to be better (many more variables, Sagarin uses point differentials which are silly).
There were only like 5-6 seasons when it was popular to go straight to the NBA from high school. In the last millennium, it was very uncommon. Instead, it was typical for top players to stay for 3-4 years. Now, they usually stay for just 1. Compare:
Along the lines of UNLV, check out the 2005 draft with UNC. #2, #5, #13, & #14. I think that's the most lottery picks from one school in a year, and three were in the same class (#2 was a freshman)!
I think TheBiv's point is that, even though you know it can happen, historical data suggests that 1-4% is too high a probability. It's never happened in, say, 20 years of a 64-team tournament? That's 80 #1 seeds? Then 1% might be a reasonable probability, but 4% does not seem reasonable.
Is this the first year Nate and his team have modeled the NCAA Tournament? I'd be curious to know if they applied a similar statistical approach last year and, if so, how that model faired against the actual tournament results.
Edit: As it turns out, Warren Buffet, as usual, seems to have done his homework and even adds the interesting angle of "buying out" a possible winner partway through the contest:
http://www.slate.com/articles/sports/sports_nut/2014/03/bill...