Thursday, December 13, 2007

Mitchell reported expected probability

Today at 2:00, the "Mitchell Report" will be released, dealing with Major League Baseball's "performance-enhancing drug" issues.

This provides an excellent opportunity for playing with binomial probability distributions. There are 30 Major League teams, and reportedly as many as 70-80 players named. The binomial probability formula says

P(k out of n) = [n! / (k! (n-k)!)] * p^k * q^(n-k)

where n is the number of trials, k is the number of successes, p is the probability of success and q is the probability of failure. Here each named player is a trial, and a "success" is that player belonging to a given team. In the simplistic case, assuming named players are distributed randomly across teams, and 75 players named, we can construct a probability table using n = 75, p = (1/30), and q = (1-p) as follows:



Probability and the Mitchell Report (p = 1/30)

Players named    Probability of that many players    Expected number of teams
 0                7.87%                               2
 1               20.34%                               6
 2               25.96%                               8
 3               21.78%                               7
 4               13.52%                               4
 5                6.62%                               2
 6                2.66%                               1
 7                0.91%                               0
 8                0.27%                               0
 9                0.07%                               0
10                0.02%                               0


We'd expect to see 2 teams with no named players and 1 team with 6, just as a matter of simple probability.
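For anyone who wants to check the arithmetic, here is a minimal Python sketch of how a table like this can be built. The function names (binomial_pmf, team_table) and constants are my own, made up for illustration, and the "expected number of teams" is assumed to be simply 30 times the probability, rounded when read off.

```python
from math import comb

N_PLAYERS = 75  # number of named players used for the table (n)
N_TEAMS = 30    # Major League teams


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def team_table(p, max_k=10):
    """Rows of (players named, probability, expected number of teams)."""
    rows = []
    for k in range(max_k + 1):
        prob = binomial_pmf(k, N_PLAYERS, p)
        # Assumption: expected teams = number of teams * probability
        rows.append((k, prob, N_TEAMS * prob))
    return rows


# Simplistic case: each named player belongs to exactly one of 30 teams
for k, prob, teams in team_table(1 / N_TEAMS):
    print(f"{k:>2}  {prob:7.2%}  {teams:5.2f}")
```

The last column prints unrounded, so the "2 teams with no named players" in the table is really about 2.4 before rounding.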

Now, it isn't, of course, that simple. Most players have played for more than one team. If Roger Clemens (to take one name that has been alleged) is on the list, he has played for the Red Sox, Blue Jays, Yankees and Astros. If we assume that the average named player has played for two teams, then p changes from 1/30 to 2/30. And the table changes to:



Probability and the Mitchell Report (p = 2/30)

Players named    Probability of that many players    Expected number of teams
 1                3.03%                               1
 2                8.01%                               2
 3               13.93%                               4
 4               17.91%                               5
 5               18.16%                               5
 6               15.13%                               5
 7               10.66%                               3
 8                6.47%                               2
 9                3.44%                               1
10                1.62%                               0
11                0.68%                               0
12                0.26%                               0




Now we don't expect any teams to have no players named. The odds are that every team will have a player named who either is, or has been, affiliated with the team.
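The "no players named" case can be checked directly. This is a rough sketch under the same assumptions as the table (n = 75, p = 2/30, each player treated as an independent trial); the variable and function names are mine, for illustration only.

```python
from math import comb

n = 75        # named players
teams = 30    # Major League teams
p = 2 / 30    # assumption from the post: the average named player has played for two teams


def pmf(k):
    """Binomial probability that a given team has exactly k named players."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


p_zero = pmf(0)                       # a given team has no named players
expected_clean_teams = teams * p_zero # expected number of such teams out of 30

print(f"P(no players named) = {p_zero:.2%}")
print(f"Expected teams with none = {expected_clean_teams:.2f}")
```

The probability comes out to roughly 0.57%, which across 30 teams is well under one team, so it rounds to zero.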
