Questions

Housing Day

Suppose Harvard College is conducting its housing lottery. For simplicity's sake, we'll say that there are 1200 freshmen who will be randomly assigned to 12 houses. Let $X_1, X_2, \ldots, X_{12}$ count how many students are placed in each house, from Pforzheimer ( $X_1$ ) all the way to Eliot ( $X_{12}$ ) (ordered from best house to worst).

Are $X_1$ and $X_2$ independent?

What is the joint distribution of $X_1, X_2, \ldots, X_{12}$ ?

What is the marginal distribution of $X_1$ , the number of students who are placed into Pforzheimer House, and the joint distribution of $X_1$ and $1200 - X_1$ ?

What is the conditional distribution of $X_1$ given $X_{10} + X_{11} + X_{12} = 450$ ?

Jelly Beans

I have a jar of 30 jellybeans: 10 red, 8 green, 12 blue. I draw a sample of 12 jellybeans without replacement. Let $X$ be the number of red jellybeans in the sample, $Y$ the number of green jellybeans.

Find $Cov(X, Y)$ .

Let $X = I_1 + \ldots + I_{12}$ , and $Y = J_1 + \ldots + J_{12}$ , where

\begin{aligned} I_i &= \begin{cases} 1 & \textrm{if the $i$th jellybean in the sample is red} \\ 0 & \textrm{otherwise} \end{cases} \\ J_i &= \begin{cases} 1 & \textrm{if the $i$th jellybean in the sample is green} \\ 0 & \textrm{otherwise} \end{cases}\end{aligned}

We can now solve using indicator variables and the fundamental bridge.

\begin{aligned} Cov\left({I_1, J_1}\right) &= E\left({I_1 J_1}\right) - E\left({I_1}\right) E\left({J_1}\right) \\ &= 0 - \left({\frac{10}{30}}\right)\left({\frac{8}{30}}\right) \\ Cov\left({I_1, J_2}\right) &= E\left({I_1 J_2}\right) - E\left({I_1}\right) E\left({J_2}\right) \\ &= \left({\frac{10}{30}}\right)\left({\frac{8}{29}}\right) - \left({\frac{10}{30}}\right)\left({\frac{8}{30}}\right) \\ Cov\left({X, Y}\right) &= \sum_{i=1}^{12} Cov\left({I_i, J_i}\right) + 2 \sum_{i < j} Cov\left({I_i, J_j}\right) \\ &= \sum_{i=1}^{12} Cov\left({I_1, J_1}\right) + 2 \binom{12}{2} Cov\left({I_1, J_2}\right) \\ &= 12 \cdot Cov\left({I_1, J_1}\right) + 12 \cdot 11 \cdot Cov\left({I_1, J_2}\right) \\ &= - \frac{96}{145}\end{aligned}
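The arithmetic in the last step is easy to get wrong by hand, so here is a minimal sketch that redoes it with exact fractions and then cross-checks the answer by summing over every possible sample composition. The function and constant names are made up for this illustration; only the numbers (10 red, 8 green, 30 total, sample of 12) come from the problem.

```python
from fractions import Fraction
from math import comb

# Jar composition and sample size, taken from the problem statement.
RED, GREEN, TOTAL, SAMPLE = 10, 8, 30, 12


def cov_same_draw():
    # Cov(I_1, J_1): one jellybean cannot be both red and green, so E(I_1 J_1) = 0.
    return Fraction(0) - Fraction(RED, TOTAL) * Fraction(GREEN, TOTAL)


def cov_different_draws():
    # Cov(I_1, J_2): first draw is red, second is green out of the remaining 29.
    joint = Fraction(RED, TOTAL) * Fraction(GREEN, TOTAL - 1)
    return joint - Fraction(RED, TOTAL) * Fraction(GREEN, TOTAL)


def cov_by_indicators():
    # 12 same-index terms plus 12 * 11 ordered pairs of distinct indices.
    return SAMPLE * cov_same_draw() + SAMPLE * (SAMPLE - 1) * cov_different_draws()


def cov_by_enumeration():
    # Independent check: E(XY) - E(X)E(Y), summing over all (red, green, blue) counts.
    blue = TOTAL - RED - GREEN
    total_ways = comb(TOTAL, SAMPLE)
    e_xy = Fraction(0)
    for x in range(RED + 1):
        for y in range(GREEN + 1):
            b = SAMPLE - x - y  # number of blue jellybeans in the sample
            if 0 <= b <= blue:
                ways = comb(RED, x) * comb(GREEN, y) * comb(blue, b)
                e_xy += Fraction(x * y * ways, total_ways)
    e_x = Fraction(SAMPLE * RED, TOTAL)
    e_y = Fraction(SAMPLE * GREEN, TOTAL)
    return e_xy - e_x * e_y


print(cov_by_indicators())   # -96/145
print(cov_by_enumeration())  # -96/145
```

Both routes should agree, which gives some confidence that the count of cross terms ($12 \cdot 11$ ordered pairs) is right.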

It's good to do a little sanity check at the end: it makes sense that the covariance is negative. If the sample contains a lot of red jellybeans, the sample probably has fewer green jellybeans. Another way to solve this is to create an indicator for each red jellybean and each green jellybean in the jar, where the indicator equals 1 if the jellybean is in the sample and 0 otherwise.
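As a sketch of that alternative approach, let $A_r$ indicate that red jellybean $r$ is in the sample and $B_g$ indicate that green jellybean $g$ is in the sample (these names are introduced here for illustration only). Each jellybean is in the sample with probability $\frac{12}{30}$ , and two distinct jellybeans are both in the sample with probability $\frac{12}{30} \cdot \frac{11}{29}$ . By symmetry, all $10 \cdot 8$ red-green pairs have the same covariance, so

\begin{aligned} Cov\left({A_1, B_1}\right) &= E\left({A_1 B_1}\right) - E\left({A_1}\right) E\left({B_1}\right) \\ &= \left({\frac{12}{30}}\right)\left({\frac{11}{29}}\right) - \left({\frac{12}{30}}\right)\left({\frac{12}{30}}\right) = - \frac{6}{725} \\ Cov\left({X, Y}\right) &= \sum_{r=1}^{10} \sum_{g=1}^{8} Cov\left({A_r, B_g}\right) = 10 \cdot 8 \cdot Cov\left({A_1, B_1}\right) = - \frac{96}{145}\end{aligned}

This matches the answer from the draw-by-draw indicators, and it avoids the separate same-index term because a red jellybean and a green jellybean are always distinct objects.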

Stat Courses

Let $X$ be the number of statistics majors in a certain college in the class of 2030, viewed as an r.v. Each statistics major chooses between two tracks: a general track in statistical principles, and a track in quantitative finance. Suppose that each statistics major chooses randomly which of these two tracks to follow, independently, with probability $p$ of choosing the general track. Let $Y$ be the number of statistics majors who choose the general track, and $Z$ be the number of statistics majors who choose the quantitative finance track.

Suppose that $X \sim \text{Pois}(\lambda)$ . Find the correlation between $X$ and $Y$ .

Let $n$ be the size of the class of 2030, where $n$ is a known constant. For this part and the next, instead of assuming that $X$ is Poisson, assume that each of the $n$ students chooses to be a statistics major with probability $r$ , independently. Find the joint distribution of $Y$ , $Z$ , and the number of non-statistics majors, and find the marginal distribution of each.

Continuing as in the previous part, find the correlation between $X$ and $Y$ .


