
Commit f597822
Author: buruzaemon
Commit message: more fine-tuning, clean-up of text
Parent: 6813776

28 files changed: +884, -520 lines

Lecture_03.ipynb (+11)

@@ -202,6 +202,17 @@
 "----"
 ]
 },
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": [
+"## Appendix A: The Birthday Paradox Experiment\n",
+"\n",
+"Here's a very [nice, interactive explanation of the Birthday Paradox](http://bit.ly/2NUDoPa).\n",
+"\n",
+"----"
+]
+},
 {
 "cell_type": "markdown",
 "metadata": {},

Lecture_07.ipynb (+9, -7): large diff, not rendered by default.

Lecture_08.ipynb (+14, -12): large diff, not rendered by default.

Lecture_09.ipynb (+12, -10): large diff, not rendered by default.

Lecture_10.ipynb (+8, -6): large diff, not rendered by default.

Lecture_11.ipynb (+21, -15): large diff, not rendered by default.

Lecture_12.ipynb (+5, -5)

@@ -127,7 +127,7 @@
 "> Variance is a measure of how a random variable is spread about its mean.\n",
 ">\n",
 "> \\begin{align}\n",
-"> Var(X) &= \mathbb{E}(X - \mathbb{E}X)^2 & \quad \text{or alternatively} \\\\\n",
+"> \operatorname{Var}(X) &= \mathbb{E}(X - \mathbb{E}X)^2 & \quad \text{or alternatively} \\\\\n",
 "> \\\\\n",
 "> &= \mathbb{E}X^2 - 2X(\mathbb{E}X) + \mathbb{E}(X^2) & \quad \text{by Linearity}\\\\\n",
 "> &= \boxed{\mathbb{E}X^2 - \mathbb{E}(X)^2}\n",
@@ -154,7 +154,7 @@
 "> The _standard deviation_ the square root of the variance.\n",
 ">\n",
 "> \\begin{align} \n",
-"> SD(X) &= \sqrt{Var(X)}\n",
+"> SD(X) &= \sqrt{\operatorname{Var}(X)}\n",
 "> \\end{align}\n",
 "\n",
 "Note that like variance, the formula for standard deviation is the same for both discrete and continuous r.v.\n",
@@ -176,7 +176,7 @@
 "\n",
 "### Notation\n",
 "\n",
-"$X \sim \mathcal{Unif}(a,b)$\n",
+"$X \sim \operatorname{Unif}(a,b)$\n",
 "\n",
 "### Parameters\n",
 "\n",
@@ -257,7 +257,7 @@
 " \mathbb{E}(g(x)) = \sum_{x} g(x) P(X=x)\n",
 "\end{align}\n",
 "\n",
-"### Variance of $U \sim \mathcal{Unif}(0,1)$\n",
+"### Variance of $U \sim \operatorname{Unif}(0,1)$\n",
 "\n",
 "\begin{align}\n",
 " \mathbb{E}(U) &= \frac{1}{b-a} \\\n",
@@ -283,7 +283,7 @@
 "source": [
 "## Universality of the Uniform\n",
 "\n",
-"Given an arbitrary CDF $F$ and the uniform $\mathcal{U} \sim \mathcal{Unif}(0,1)$, it is possible to simulate a draw from the continuous r.v. of the CDF $F$.\n",
+"Given an arbitrary CDF $F$ and the uniform $\operatorname{U} \sim \operatorname{Unif}(0,1)$, it is possible to simulate a draw from the continuous r.v. of the CDF $F$.\n",
 "\n",
 "Assume:\n",
 "\n",

Lecture_13.ipynb (+126, -53): large diff, not rendered by default.

Lecture_14.ipynb (+38, -30)

@@ -7,7 +7,7 @@
 "# Lecture 14: Location, Scale and LOTUS\n",
 "\n",
 "\n",
-"## Stat 110, Joe Blitzstein, Harvard University\n",
+"## Stat 110, Prof. Joe Blitzstein, Harvard University\n",
 "\n",
 "----"
 ]
@@ -24,7 +24,7 @@
 "- PDF $\frac{1}{\sqrt{2\pi}} ~~ e^{-\frac{z^2}{2}}$\n",
 "- CDF $\Phi$\n",
 "- Mean $\mathbb{E}(\mathcal{Z}) = 0$\n",
-"- Variance $\mathbb{Var}(\mathcal{Z}) = \mathbb{E}(\mathcal{Z}^2) = 1$\n",
+"- Variance $\operatorname{Var}(\mathcal{Z}) = \mathbb{E}(\mathcal{Z}^2) = 1$\n",
 "- Skew (3<sup>rd</sup> moment) $\mathbb{E}(\mathcal{Z^3}) = 0$ (odd moments are 0 since they are odd functions)\n",
 "- $-\mathcal{Z} \sim \mathcal{N}(0,1)$ (by symmetry; this simply flips the bell curve about its mean)\n",
 "\n",
@@ -42,27 +42,28 @@
 "## Rules on Variance\n",
 "\n",
 "\begin{align}\n",
-" \mathbb{Var}(X) &= \mathbb{E}( (X - \mathbb{E}X)^2 ) \\\n",
-" &= \mathbb{E}X^2 - (\mathbb{E}X)^2 \\\n",
+" & \text{[1]} & \operatorname{Var}(X) &= \mathbb{E}( (X - \mathbb{E}X)^2 ) \\\n",
+" & & &= \mathbb{E}X^2 - (\mathbb{E}X)^2 \\\n",
 " \\\n",
-" \mathbb{Var}(X+c) &= \mathbb{Var}(X) \\\n",
+" & \text{[2]} & \operatorname{Var}(X+c) &= \operatorname{Var}(X) \\\n",
 " \\\n",
-" \mathbb{Var}(cX) &= c^2 ~~ \mathbb{Var}(X) \\\n",
+" & \text{[3]} & \operatorname{Var}(cX) &= c^2 ~~ \operatorname{Var}(X) \\\n",
 " \\\n",
-" \mathbb{Var}(X+Y) &\neq \mathbb{Var}(X) + \mathbb{Var}(Y) ~~ \text{in general} \n",
+" & \text{[4]} & \operatorname{Var}(X+Y) &\neq \operatorname{Var}(X) + \operatorname{Var}(Y) ~~ \text{in general} \n",
 "\end{align}\n",
 "\n",
-"1. We already know this.\n",
-"1. Adding a constant $c$ has no effect on $\mathbb{Var}(X)$.\n",
-"1. $\mathbb{Var}(X) \ge 0$; $\mathbb{Var}(X)=0$ if and only if $P(X=a) = 1$ for some $a$... _variance can never be negative!_\n",
-"1. Unlike expected value, variance is _not_ linear. But if $X$ and $Y$ are independent, then $\mathbb{Var}(X+Y) = \mathbb{Var}(X) + \mathbb{Var}(Y)$.\n",
+"* We already know $\text{[1]}$\n",
+"* Re $\text{[2]}$, adding a constant $c$ has no effect on $\operatorname{Var}(X)$.\n",
+"* Re $\text{[3]}$, pulling out a scaling constant $c$ means you have to square it.\n",
+"* $\operatorname{Var}(X) \ge 0$; $\operatorname{Var}(X)=0$ if and only if $P(X=a) = 1$ for some $a$... _variance can never be negative!_\n",
+"* Re $\text{[4]}$, unlike expected value, variance is _not_ linear. But if $X$ and $Y$ are independent, then $\operatorname{Var}(X+Y) = \operatorname{Var}(X) + \operatorname{Var}(Y)$.\n",
 "\n",
 "As a case in point for (4), consider\n",
 "\n",
 "\begin{align}\n",
-" \mathbb{Var}(X + X) &= \mathbb{Var}(2X) \\\n",
-" &= 4 ~~ \mathbb{Var}(X) \\\n",
-" &\neq 2 ~~ \mathbb{Var}(X) & \quad \blacksquare \\\n",
+" \operatorname{Var}(X + X) &= \operatorname{Var}(2X) \\\n",
+" &= 4 ~~ \operatorname{Var}(X) \\\n",
+" &\neq 2 ~~ \operatorname{Var}(X) & \quad \blacksquare \\\n",
 "\end{align}\n",
 "\n",
 "... and now we know enough about variance to return back to the general form of the normal distribution.\n",
@@ -98,7 +99,7 @@
 "From what we know about variance,\n",
 "\n",
 "\begin{align}\n",
-" \mathbb{Var}(\mu + \sigma \mathcal{Z}) &= \sigma^2 ~~ \mathbb{Var}(\mathcal{Z}) \\\n",
+" \operatorname{Var}(\mu + \sigma \mathcal{Z}) &= \sigma^2 ~~ \operatorname{Var}(\mathcal{Z}) \\\n",
 " &= \sigma^2\n",
 "\end{align}\n",
 "\n",
@@ -165,7 +166,7 @@
 "collapsed": true
 },
 "source": [
-"## Variance of $\mathbb{Pois}(\lambda)$\n",
+"## Variance of $\operatorname{Pois}(\lambda)$\n",
 "\n",
 "### Intuition\n",
 "\n",
@@ -184,11 +185,11 @@
 " \mathbb{E}(X^2) &= \sum_x x^2 ~ P(X=x) \\\n",
 "\end{align}\n",
 "\n",
-"### The case for $Pois(\lambda)$\n",
+"### The case for $\operatorname{Pois}(\lambda)$\n",
 "\n",
-"Let $X \sim \mathbb{Pois}(\lambda)$. \n",
+"Let $X \sim \operatorname{Pois}(\lambda)$. \n",
 "\n",
-"Recall that $\mathbb{Var}(X) = \mathbb{E}X^2 - (\mathbb{E}X)^2$. We know that $\mathbb{E}(X) = \lambda$, so all we need to do is figure out what $\mathbb{E}(X^2)$ is.\n",
+"Recall that $\operatorname{Var}(X) = \mathbb{E}X^2 - (\mathbb{E}X)^2$. We know that $\mathbb{E}(X) = \lambda$, so all we need to do is figure out what $\mathbb{E}(X^2)$ is.\n",
 "\n",
 "\begin{align}\n",
 " \mathbb{E}(X^2) &= \sum_{k=0}^{\infty} k^2 ~ \frac{e^{-\lambda} \lambda^k}{k!} \\\n",
@@ -204,7 +205,7 @@
 " &= e^{-\lambda} \lambda e^{\lambda} (\lambda + 1) \\\n",
 " &= \lambda^2 + \lambda \\\n",
 " \\\n",
-" \mathbb{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
+" \operatorname{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
 " &= \lambda^2 + \lambda - \lambda^2 \\\n",
 " &= \lambda & \quad \blacksquare\n",
 "\end{align}\n",
@@ -216,22 +217,22 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"## Variance of $\mathbb{Binom}(X)$\n",
+"## Variance of $\operatorname{Binom}(X)$\n",
 "\n",
-"Let $X \sim \mathbb{Binom}(n,p)$.\n",
+"Let $X \sim \operatorname{Binom}(n,p)$.\n",
 "\n",
 "$\mathbb{E}(X) = np$. \n",
 "\n",
-"Find $\mathbb{Var}(X)$ using all the tricks you have at your disposal.\n",
+"Find $\operatorname{Var}(X)$ using all the tricks you have at your disposal.\n",
 "\n",
 "### The path of least resistance\n",
 "\n",
 "Let's try applying (4) from the above Rules of Variance. \n",
 "\n",
-"We can do so because $X \sim \mathbb{Binom}(n,p)$ means that the $n$ trials are _independent Bernoulli_.\n",
+"We can do so because $X \sim \operatorname{Binom}(n,p)$ means that the $n$ trials are _independent Bernoulli_.\n",
 "\n",
 "\begin{align}\n",
-" X &= I_1 + I_2 + \dots + I_n & \quad \text{where } I_j \text{ are i.i.d. } \mathbb{Bern}(p) \\\n",
+" X &= I_1 + I_2 + \dots + I_n & \quad \text{where } I_j \text{ are i.i.d. } \operatorname{Bern}(p) \\\n",
 " \\\n",
 " \Rightarrow X^2 &= I_1^2 + I_2^2 + \dots + I_n^2 + 2I_1I_2 + 2I_1I_3 + \dots + 2I_{n-1}I_n & \quad \text{don't worry, this is not as bad as it looks} \\\n",
 " \\\n",
@@ -240,7 +241,7 @@
 " &= n p + n (n-1) p^2 & \quad \text{since } I_1I_2 \text{ is the event that both } I_1 \text{ and } I_2 \text{ are successes} \\\n",
 " &= np + n^2 p^2 - np^2 \\\n",
 " \\\n",
-" \mathbb{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
+" \operatorname{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
 " &= np + n^2 p^2 - np^2 - (np)^2 \\\n",
 " &= np - np^2 \\\n",
 " &= np(1-p) \\\n",
@@ -254,13 +255,13 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"## Variance of $\mathbb{Geom}(p)$\n",
+"## Variance of $\operatorname{Geom}(p)$\n",
 "\n",
-"Let $X \sim \mathbb{Geom}(p)$.\n",
+"Let $X \sim \operatorname{Geom}(p)$.\n",
 "\n",
 "It has PDF $q^{k-1}p$.\n",
 "\n",
-"Find $\mathbb{Var}(X)$.\n",
+"Find $\operatorname{Var}(X)$.\n",
 "\n",
 "### Applying what we know of the Geometric Series\n",
 "\n",
@@ -285,7 +286,7 @@
 " &= p \frac{q+1}{p^3} \\\n",
 " &= \frac{q+1}{p^2} \\\n",
 " \\\n",
-" \mathbb{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
+" \operatorname{Var}(X) &= \mathbb{E}(X^2) - (\mathbb{E}X)^2 \\\n",
 " &= \frac{q+1}{p^2} - \left( \frac{1}{p} \right)^2 \\\n",
 " &= \frac{q+1}{p^2} - \frac{1}{p^2} \\\n",
 " &= \boxed{\frac{q}{p^2}} & \quad \blacksquare\n",
@@ -313,6 +314,13 @@
 "\n",
 "----"
 ]
+},
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": [
+"View [Lecture 14: Location, Scale, and LOTUS | Statistics 110](http://bit.ly/2CyYFg4) on YouTube."
+]
 }
 ],
 "metadata": {

Lecture_16.ipynb (+67, -31): large diff, not rendered by default.

Lecture_17.ipynb (+20, -11)

@@ -7,7 +7,7 @@
 "# Lecture 17: Moment Generating Functions (MGFs), hybrid Bayes' rule, Laplace's rule of succession\n",
 "\n",
 "\n",
-"## Stat 110, Joe Blitzstein, Harvard University\n",
+"## Stat 110, Prof. Joe Blitzstein, Harvard University\n",
 "\n",
 "----"
 ]
@@ -16,9 +16,9 @@
 "cell_type": "markdown",
 "metadata": {},
 "source": [
-"## $\mathbb{Expo}(\lambda)$ and the Memorylessness Property\n",
+"## $\operatorname{Expo}(\lambda)$ and the Memorylessness Property\n",
 "\n",
-"#### Theorem: If $X$ is a positive, continuous r.v. with the memorylessness property, then $X \sim \mathbb{Expo}(\lambda)$ for some $\lambda$.\n",
+"#### Theorem: If $X$ is a positive, continuous r.v. with the memorylessness property, then $X \sim \operatorname{Expo}(\lambda)$ for some $\lambda$.\n",
 "\n",
 "Let $F$ be the CDF of $X$, $G(x) = P(X \ge x) = 1 - F(x)$.\n",
 "\n",
@@ -52,7 +52,7 @@
 " & & &= e^{-\lambda x} & \quad \blacksquare \\\n",
 "\end{align}\n",
 "\n",
-"And so now we see that in the continuous case, $\mathbb{Expo}(\lambda)$ is the only distribution with the memorylessness property."
+"And so now we see that in the continuous case, $\operatorname{Expo}(\lambda)$ is the only distribution with the memorylessness property."
 ]
 },
 {
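
The memorylessness property these hunks refer to, P(X ≥ s + t | X ≥ s) = P(X ≥ t), can be seen directly in simulation. A sketch with arbitrary λ, s, t (not part of the commit):

import numpy as np

rng = np.random.default_rng(17)
lam, s, t = 1.0, 0.8, 0.5  # arbitrary

x = rng.exponential(1 / lam, size=1_000_000)  # numpy takes the mean 1/lam
survived = x[x >= s]

print(np.mean(survived >= s + t))  # P(X >= s+t | X >= s)
print(np.mean(x >= t))             # P(X >= t); both ~ e^{-lam*t}, about 0.607
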
@@ -81,7 +81,7 @@
 "#### Moments\n",
 "\n",
 "* the average value for a random variable $X$ $\mathbb{E}(X)$ is known as the _first moment_\n",
-"* the _second moment_ of $X$ is $\mathbb{E}(X^{2})$ which helps use derive $\mathbb{Var}(X)$\n",
+"* the _second moment_ of $X$ is $\mathbb{E}(X^{2})$ which helps use derive $\operatorname{Var}(X)$\n",
 "* higher moments are easily generated (derived), as well\n",
 "\n",
 "### 3 reasons why MGF is important\n",
@@ -104,14 +104,14 @@
 "\n",
 "### MGF for $Bern(p)$\n",
 "\n",
-"Given $X \sim Bern(p)$, we obtain the MGF with\n",
+"Given $X \sim \operatorname{Bern}(p)$, we obtain the MGF with\n",
 "\n",
 "\begin{align}\n",
 " M(t) &= \mathbb{E}(e^{tX}) \\\n",
 " &= p \, e^t * q &\quad \text{ where } q = 1-p\n",
 "\end{align}\n",
 "\n",
-"### MGF for $Bin(p)$\n",
+"### MGF for $\operatorname{Bin}(p)$\n",
 "\n",
 "Given $X \sim Bin(n,p)$, we obtain the MGF with\n",
 "\n",
@@ -168,13 +168,13 @@
 "\n",
 "_If we have observed the sun rising for the past $n$ days in succession, then what is the probability that the sun will rise tomorrow?_\n",
 "\n",
-"Given $p$ is the probability that the sun will rise on any given day $X_k$, we can consider a consecutive string of days $X_1, X_2, \dots \text{ i.i.d. } Bern(p)$ which is conditional on $p$. But for the question above, we do not know what $p$ is. Bayesians treat $p$ as an r.v.\n",
+"Given $p$ is the probability that the sun will rise on any given day $X_k$, we can consider a consecutive string of days $X_1, X_2, \dots \text{ i.i.d. } \operatorname{Bern}(p)$ which is conditional on $p$. But for the question above, we do not know what $p$ is. Bayesians treat $p$ as an r.v.\n",
 "\n",
 "### Problem structure\n",
 "\n",
-"* Let $p \sim Unif(0,1)$ be our _prior_; we choose $Unif(0,1)$ since $p$ could be _anything_\n",
+"* Let $p \sim \operatorname{Unif}(0,1)$ be our _prior_; we choose $\operatorname{Unif}(0,1)$ since $p$ could be _anything_\n",
 "* Let $S_n = X_1 + X_2 + \cdots + X_n$\n",
-"* So we then assume $S_n | p \sim Bin(n,p) \text{, } p \sim Unif(0,1)$\n",
+"* So we then assume $S_n | p \sim \operatorname{Bin}(n,p) \text{, } p \sim \operatorname{Unif}(0,1)$\n",
 "\n",
 "### Questions\n",
 "\n",
@@ -209,7 +209,16 @@
 " \text{and } P(X_{n+1}=1 | S_n=n) &= \int_{0}^{1} (n+1) \, p \, p^n \, dp &\quad \text{ Fundamental Bridge, } \mathbb{E}(p | S_n=n) \\ \n",
 " &= \int_{0}^{1} (n+1) \, p^{n+1} \, dp \\\n",
 " &= \boxed{\frac{n+1}{n+2}}\n",
-"\end{align}\n"
+"\end{align}\n",
+"\n",
+"----"
+]
+},
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": [
+"View [Lecture 17: Moment Generating Functions | Statistics 110](http://bit.ly/2CxVsgR) on YouTube."
 ]
 }
 ],
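
The boxed answer (n+1)/(n+2) can also be checked by simulating the hierarchical model p ~ Unif(0,1), X_i | p i.i.d. Bern(p). A sketch with an arbitrary n (not part of the commit):

import numpy as np

rng = np.random.default_rng(110)
n, n_sims = 10, 2_000_000  # arbitrary

p = rng.uniform(0, 1, size=n_sims)       # prior draw for each simulated world
s_n = rng.binomial(n, p)                 # S_n | p ~ Bin(n, p)
rose_n_days = s_n == n                   # condition on n sunrises in a row
next_day = rng.uniform(size=n_sims) < p  # X_{n+1} | p ~ Bern(p)

print(np.mean(next_day[rose_n_days]))    # ~ (n+1)/(n+2)
print((n + 1) / (n + 2))                 # = 11/12, about 0.917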
