diff --git a/BDA 5.9.1-2-4.ipynb b/BDA 5.9.1-2-4.ipynb new file mode 100644 index 0000000..67737be --- /dev/null +++ b/BDA 5.9.1-2-4.ipynb @@ -0,0 +1,138 @@ +{ + "cells": [ + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Problem 5.9.1\n", + "\n", + "### Exchangeability with known model parameters\n", + "\n", + "For each of the following three examples, answer: \n", + "1. Are observations $y_1$ and $y_2$ exchangeable? \n", + "2. Are observations $y_1$ and $y_2$ independent? \n", + "3. Can we act as if the two observations are independent? \n", + "\n", + "Examples:\n", + "1. A box has one black ball and one white ball. We pick a ball $y_1$ at random, put it back, and pick another ball $y_2$ at random. \n", + "\n", + "Here the observations are independent, since the box is restored before the second draw, and therefore also exchangeable. \n", + "\n", + "\n", + "2. A box has one black ball and one white ball. We pick a ball $y_1$ at random, we do not put it back, then we pick ball $y_2$.\n", + "\n", + "In this case there are four outcomes: (BB), (BW), (WB), (WW) and of these four only (WB) and (BW) have non-zero probability (1/2 each). Since the joint distribution is symmetric in $y_1$ and $y_2$, the observations are exchangeable. Clearly, though,\n", + "the observations aren't independent; for example P(B|B)=0 and P(B|W)=1. And you clearly can't act as if they're independent since the second observation is determined by the first.\n", + "\n", + "3. A box has a million black balls and a million white balls. We pick a ball $y_1$ at random, we do not put it back, then we pick ball $y_2$ at random.\n", + "\n", + "These observations are exchangeable since P(BW)=(1/2)(1000000/1999999)=P(WB) and P(BB)=(1/2)(999999/1999999)=P(WW). They are not strictly independent, since P(B|B)=999999/1999999 differs from P(B|W)=1000000/1999999, but we can act as if they are: both (1/2)(1000000/1999999) and (1/2)(999999/1999999) are very nearly 1/4.\n", + "\n", + "\n", + "Gelman, Andrew; Carlin, John B.; Stern, Hal S.; Dunson, David B.; Vehtari, Aki; Rubin, Donald B.. Bayesian Data Analysis, Third Edition (Chapman & Hall/CRC Texts in Statistical Science) (Page 134). CRC Press. Kindle Edition. 
" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Problem 5.9.2\n", + "\n", + "We ask the same questions as in the preceding problem but under the conditions:\n", + "\n", + "1. A box has $n$ balls colored black and white, but we don't know how many of each. We pick a ball, put it back, then pick another.\n", + "2. Same except we pick a ball, don't put it back, then pick another.\n", + "3. Suppose we know that there are a lot of balls of each color.\n", + "\n", + "In the first case, let $\\theta$ be the proportion of white balls in the urn. Then $P(BW)=(1-\\theta)\\theta$\n", + "and $P(WB)=\\theta(1-\\theta)$. So the observations are exchangeable. Also $P(BB)=(1-\\theta)^2=P(B)^2$, $P(BW)=P(WB)=\n", + "P(W)P(B)$, and $P(WW)=P(W)^2$. So they are independent given $\\theta$. Since $\\theta$ is unknown, though, the first draw carries information about $\\theta$ and hence about the second draw: averaging over $\\theta$ gives $P(WW)=E(\\theta^2)\\geq E(\\theta)^2=P(W)^2$, with strict inequality whenever $\\theta$ is uncertain, so marginally the observations are not independent. \n", + "\n", + "In the second case, we have $P(WB)=(\\theta)(n(1-\\theta)/(n-1))$ and $P(BW)=(1-\\theta)(n\\theta)/(n-1)$\n", + "and these are the same, so the observations are exchangeable. But they are not independent, even given $\\theta$, since $P(WB)$ isn't $P(W)P(B)$.\n", + "Also $P(WW)=(\\theta)(n\\theta-1)/(n-1)$ and $P(BB)=(1-\\theta)(n-n\\theta-1)/(n-1)$.\n", + "\n", + "Given $\\theta$, they do get close to independent if $n$ is large: the difference between $P(WW)$ without replacement and $\\theta^2$ is $-\\theta(1-\\theta)/(n-1)$.\n", + "\n" + ] + }, + { + "cell_type": "code", + "execution_count": 24, + "metadata": {}, + "outputs": [ + { + "data": { + "image/png": "\n", + "text/plain": [ + "
" + ] + }, + "metadata": {}, + "output_type": "display_data" + } + ], + "source": [ + "import numpy as np\nimport matplotlib.pyplot as plt\n\n# These graphs show the difference Pnr(WW)-Pr(WW), where Pnr means no replacement and Pr means with replacement.\n# Here x plays the role of theta, the proportion of white balls.\n", + "x=np.linspace(0,1,100)\n", + "n=5\n", + "fig,ax=plt.subplots(1,2)\n", + "ax[0].plot(x,x*(n*x-1)/(n-1)-x*x)\n", + "ax[0].set_title('n=5')\n", + "n=100\n", + "ax[1].plot(x,x*(n*x-1)/(n-1)-x*x)\n", + "#ax[1].plot(x,x*x)\n", + "ax[1].set_title('n=100')\n", + "plt.show()" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "# Problem 5.9.4\n", + "\n", + "Exchangeable prior distributions: suppose it is known a priori that the $2J$ parameters $\\theta_1,\\ldots,\\theta_{2J}$ are clustered into two groups, with exactly half being drawn from a $N(1, 1)$ distribution, and the other half being drawn from a $N(-1, 1)$ distribution, but we have not observed which parameters come from which distribution. \n", + "\n", + "1. Are $\\theta_1,\\ldots,\\theta_{2J}$ exchangeable under this prior distribution? \n", + "2. Show that this distribution cannot be written as a mixture of independent and identically distributed components.\n", + "3. Why can we not simply take the limit as $J\\to\\infty$ and get a counterexample to de Finetti’s theorem?\n", + "\n", + "Gelman, Andrew; Carlin, John B.; Stern, Hal S.; Dunson, David B.; Vehtari, Aki; Rubin, Donald B.. Bayesian Data Analysis, Third Edition (Chapman & Hall/CRC Texts in Statistical Science) (Page 134). CRC Press. Kindle Edition. " + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "Let's look at the case $J=1$, writing $x=\\theta_1$ and $y=\\theta_2$. This is very much like the earlier problems, but with a continuous distribution.\n", + "So for example $P(x,y)=(1/2)P(x,N(1,1))P(y,N(-1,1))+(1/2)P(x,N(-1,1))P(y,N(1,1))$, where $P(x,N(\\mu,1))$ denotes the $N(\\mu,1)$ density at $x$, and\n", + "$P(x,x)=P(x,N(1,1))P(x,N(-1,1))$. Swapping $x$ and $y$ leaves $P(x,y)$ unchanged, so the parameters are exchangeable.\n", + "\n", + "However, in a mixture distribution there's no way to make sense of the requirement that the parameters are clustered into two groups. 
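If we had a mixture of independent and identically distributed components, then \n", + "$$P(x,y)=\\int P((x,y)|\\theta)p(\\theta) d\\theta=\\int P(x|\\theta)P(y|\\theta)p(\\theta)d\\theta$$\n\nUnder such a mixture the covariance of $x$ and $y$ equals the variance of $E(x|\\theta)$, which cannot be negative. Here, with $J=1$, $E(x)=E(y)=0$ and $E(xy)=(1)(-1)=-1$, so the covariance is $-1$ and no such mixture can exist." + ] + } + ], + "metadata": { + "kernelspec": { + "display_name": "Python 3", + "language": "python", + "name": "python3" + }, + "language_info": { + "codemirror_mode": { + "name": "ipython", + "version": 3 + }, + "file_extension": ".py", + "mimetype": "text/x-python", + "name": "python", + "nbconvert_exporter": "python", + "pygments_lexer": "ipython3", + "version": "3.6.4" + } + }, + "nbformat": 4, + "nbformat_minor": 2 +}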