1 00:00:00,940 --> 00:00:02,390 Now, conditional probability will 2 00:00:02,390 --> 00:00:06,260 let us explain a lot of the confused arguments 3 00:00:06,260 --> 00:00:09,100 that people brought up about Monty Hall. 4 00:00:09,100 --> 00:00:12,120 And we'll see that it is a little bit confusing 5 00:00:12,120 --> 00:00:15,540 and where there is some correct sounding arguments that 6 00:00:15,540 --> 00:00:17,250 give you the wrong answer. 7 00:00:17,250 --> 00:00:20,190 So let's go back and look at our Monty Hall 8 00:00:20,190 --> 00:00:22,785 tree that allowed us to derive the sample space 9 00:00:22,785 --> 00:00:27,400 and probability space for the whole process of the prize 10 00:00:27,400 --> 00:00:29,600 being placed and the contest picking a door 11 00:00:29,600 --> 00:00:31,660 and Carol opening a door. 12 00:00:31,660 --> 00:00:34,030 Now, this tree was way more complicated 13 00:00:34,030 --> 00:00:36,030 than we needed if all we were trying to do 14 00:00:36,030 --> 00:00:38,760 was figure out the probability of winning if you switch. 15 00:00:38,760 --> 00:00:40,474 But having the tree will allow us 16 00:00:40,474 --> 00:00:42,140 to discuss a whole bunch of other events 17 00:00:42,140 --> 00:00:43,890 in their probabilities that will get 18 00:00:43,890 --> 00:00:46,310 us a grip on some of the arguments that 19 00:00:46,310 --> 00:00:47,950 gave the wrong answer. 20 00:00:47,950 --> 00:00:49,650 So let's look at the event, first of all 21 00:00:49,650 --> 00:00:51,150 that the goat is at 2. 22 00:00:51,150 --> 00:00:53,630 Now, this is the branch where the prize is at 2. 23 00:00:53,630 --> 00:00:57,120 And so in all the other branches the goat is at 2, 24 00:00:57,120 --> 00:01:07,690 l which means that we have these eight of the 12 outcomes 25 00:01:07,690 --> 00:01:09,680 in the event-- goat is at 2. 26 00:01:09,680 --> 00:01:13,620 Now, let's also look at the event that the prize is at 1. 27 00:01:13,620 --> 00:01:17,450 That's just this branch of the tree, OK? 28 00:01:17,450 --> 00:01:22,110 So one of the arguments is that when the contestant is 29 00:01:22,110 --> 00:01:27,880 at the point where they've seen that the open door and they're 30 00:01:27,880 --> 00:01:31,750 trying to decide whether to stick or switch, 31 00:01:31,750 --> 00:01:34,270 they know that the goat is at the door 2. 32 00:01:34,270 --> 00:01:36,550 Say without loss of generality that that 33 00:01:36,550 --> 00:01:38,870 was the door that they got to look at behind, 34 00:01:38,870 --> 00:01:40,330 that Carol opened. 35 00:01:40,330 --> 00:01:43,790 And so we want to ask the probability, given 36 00:01:43,790 --> 00:01:48,410 that he picked 1, what's the probability that the prize is 37 00:01:48,410 --> 00:01:50,750 at 1 given that the goat is at 2? 38 00:01:50,750 --> 00:01:53,880 That means that if you're at door 1 39 00:01:53,880 --> 00:01:59,640 then you should stick if that probability is high 40 00:01:59,640 --> 00:02:01,460 and otherwise you shouldn't stick. 41 00:02:01,460 --> 00:02:06,500 So we can look at this event, the prize at 1 given the goat 42 00:02:06,500 --> 00:02:09,900 at 2, and what we can see is that it's taking up 43 00:02:09,900 --> 00:02:13,460 exactly half of the outcomes for goat at 2 44 00:02:13,460 --> 00:02:17,130 and the same kind of outcomes-- red ones and green ones. 45 00:02:17,130 --> 00:02:19,590 The red ones are worth an 1/18 and the green ones 46 00:02:19,590 --> 00:02:22,470 are worth a 1/9 in probability, and that 47 00:02:22,470 --> 00:02:26,250 implies that the probability that the prize is at 1 given 48 00:02:26,250 --> 00:02:28,640 that the goat is at 2 is 1/2. 49 00:02:28,640 --> 00:02:30,610 It really is. 50 00:02:30,610 --> 00:02:33,020 And that's the argument that people were saying. 51 00:02:33,020 --> 00:02:35,030 They said, look, when the contestant sees 52 00:02:35,030 --> 00:02:38,660 that the goat is at door 2, and they're 53 00:02:38,660 --> 00:02:40,880 trying to decide whether the goat-- the prize is 54 00:02:40,880 --> 00:02:44,350 at the door-- is it door 1 or at the other door, 55 00:02:44,350 --> 00:02:45,530 and it's equally likely. 56 00:02:45,530 --> 00:02:47,780 And so it doesn't matter whether they stick or switch. 57 00:02:47,780 --> 00:02:50,600 That's a correct argument but it's not 58 00:02:50,600 --> 00:02:55,840 calculating the probability of the stick strategy winning. 59 00:02:55,840 --> 00:02:56,810 Why? 60 00:02:56,810 --> 00:03:01,580 Well, because there's more information that's 61 00:03:01,580 --> 00:03:04,850 available than goat is at 2. 62 00:03:04,850 --> 00:03:06,765 The contestant not only knows that the goat 63 00:03:06,765 --> 00:03:08,850 is at 2 and trying to figure out the probability 64 00:03:08,850 --> 00:03:11,110 that the prize is at 1, but the contestant 65 00:03:11,110 --> 00:03:12,990 knows what door he picked. 66 00:03:12,990 --> 00:03:16,180 So let's suppose that the contestant did pick door 1 67 00:03:16,180 --> 00:03:18,800 and learned that the goat was at door 2, that's 68 00:03:18,800 --> 00:03:20,030 a different event. 69 00:03:20,030 --> 00:03:23,590 If the blue one is marked off at the places where 70 00:03:23,590 --> 00:03:26,550 the contestant picks one, this is 71 00:03:26,550 --> 00:03:31,710 where the door is picked-- is 1 and here's 1 and here's 1. 72 00:03:31,710 --> 00:03:36,180 This 1 splits into one event, this 1 splits into one event, 73 00:03:36,180 --> 00:03:39,610 but this choice of 1 splits into two outcomes. 74 00:03:39,610 --> 00:03:44,360 And so when we look at the event that both the goat is at 2 75 00:03:44,360 --> 00:03:46,230 and the contestant picked 1, which 76 00:03:46,230 --> 00:03:48,970 is what the contest really knows when they get 77 00:03:48,970 --> 00:03:51,050 to see that there's a goat at door 2, 78 00:03:51,050 --> 00:03:56,740 we wind up with the overlap of just three outcomes. 79 00:03:56,740 --> 00:04:00,870 Two outcomes that have probability 1/8 and one outcome 80 00:04:00,870 --> 00:04:05,730 that has probability a 1/9. 81 00:04:05,730 --> 00:04:06,880 It's just those three. 82 00:04:06,880 --> 00:04:09,670 And the result is that the probability that the prize is 83 00:04:09,670 --> 00:04:12,460 at 1 given that you picked 1 and the goat 84 00:04:12,460 --> 00:04:19,380 is at 2-- so this is the event-- goat at 2 85 00:04:19,380 --> 00:04:21,730 and picked 1, these three outcomes. 86 00:04:21,730 --> 00:04:25,950 The prize is at 1 is these two outcomes, which 87 00:04:25,950 --> 00:04:28,490 are each worth an 1/18 and this is 88 00:04:28,490 --> 00:04:29,800 an outcome that's worth a 1/9. 89 00:04:29,800 --> 00:04:33,630 So the prize at 1 outcomes amount 90 00:04:33,630 --> 00:04:37,080 to 1/2 of the total probability of this event, goat at 2 91 00:04:37,080 --> 00:04:37,740 picked at 1. 92 00:04:37,740 --> 00:04:40,080 So, again, the probability that the prize 93 00:04:40,080 --> 00:04:43,630 is at 1 given that the contestant picked 1 and saw 94 00:04:43,630 --> 00:04:47,430 the goat at 2 is a 1/2 also. 95 00:04:47,430 --> 00:04:49,460 That's confusing. 96 00:04:49,460 --> 00:04:52,420 So it seems as though the contestant may as well stick 97 00:04:52,420 --> 00:04:56,440 because at the point that he has to decide whether to stick 98 00:04:56,440 --> 00:04:59,060 or switch, and he knows where-- he sees where the goat is 99 00:04:59,060 --> 00:05:01,210 and he knows what door he's picked, 100 00:05:01,210 --> 00:05:03,610 it's 50-50 whether he should stick or switch. 101 00:05:03,610 --> 00:05:07,520 The probability that the prize is at door 1 that he picked 102 00:05:07,520 --> 00:05:10,740 is a 1/2, so it really doesn't matter if he stays there 103 00:05:10,740 --> 00:05:15,240 or if he decides to switch to the unopened door. 104 00:05:15,240 --> 00:05:18,020 But wait a minute, that's not right 105 00:05:18,020 --> 00:05:21,980 because the contestant not only knows what door he picked, 106 00:05:21,980 --> 00:05:27,010 not only knows that there's a goat behind a given door 107 00:05:27,010 --> 00:05:30,330 that Carol has opened, but he knows that Carol 108 00:05:30,330 --> 00:05:32,020 has opened that door. 109 00:05:32,020 --> 00:05:34,560 That's how he got to know that the goat was there. 110 00:05:34,560 --> 00:05:36,990 So let's go back and look at the tree. 111 00:05:36,990 --> 00:05:39,180 What basically the previous two arguments 112 00:05:39,180 --> 00:05:41,614 are conditioning on the wrong events. 113 00:05:41,614 --> 00:05:43,530 It's a typical mistake and one that you really 114 00:05:43,530 --> 00:05:45,360 have to watch out for. 115 00:05:45,360 --> 00:05:48,300 So if you use the correct event, what we're looking at 116 00:05:48,300 --> 00:05:51,910 is the contestant knows that they've picked door 1. 117 00:05:51,910 --> 00:05:56,410 That's the outcomes of picked door 1 are marked here. 118 00:05:56,410 --> 00:05:59,710 In addition, the contestant will get 119 00:05:59,710 --> 00:06:01,515 to know, for example, in a play of the game 120 00:06:01,515 --> 00:06:04,260 that Carol has opened door 2. 121 00:06:04,260 --> 00:06:07,390 Carol opening door 2 is quite a different event 122 00:06:07,390 --> 00:06:09,250 from the goat being at 2. 123 00:06:09,250 --> 00:06:12,540 This is a picture of the outcomes in Carol 124 00:06:12,540 --> 00:06:16,020 opening door 2, and we're interested in the intersection 125 00:06:16,020 --> 00:06:16,650 of them. 126 00:06:16,650 --> 00:06:18,740 That is, just this guy that's in both 127 00:06:18,740 --> 00:06:21,070 and this guy that's in both. 128 00:06:21,070 --> 00:06:22,640 There they are. 129 00:06:22,640 --> 00:06:25,255 And so what we can do is identify 130 00:06:25,255 --> 00:06:29,030 that the event that you picked 1 and that Carol opened 131 00:06:29,030 --> 00:06:34,250 door 2 consists simply of two outcomes-- one worth an 1/18 132 00:06:34,250 --> 00:06:36,540 and one worth a 1/9. 133 00:06:36,540 --> 00:06:40,150 Now, of these two outcomes, which one has the prize at 1? 134 00:06:40,150 --> 00:06:41,070 Well, only that one. 135 00:06:41,070 --> 00:06:43,750 Remember the first component here is where the prize is. 136 00:06:43,750 --> 00:06:49,920 And so the prize at 1 event among the given 137 00:06:49,920 --> 00:06:54,040 picked 1 and opened 2 is just this red outcome. 138 00:06:54,040 --> 00:06:57,730 Now, the red outcome has probability 139 00:06:57,730 --> 00:07:01,050 1/18 and the green outcome has probability 140 00:07:01,050 --> 00:07:03,400 that's twice as much. 141 00:07:03,400 --> 00:07:08,210 So that means that relative to this event, the probability 142 00:07:08,210 --> 00:07:12,680 that the prize is at 1 given that you picked 1 and opened 2 143 00:07:12,680 --> 00:07:19,200 is actually 1/18 over 1/18 plus 1/9, or 1/3. 144 00:07:19,200 --> 00:07:23,890 So given that you picked 1 and you get to see what Carrol did, 145 00:07:23,890 --> 00:07:26,780 the probability the prize is at the door you picked 146 00:07:26,780 --> 00:07:31,250 is only 1/3, which means that if you stick you only 147 00:07:31,250 --> 00:07:33,450 have a 1/3 chance of winning. 148 00:07:33,450 --> 00:07:36,200 You should switch. 149 00:07:36,200 --> 00:07:40,310 And if you do, you'll have a 2/3 probability of winning. 150 00:07:40,310 --> 00:07:43,740 So when we finally condition on everything that we know, 151 00:07:43,740 --> 00:07:46,490 which is the contestant knows what door he picked 152 00:07:46,490 --> 00:07:48,770 and what door Carol opened, then we 153 00:07:48,770 --> 00:07:52,580 discover that it correctly-- as we deduced previously-- 154 00:07:52,580 --> 00:07:55,530 that the probability of switching wins is 2/3. 155 00:07:55,530 --> 00:07:57,890 So we're not trying to rederive the fact 156 00:07:57,890 --> 00:08:00,370 that the probability of switching wins is 2/3. 157 00:08:00,370 --> 00:08:03,160 We're trying to illustrate a very basic blunder that you 158 00:08:03,160 --> 00:08:05,970 have to watch out for, which is when you're trying to reason 159 00:08:05,970 --> 00:08:09,150 about some situation and you condition on some event 160 00:08:09,150 --> 00:08:12,520 that you think summarizes what's going on, 161 00:08:12,520 --> 00:08:14,800 if you don't get the conditioning event right, 162 00:08:14,800 --> 00:08:17,540 you're going to get the wrong answer. 163 00:08:17,540 --> 00:08:20,680 So it's easy to see how many people got confused, 164 00:08:20,680 --> 00:08:23,550 and, in fact, finding the right event can be tricky. 165 00:08:23,550 --> 00:08:27,560 When in doubt, the 4 step method with constructing 166 00:08:27,560 --> 00:08:29,460 the tree where you're not even thinking 167 00:08:29,460 --> 00:08:33,130 about conditional probabilities but you're just examining 168 00:08:33,130 --> 00:08:36,330 the individual outcomes is a good fall back 169 00:08:36,330 --> 00:08:39,750 to avoid these kinds of confusing situations.