1 00:00:02,040 --> 00:00:05,440 We will now consider an example that illustrates the 2 00:00:05,440 --> 00:00:09,030 difference between the notion of independence of a 3 00:00:09,030 --> 00:00:12,990 collection of events and the notion of pairwise 4 00:00:12,990 --> 00:00:16,460 independence within that collection. 5 00:00:16,460 --> 00:00:18,290 The model is simple. 6 00:00:18,290 --> 00:00:22,700 We have a fair coin which we flip twice. 7 00:00:22,700 --> 00:00:25,460 So at each flip, there is probability 1/2 8 00:00:25,460 --> 00:00:28,950 of obtaining heads. 9 00:00:28,950 --> 00:00:32,340 Furthermore, we assume that the two flips are independent 10 00:00:32,340 --> 00:00:33,770 of each other. 11 00:00:33,770 --> 00:00:38,760 Let H1 be the event that the first coin toss resulted in 12 00:00:38,760 --> 00:00:43,655 heads, which corresponds to this event in this diagram. 13 00:00:46,370 --> 00:00:51,700 Let H2 be the event that the second toss resulted in heads, 14 00:00:51,700 --> 00:00:56,570 which is this event in the diagram-- 15 00:00:56,570 --> 00:01:02,245 the two ways that we can have the second toss being heads. 16 00:01:04,800 --> 00:01:08,590 Now, we're assuming that the tosses are independent. 17 00:01:08,590 --> 00:01:13,500 So the event heads-heads has a probability which is equal to 18 00:01:13,500 --> 00:01:16,539 the probability that the first toss resulted in heads-- 19 00:01:16,539 --> 00:01:17,390 that's 1/2-- 20 00:01:17,390 --> 00:01:20,200 times the probability that the second toss resulted in heads, 21 00:01:20,200 --> 00:01:21,460 which is 1/2. 22 00:01:21,460 --> 00:01:23,730 So the product is 1/4. 23 00:01:23,730 --> 00:01:26,880 We have probability 1/4 for this outcome. 24 00:01:26,880 --> 00:01:31,430 Now, the total probability of event H1 is 1/2, which means 25 00:01:31,430 --> 00:01:36,560 that the probability of what remains should be 1/4, so that 26 00:01:36,560 --> 00:01:39,320 the sum of these two numbers is 1/2. 27 00:01:39,320 --> 00:01:42,270 By the same argument, the probability of this outcome, 28 00:01:42,270 --> 00:01:45,880 tails-heads , should be 1/4. 29 00:01:45,880 --> 00:01:47,940 We have a total of 3/4. 30 00:01:47,940 --> 00:01:50,070 So what's left is 1/4. 31 00:01:50,070 --> 00:01:53,080 And that's going to be the probability of the outcome 32 00:01:53,080 --> 00:01:55,500 tails-tails . 33 00:01:55,500 --> 00:02:01,470 Let us now introduce a new event, namely the event that 34 00:02:01,470 --> 00:02:05,040 the two tosses had the same result. 35 00:02:05,040 --> 00:02:10,728 So this is the event that we obtain either heads heads or 36 00:02:10,728 --> 00:02:11,978 tails-tails. 37 00:02:14,210 --> 00:02:18,560 Schematically, event C corresponds to this blue 38 00:02:18,560 --> 00:02:21,290 region in the diagram. 39 00:02:21,290 --> 00:02:26,760 Is this event C independent from the events H1 and H2? 40 00:02:26,760 --> 00:02:30,520 Let us first look for pairwise independence. 41 00:02:30,520 --> 00:02:34,440 Let's look at the probability that H1 occurs 42 00:02:34,440 --> 00:02:37,150 and C occurs as well. 43 00:02:37,150 --> 00:02:39,820 So the first toss resulted in heads. 44 00:02:39,820 --> 00:02:43,060 And the two tosses had the same result. 45 00:02:43,060 --> 00:02:47,200 So this is the same as the probability of obtaining heads 46 00:02:47,200 --> 00:02:48,620 followed by heads. 47 00:02:51,340 --> 00:02:54,780 And this corresponds to just this outcome that has 48 00:02:54,780 --> 00:02:56,090 probability 1/4. 49 00:02:59,930 --> 00:03:07,250 How about the product of the probabilities of H1 and of C? 50 00:03:07,250 --> 00:03:09,242 Is it the same? 51 00:03:09,242 --> 00:03:12,290 Well, the probability of H1 is 1/2. 52 00:03:15,240 --> 00:03:16,900 And the probability of C-- 53 00:03:16,900 --> 00:03:17,850 what is it? 54 00:03:17,850 --> 00:03:21,450 Event C consists of two outcomes. 55 00:03:21,450 --> 00:03:24,030 Each one of these outcomes has probability 1/4. 56 00:03:24,030 --> 00:03:27,550 So the total is, again, 1/2. 57 00:03:27,550 --> 00:03:32,430 And therefore, the product of these probabilities is 1/4. 58 00:03:32,430 --> 00:03:34,829 So we notice that the probability of the two events 59 00:03:34,829 --> 00:03:38,020 happening is the same as the product of their individual 60 00:03:38,020 --> 00:03:42,370 probabilities, and therefore, H1 and C 61 00:03:42,370 --> 00:03:44,065 are independent events. 62 00:03:46,610 --> 00:03:52,500 By the same argument, H2 and C are going to be independent. 63 00:03:52,500 --> 00:03:55,690 It's a symmetrical situation. 64 00:03:55,690 --> 00:03:59,940 H1 and H2 are also independent from each other. 65 00:03:59,940 --> 00:04:02,740 So we have all of the conditions for pairwise 66 00:04:02,740 --> 00:04:04,470 independence. 67 00:04:04,470 --> 00:04:08,710 Let us now check whether we have independence. 68 00:04:08,710 --> 00:04:12,730 To check for independence, we need to also look into the 69 00:04:12,730 --> 00:04:16,329 probability of all three events happening and see 70 00:04:16,329 --> 00:04:19,070 whether it is equal to the product of the individual 71 00:04:19,070 --> 00:04:20,420 probabilities. 72 00:04:20,420 --> 00:04:23,770 So the probability of all three events happening-- 73 00:04:23,770 --> 00:04:27,450 this is the probability that H1 occurs and H2 74 00:04:27,450 --> 00:04:30,750 occurs and C occurs. 75 00:04:30,750 --> 00:04:34,260 What is this event? 76 00:04:34,260 --> 00:04:37,460 Heads in the first toss, heads in the second toss, and the 77 00:04:37,460 --> 00:04:39,770 two tosses are the same-- 78 00:04:39,770 --> 00:04:44,280 this happens if and only if the outcome is 79 00:04:44,280 --> 00:04:46,780 heads followed by heads. 80 00:04:46,780 --> 00:04:51,060 And this has probability 1/4. 81 00:04:51,060 --> 00:04:54,480 On the other hand, if we calculate the probability of 82 00:04:54,480 --> 00:05:01,850 H1 times the probability of H2 times the probability of C, we 83 00:05:01,850 --> 00:05:07,490 get 1/2 times 1/2 times 1/2, which is 1/8. 84 00:05:07,490 --> 00:05:10,900 These two numbers are different. 85 00:05:10,900 --> 00:05:15,020 And therefore, one of the conditions that we had for 86 00:05:15,020 --> 00:05:18,340 independence is violated. 87 00:05:18,340 --> 00:05:23,800 So in this example, H1, H2, and C are pairwise 88 00:05:23,800 --> 00:05:29,040 independent, but they're not independent in the sense of an 89 00:05:29,040 --> 00:05:32,330 independent collection of events. 90 00:05:32,330 --> 00:05:34,345 How are we to understand this intuitively? 91 00:05:37,800 --> 00:05:46,520 If I tell you that event H1 occurred and I ask you for the 92 00:05:46,520 --> 00:05:49,940 conditional probability of C given that H1 93 00:05:49,940 --> 00:05:53,050 [occurred], what is this? 94 00:05:53,050 --> 00:05:57,360 This is the probability that we obtain, given that the 95 00:05:57,360 --> 00:06:01,790 first event is heads, the first result is heads, the 96 00:06:01,790 --> 00:06:05,030 only way that you can have the two tosses having the same 97 00:06:05,030 --> 00:06:08,970 result is going to be in the second toss also 98 00:06:08,970 --> 00:06:12,020 resulting in heads. 99 00:06:12,020 --> 00:06:15,680 And since H2 and H1 are independent, this is just the 100 00:06:15,680 --> 00:06:19,340 probability that we have heads in the second toss. 101 00:06:19,340 --> 00:06:22,110 And this number is 1/2. 102 00:06:22,110 --> 00:06:26,720 And 1/2 is also the same as the probability of C. That's 103 00:06:26,720 --> 00:06:31,380 another way of understanding the independence of H1 and C. 104 00:06:31,380 --> 00:06:36,510 Given that the first toss resulted in heads, this does 105 00:06:36,510 --> 00:06:41,580 not help you in any way in guessing whether the two 106 00:06:41,580 --> 00:06:45,750 tosses will have the same result or not. 107 00:06:45,750 --> 00:06:49,290 The first one was heads, but the second one could be either 108 00:06:49,290 --> 00:06:52,870 heads or tails with equal probability. 109 00:06:52,870 --> 00:06:57,670 So event H1 does not carry any useful information about the 110 00:06:57,670 --> 00:07:04,210 occurrence or non-occurrence of event C. On the other hand, 111 00:07:04,210 --> 00:07:12,210 if I were to tell you that both events, H1 and H2, 112 00:07:12,210 --> 00:07:14,560 happened, what would the conditional 113 00:07:14,560 --> 00:07:17,560 probability of C be? 114 00:07:17,560 --> 00:07:21,250 If both H1 and H2 occurred, then the results of the two 115 00:07:21,250 --> 00:07:27,400 coin tosses were identical, so you know that C also occurred. 116 00:07:27,400 --> 00:07:31,540 So this probability is equal to 1. 117 00:07:31,540 --> 00:07:36,000 And this number, 1, is different from the 118 00:07:36,000 --> 00:07:41,040 unconditional probability of C, which is 1/2. 119 00:07:41,040 --> 00:07:46,190 So we have here a situation where knowledge of H1 having 120 00:07:46,190 --> 00:07:51,409 occurred does not help you in making a better guess on 121 00:07:51,409 --> 00:07:53,980 whether C is going to occur. 122 00:07:53,980 --> 00:07:57,490 H1 by itself does not carry any useful information. 123 00:07:57,490 --> 00:08:02,320 But the two events together, H1 and H2, do carry useful 124 00:08:02,320 --> 00:08:03,630 information about C. 125 00:08:03,630 --> 00:08:06,360 Once you know that H1 and H2 occurred, then C 126 00:08:06,360 --> 00:08:08,040 is certain to occur. 127 00:08:08,040 --> 00:08:12,430 So your original probability for C, which was 1/2, now gets 128 00:08:12,430 --> 00:08:14,610 revised to a value of 1. 129 00:08:14,610 --> 00:08:19,720 So H1 and H2 carry information relevant to C. And therefore, 130 00:08:19,720 --> 00:08:25,290 C is not independent from these two events collectively. 131 00:08:25,290 --> 00:08:28,810 And we say that events H1. 132 00:08:28,810 --> 00:08:31,420 H2, and C are not independent.