1 00:00:00,000 --> 00:00:05,000 So our next class of biomolecule that we're going to talk about are 2 00:00:05,000 --> 00:00:10,000 nucleic acids. And we can, for the most part, 3 00:00:10,000 --> 00:00:15,000 describe their properties by considering just covalent bonds and 4 00:00:15,000 --> 00:00:20,000 hydrogen bonds. Although, that's a bit of an 5 00:00:20,000 --> 00:00:25,000 oversimplification. But, anyway, these are, 6 00:00:25,000 --> 00:00:32,000 again, polymers. So this is DNA and RNA, 7 00:00:32,000 --> 00:00:40,000 terms you've undoubtedly heard. And these are made by splitting out 8 00:00:40,000 --> 00:00:48,000 water. And, in this case, the monomeric units are given the 9 00:00:48,000 --> 00:00:56,000 special term nucleotide. And a nucleotide consists of a 10 00:00:56,000 --> 00:01:05,000 sugar with something called a base on it. 11 00:01:05,000 --> 00:01:12,000 It's got a phosphate group at one end and a hydroxyl group, 12 00:01:12,000 --> 00:01:19,000 one of the sugar hydroxyls that we saw the other day at the other end. 13 00:01:19,000 --> 00:01:30,000 The B stands for base. 14 00:01:30,000 --> 00:01:34,000 And the way the bond is formed, as I said, is by splitting out water 15 00:01:34,000 --> 00:01:39,000 like that to form what's known as a phosphodiester bond. 16 00:01:39,000 --> 00:01:43,000 And we'll be talking a lot about those when we talk about DNA and RNA 17 00:01:43,000 --> 00:01:48,000 in more detail later on in the course. The sugars are pentoses 18 00:01:48,000 --> 00:01:53,000 where N equals 5. We were talking about these the 19 00:01:53,000 --> 00:01:57,000 other day. The base goes in this position. That's the 1 position 20 00:01:57,000 --> 00:02:03,000 of the carbon. This is the 5 position of the carbon. 21 00:02:03,000 --> 00:02:10,000 And this is where the phosphate is located. This sugar is called 22 00:02:10,000 --> 00:02:17,000 ribose. And then RNA, which is the polymer of nucleotides 23 00:02:17,000 --> 00:02:24,000 that have ribose as the sugar is ribonucleic acid or RNA, 24 00:02:24,000 --> 00:02:31,000 as you've known it. If this hydroxyl here is replaced by a 25 00:02:31,000 --> 00:02:38,000 hydrogen and the rest of it's the same -- 26 00:02:38,000 --> 00:02:47,000 -- this is deoxyribonucleoside. 27 00:02:47,000 --> 00:02:55,000 And if you polymerize that together then you get DNA or deoxyribonucleic 28 00:02:55,000 --> 00:03:01,000 acid. The bases come in two flavors. 29 00:03:01,000 --> 00:03:05,000 And this will be on your handout. Ones that either have two rings, 30 00:03:05,000 --> 00:03:10,000 adenine or guanine. And the general term of those is 31 00:03:10,000 --> 00:03:14,000 purine or they have one ring of pyrimidine. And in DNA one finds 32 00:03:14,000 --> 00:03:19,000 cytidine and thiamine abbreviated as C and T. Or in RNA, 33 00:03:19,000 --> 00:03:24,000 instead of finding thiamine you find uracil, which is the same except 34 00:03:24,000 --> 00:03:28,000 that it doesn't have the methyl group that's present at this 35 00:03:28,000 --> 00:03:33,000 position on thiamine. And the important thing about these 36 00:03:33,000 --> 00:03:38,000 particular nucleotide bases is that they can form hydrogen bonds in a 37 00:03:38,000 --> 00:03:43,000 very special way. It's diagramed on here that this is 38 00:03:43,000 --> 00:03:48,000 a guanine pairing with the cytidine, so a G pairing with a C. And you 39 00:03:48,000 --> 00:03:53,000 can form three hydrogen bonds. Or between an A, an adenine and a 40 00:03:53,000 --> 00:03:59,000 thiamine you can form two hydrogen bonds. 41 00:03:59,000 --> 00:04:03,000 Those are just the things we were diagramming on the board the other 42 00:04:03,000 --> 00:04:08,000 day. And those are the forces that hold the strands of DNA together so 43 00:04:08,000 --> 00:04:12,000 that DNA is the double helix, as you know. It's basically a 44 00:04:12,000 --> 00:04:17,000 backbone with sugars and phosphates. And then there'll be some sequence 45 00:04:17,000 --> 00:04:22,000 of bases down this. And then on the other strand you'll 46 00:04:22,000 --> 00:04:27,000 have the base that can form hydrogen bonds with this. 47 00:04:27,000 --> 00:04:30,000 So C, there would be three hydrogen bonds here. This would be a G on 48 00:04:30,000 --> 00:04:34,000 this side, again, three hydrogen bonds. 49 00:04:34,000 --> 00:04:38,000 If there's an A here that will be a T there, two hydrogen bonds. 50 00:04:38,000 --> 00:04:42,000 And so on down. And we'll talk about the implications of this later 51 00:04:42,000 --> 00:04:46,000 in the course when we talk about DNA replication, but for the moment I 52 00:04:46,000 --> 00:04:50,000 think your eye can see, you can probably see that the 53 00:04:50,000 --> 00:04:54,000 geometric arrangement of these is just exactly the same, 54 00:04:54,000 --> 00:04:58,000 whether it's a G-C or it's an A-T base pair. You can superimpose them, 55 00:04:58,000 --> 00:05:02,000 and they have just exactly the same molecular structure. 56 00:05:02,000 --> 00:05:05,000 And that's really crucial for a lot of things having to do with DNA. 57 00:05:05,000 --> 00:05:09,000 So, as you know, it's not just sort of a ladder with hydrogen bonds. 58 00:05:09,000 --> 00:05:13,000 It's twisted in 3-dimensional space. That's the double helix. 59 00:05:13,000 --> 00:05:16,000 And in that little movie I showed you the nitrogen atoms and the bases 60 00:05:16,000 --> 00:05:20,000 are blue so you can pretty much pick it out that there's a series of 61 00:05:20,000 --> 00:05:24,000 hydrogen bonds going right down the middle of a DNA molecule with the 62 00:05:24,000 --> 00:05:28,000 phosphoribose backbone on the outside. 63 00:05:28,000 --> 00:05:32,000 So every one of your cells, since you have about 3 billion base 64 00:05:32,000 --> 00:05:36,000 pairs you have two point something times that many hydrogen bonds 65 00:05:36,000 --> 00:05:40,000 holding your DNA together. The thing to remember about the 66 00:05:40,000 --> 00:05:45,000 strength of the hydrogen bond, it's about a twentieth a covalent 67 00:05:45,000 --> 00:05:49,000 bond, and so you're able to pull those things apart and then put them 68 00:05:49,000 --> 00:05:53,000 back together at physiological temperature while leaving all the 69 00:05:53,000 --> 00:05:57,000 covalent bonds that make up each strand of the DNA leaving 70 00:05:57,000 --> 00:06:02,000 those intact. It's also possible, 71 00:06:02,000 --> 00:06:06,000 since RNAs are usually single-stranded, 72 00:06:06,000 --> 00:06:10,000 that if you have a little sequence here that has the complimentary 73 00:06:10,000 --> 00:06:14,000 sequence over there then these can pair like this forming a hairpin or 74 00:06:14,000 --> 00:06:18,000 some little structure like that. And, again, we'll talk about 75 00:06:18,000 --> 00:06:22,000 transfer RNAs which play a really key role in protein synthesis. 76 00:06:22,000 --> 00:06:26,000 They're the little translators that go back and forth between the 77 00:06:26,000 --> 00:06:30,000 nucleic acid code, the genetic code and the protein 78 00:06:30,000 --> 00:06:35,000 code, which is written in amino acids. 79 00:06:35,000 --> 00:06:39,000 And this just shows making an RNA copy for a tRNA gene from the DNA, 80 00:06:39,000 --> 00:06:43,000 but then these are the relationships between the complimentary sequences 81 00:06:43,000 --> 00:06:47,000 right in that strand. So this thing is able to fold up 82 00:06:47,000 --> 00:06:51,000 into a sort of cloverleaf structure that some of you have certainly 83 00:06:51,000 --> 00:06:55,000 probably seen at some point. It's a little bit twisted here 84 00:06:55,000 --> 00:06:59,000 because you can see how the complimentary sequences 85 00:06:59,000 --> 00:07:02,000 have found each other. And even though this is just a 86 00:07:02,000 --> 00:07:06,000 single strand of RNA, by forming hydrogen bonds to 87 00:07:06,000 --> 00:07:09,000 complimentary sequences within itself it can take up a structure. 88 00:07:09,000 --> 00:07:12,000 And I'll show you. It actually goes on, there are some other forces 89 00:07:12,000 --> 00:07:16,000 that come in. And this will fold up into a 3-dimensional structure that 90 00:07:16,000 --> 00:07:19,000 goes even beyond what I've shown you, but we won't need to talk about that 91 00:07:19,000 --> 00:07:23,000 for just a little bit. OK. So then the next -- 92 00:07:23,000 --> 00:07:32,000 -- class of molecules that we're 93 00:07:32,000 --> 00:07:36,000 going to spend a lot of this course on are proteins. And these 94 00:07:36,000 --> 00:07:46,000 are polymers again. 95 00:07:46,000 --> 00:07:52,000 -- made by splitting out water. So that's been true of 96 00:07:52,000 --> 00:07:58,000 polysaccharides. It's true of nucleic acids. 97 00:07:58,000 --> 00:08:04,000 It's true of proteins. In this case the monomers are structures 98 00:08:04,000 --> 00:08:10,000 known as amino acids. And they have an amino group. 99 00:08:10,000 --> 00:08:17,000 And then it's joined to a carbon known as the alpha carbon. 100 00:08:17,000 --> 00:08:24,000 And then there's a carboxyl group. So this is why they're called amino 101 00:08:24,000 --> 00:08:32,000 acids, because their carboxyl group is an acid. 102 00:08:32,000 --> 00:08:39,000 And the way they form -- 103 00:08:39,000 --> 00:08:45,000 We'll give these different side chains here. I'll tell you about 104 00:08:45,000 --> 00:08:52,000 these side chains in just a minute. The way these form a bond is by 105 00:08:52,000 --> 00:08:59,000 splitting out water here. And then this will give this very 106 00:08:59,000 --> 00:09:12,000 important bond in nature -- 107 00:09:12,000 --> 00:09:16,000 -- which is known as the peptide bond. And there's a chemical 108 00:09:16,000 --> 00:09:21,000 property of this that's important. Someone was bemoaning the fact that 109 00:09:21,000 --> 00:09:25,000 I had to go over a bunch of chemistry and they hadn't liked 5. 110 00:09:25,000 --> 00:09:29,000 11. My apologies. But we won't be spending all course 111 00:09:29,000 --> 00:09:32,000 doing chemistry. But if you want to understand how 112 00:09:32,000 --> 00:09:36,000 these things work you do need to understand some of the chemical 113 00:09:36,000 --> 00:09:39,000 principles to understand them. And this is a case where it's 114 00:09:39,000 --> 00:09:42,000 really important because, although it's written this way with 115 00:09:42,000 --> 00:09:45,000 the double bond here and a single bond there, this double bond 116 00:09:45,000 --> 00:09:48,000 actually sort of spends part of its time over here. 117 00:09:48,000 --> 00:09:52,000 So this is actually sort of a partial double bond. 118 00:09:52,000 --> 00:09:55,000 And that has an important consequence because if you're a 119 00:09:55,000 --> 00:09:58,000 single bond, if you remember a single bond can bend and stretch but 120 00:09:58,000 --> 00:10:02,000 it can also rotate. But if you're a double bond you 121 00:10:02,000 --> 00:10:06,000 cannot rotate. So the peptide bond, 122 00:10:06,000 --> 00:10:10,000 and you make a lot of these when you're polymerizing amino acids 123 00:10:10,000 --> 00:10:14,000 together to make proteins, those bonds have a very special 124 00:10:14,000 --> 00:10:18,000 character that they cannot rotate. Now, let me say, I'll come back and 125 00:10:18,000 --> 00:10:22,000 show you why that's important in just a moment. 126 00:10:22,000 --> 00:10:26,000 But let me just say a word about the side chains. 127 00:10:26,000 --> 00:10:29,000 There are 20 different amino acids. And they have side chains that have 128 00:10:29,000 --> 00:10:33,000 very different chemical properties. And when we start thinking about how 129 00:10:33,000 --> 00:10:37,000 a chain of amino acids take up the properties that make it into an 130 00:10:37,000 --> 00:10:41,000 enzyme or part of a motor or a structural protein or into your 131 00:10:41,000 --> 00:10:44,000 finger nails or your hair or skin, they have to have very special 132 00:10:44,000 --> 00:10:48,000 properties. And it's the sequence of these different amino acids with 133 00:10:48,000 --> 00:10:52,000 their different chemical properties that are eventually going to let 134 00:10:52,000 --> 00:10:56,000 each protein form up to one particular 3-dimensional structure 135 00:10:56,000 --> 00:11:00,000 that will give it its characteristics. 136 00:11:00,000 --> 00:11:05,000 So the different types of amino acids, and again you won't have to 137 00:11:05,000 --> 00:11:10,000 memorize these, but here they are up here. 138 00:11:10,000 --> 00:11:15,000 But let me just point out the important classes, 139 00:11:15,000 --> 00:11:20,000 because the thing you really want to do with this one is to remember the 140 00:11:20,000 --> 00:11:25,000 types of amino acids we find. There are negatively charged amino 141 00:11:25,000 --> 00:11:28,000 acids. An example of this would be 142 00:11:28,000 --> 00:11:28,000 aspartate. Under physiological conditions, although this is an acid, it will dissociate so it will have a negative charge. And that's abbreviated as A-S-P. Glutamine is another one that has a negative charge. There are also amino acids that have positive charges on the side chain. 143 00:11:29,000 --> 00:11:39,000 An example of this would be lysine which has four methylene groups. 144 00:11:39,000 --> 00:11:50,000 And then it has an amino group. But, again, under physiological 145 00:11:50,000 --> 00:12:00,000 conditions, around pH 7, that will be protonated so it will 146 00:12:00,000 --> 00:12:11,000 have a plus charge. And that is lysine or L-Y-S. 147 00:12:11,000 --> 00:12:22,000 And arginine and histidine are two other amino acids that have a 148 00:12:22,000 --> 00:12:33,000 positive charge, or can have a positive charge. 149 00:12:33,000 --> 00:12:38,000 Then there's a set of amino acids that have a polar character. 150 00:12:38,000 --> 00:12:43,000 They don't have a full charge. And, as you might guess, they have 151 00:12:43,000 --> 00:12:48,000 one of the bonds that we've talked about that are polar. 152 00:12:48,000 --> 00:12:56,000 This is serine. 153 00:12:56,000 --> 00:12:59,000 Serine. There's another one that has a hydroxyl that's 154 00:12:59,000 --> 00:13:02,000 known as threonine. And then there is a glutamine and 155 00:13:02,000 --> 00:13:06,000 asparagine, both of which have an N-H bond. So just through what I've 156 00:13:06,000 --> 00:13:09,000 told you here, we haven't even been through the set, 157 00:13:09,000 --> 00:13:13,000 you can see how you can begin to decorate an amino acid chain. 158 00:13:13,000 --> 00:13:16,000 So there's a plus charge and a minus charge, a polar charge. 159 00:13:16,000 --> 00:13:20,000 There's a tremendous amount of diversity because at every single 160 00:13:20,000 --> 00:13:23,000 thing you have a choice of 20 things you can put in. 161 00:13:23,000 --> 00:13:27,000 So they not only have size and shape characteristics, 162 00:13:27,000 --> 00:13:31,000 but they have particular charges and other properties. 163 00:13:31,000 --> 00:13:39,000 Then, as always, there are a bunch of special, 164 00:13:39,000 --> 00:13:47,000 oh, excuse me. Actually, before we do that, we have hydrophobic. 165 00:13:47,000 --> 00:13:52,000 Or you could think of these as 166 00:13:52,000 --> 00:13:56,000 greasy or water-hating. These are the ones that are sort of 167 00:13:56,000 --> 00:14:00,000 when I was talking about trying to dissolve butter into water. 168 00:14:00,000 --> 00:14:05,000 These are things that don't like to interact with water or cannot 169 00:14:05,000 --> 00:14:10,000 interact with water, and so they cannot form hydrogen 170 00:14:10,000 --> 00:14:15,000 bonds so you cannot get them to go under water easily. 171 00:14:15,000 --> 00:14:20,000 And they come from very simple ones that have just the methyl group 172 00:14:20,000 --> 00:14:25,000 which is alanine or A-L-A or one like this which would be CH2-CH with 173 00:14:25,000 --> 00:14:30,000 a couple of methyl groups. This is even more water-hating, 174 00:14:30,000 --> 00:14:34,000 that would be lucien, L-E-U. Or here's one that you probably 175 00:14:34,000 --> 00:14:36,000 could guess that really doesn't interact with water. 176 00:14:36,000 --> 00:14:39,000 This is phenylalanine or P-H-E. And you can see what this side 177 00:14:39,000 --> 00:14:42,000 chain is. It's a methylene group. And what's dangling off it but a 178 00:14:42,000 --> 00:14:44,000 benzene ring. And I think most of you remember from probably beginning 179 00:14:44,000 --> 00:14:47,000 chemistry that benzene is something that you cannot dissolve 180 00:14:47,000 --> 00:14:54,000 sugar in or something. It's an organic solvent. 181 00:14:54,000 --> 00:15:04,000 It will only dissolve things that have a very hydrophobic character. 182 00:15:04,000 --> 00:15:17,000 Then there are some special cases. 183 00:15:17,000 --> 00:15:21,000 Glycine is one, because in this case the side chain 184 00:15:21,000 --> 00:15:25,000 is simply a hydrogen atom. And, as a consequence to that, 185 00:15:25,000 --> 00:15:29,000 this is a very flexible amino acid. So if you want to build -- 186 00:15:29,000 --> 00:15:35,000 If nature wants to build a loop into a protein, it's going to undergo a 187 00:15:35,000 --> 00:15:41,000 tight turn. You often find glycines there because there's not a big side 188 00:15:41,000 --> 00:15:47,000 chain to get in the way if you're going to be bending the chain in 189 00:15:47,000 --> 00:15:53,000 3-dimensional space. Another one is cysteine, 190 00:15:53,000 --> 00:15:59,000 which looks like serine over there, but it has a thiol group instead of 191 00:15:59,000 --> 00:16:04,000 a hydroxyl group. And that's important because that 192 00:16:04,000 --> 00:16:09,000 allows for the formation of another special type of bond that if you 193 00:16:09,000 --> 00:16:15,000 have one chain of protein that has a cysteine on it and another 194 00:16:15,000 --> 00:16:20,000 polypeptide chain that has a cysteine on it and they're close 195 00:16:20,000 --> 00:16:25,000 together in space, what can happen is you can form a 196 00:16:25,000 --> 00:16:31,000 covalent bond between these under oxidative conditions. 197 00:16:31,000 --> 00:16:36,000 This is known as a disulfide bond. That's covalent. 198 00:16:36,000 --> 00:16:40,000 So those two chains, if that bond occurs, are now sort of 199 00:16:40,000 --> 00:16:44,000 semi-permanently locked together. They're locked together in a very, 200 00:16:44,000 --> 00:16:48,000 very strong way. So this is a feature of, this is the only 201 00:16:48,000 --> 00:16:53,000 intrastrand covalent bond that you'd characteristically find in proteins. 202 00:16:53,000 --> 00:16:57,000 All the rest of them we're going to show you, when they fold up in 203 00:16:57,000 --> 00:17:01,000 3-dimensional space, depend on other kinds of 204 00:17:01,000 --> 00:17:06,000 interactions. And finally there's one last case 205 00:17:06,000 --> 00:17:12,000 which is proline. And this one is a little different 206 00:17:12,000 --> 00:17:17,000 because in the amino acid the side chain bends around like this and 207 00:17:17,000 --> 00:17:23,000 joins here. So it's actually forming a little circle here between 208 00:17:23,000 --> 00:17:29,000 the nitrogen, the amino group and the carboxyl group. 209 00:17:29,000 --> 00:17:33,000 And the consequence of this is this bond cannot rotate. 210 00:17:33,000 --> 00:17:37,000 The bond that would normally be able to rotate is not able to do 211 00:17:37,000 --> 00:17:41,000 that. And so this is sort of a protein you find that when there are 212 00:17:41,000 --> 00:17:45,000 some of these regular structures, I'm going to show you in a minute, 213 00:17:45,000 --> 00:17:49,000 like helices and things, this protein won't, 214 00:17:49,000 --> 00:17:53,000 this amino acid particularly won't fit into those structures. 215 00:17:53,000 --> 00:17:57,000 So you tend to, if nature wants to interrupt a particular regular 216 00:17:57,000 --> 00:18:01,000 structure that's coming, it will often find a proline right 217 00:18:01,000 --> 00:18:07,000 at that particular point. OK. So what we've talked about up 218 00:18:07,000 --> 00:18:14,000 until now is sort of just the very, very basic piece of protein 219 00:18:14,000 --> 00:18:27,000 structure. 220 00:18:27,000 --> 00:18:34,000 It's what called the primary structure which is nothing more than 221 00:18:34,000 --> 00:18:41,000 the sequence of amino acids. However, here's a little piece of 222 00:18:41,000 --> 00:18:48,000 protein. This is polyalanine. And one thing you can sort of see 223 00:18:48,000 --> 00:18:56,000 is if I was trying to figure out how to fold this up into a 3-dimensional 224 00:18:56,000 --> 00:19:02,000 confirmation. And let's say this had 300 amino 225 00:19:02,000 --> 00:19:06,000 acids or something, there are essentially an infinite 226 00:19:06,000 --> 00:19:10,000 number of confirmations. And so one of the real holy grails 227 00:19:10,000 --> 00:19:14,000 still in biology is trying to understand if you see the linear 228 00:19:14,000 --> 00:19:18,000 sequence of an amino acid, which we can now deduce, excuse me, 229 00:19:18,000 --> 00:19:22,000 of a protein, of an amino. If you see the linear sequence of amino 230 00:19:22,000 --> 00:19:26,000 acids in a protein, and we can deduce those from 231 00:19:26,000 --> 00:19:30,000 analyzing genomes and so on, how do we go from a thing that says 232 00:19:30,000 --> 00:19:34,000 a tryptophan, a cysteine, a serine, a serine, a threonine, 233 00:19:34,000 --> 00:19:38,000 whatever down the chain to finding its 3-dimensional structure and 234 00:19:38,000 --> 00:19:43,000 ultimately its role? And you can sort of hopefully get a 235 00:19:43,000 --> 00:19:48,000 sense from this of why it's important. So there are levels this 236 00:19:48,000 --> 00:19:52,000 goes. The next level is what's known as secondary structure. 237 00:19:52,000 --> 00:19:57,000 These are regions of local secondary structure and they're 238 00:19:57,000 --> 00:20:01,000 determined by hydrogen bonds. And I'll show you how these go in 239 00:20:01,000 --> 00:20:05,000 just a second. And then you can think about 240 00:20:05,000 --> 00:20:09,000 proteins in the tertiary structure. So what we've done and sort of 241 00:20:09,000 --> 00:20:13,000 taken a chain and then found out how a little region might take up a 242 00:20:13,000 --> 00:20:17,000 particular, for example, here's a portion that's in a helix. 243 00:20:17,000 --> 00:20:21,000 This is fairly rigid right now because of the way it's held 244 00:20:21,000 --> 00:20:25,000 together. And we'll then find maybe another region like a beta sheet I'm 245 00:20:25,000 --> 00:20:29,000 going to show you in a minute. Ultimately we have to figure out how 246 00:20:29,000 --> 00:20:33,000 all these units fold up into a 3-dimensional structure. 247 00:20:33,000 --> 00:20:38,000 And what we get there is called the tertiary structure. 248 00:20:38,000 --> 00:20:42,000 And this has some other forces we're going to talk about besides 249 00:20:42,000 --> 00:20:47,000 covalent bonds and hydrogen bonds that determine that. 250 00:20:47,000 --> 00:20:51,000 And then, as I've tried to tell you, you can see that proteins play a lot 251 00:20:51,000 --> 00:20:56,000 of roles in nature and they're not all single proteins running around 252 00:20:56,000 --> 00:21:01,000 being an enzyme or something like that. 253 00:21:01,000 --> 00:21:05,000 Many of them are parts of machines so they're made to fit together in 254 00:21:05,000 --> 00:21:10,000 absolutely beautiful ways. Some of them have, at this point, 255 00:21:10,000 --> 00:21:14,000 fifty-hundred parts that all go together fitting shapes and 256 00:21:14,000 --> 00:21:19,000 interacting with these shapes on the principles that we'll be talking 257 00:21:19,000 --> 00:21:24,000 about here, the different forces that make things happen in nature. 258 00:21:24,000 --> 00:21:28,000 And so quaternary means the structure when there's more than one 259 00:21:28,000 --> 00:21:36,000 polypeptide chain. 260 00:21:36,000 --> 00:21:41,000 So getting a handle on protein structure was kind of a very 261 00:21:41,000 --> 00:21:46,000 important intractable problem for a long time because it was just too 262 00:21:46,000 --> 00:21:51,000 hard a nut to crack, but in the 1930s and 1940s x-ray 263 00:21:51,000 --> 00:21:57,000 crystallography started to come into usage where basically you'd bounce 264 00:21:57,000 --> 00:22:01,000 x-rays off of a crystal. And then they would refract and 265 00:22:01,000 --> 00:22:05,000 you'd see characteristic reflections. And you could work backwards to 266 00:22:05,000 --> 00:22:09,000 figure out what the structure of the crystal was. This had been applied 267 00:22:09,000 --> 00:22:13,000 to minerals and a lot of structure, but it hadn't been applied to 268 00:22:13,000 --> 00:22:17,000 proteins. When people started to look they found there were certain 269 00:22:17,000 --> 00:22:21,000 proteins that gave characteristic reflections. Carotin, 270 00:22:21,000 --> 00:22:25,000 for example. Your hair gives a characteristic reflection around 5. 271 00:22:25,000 --> 00:22:29,000 angstroms. So that suggested that there was a repeating unit somewhere 272 00:22:29,000 --> 00:22:33,000 in carotin that had this. And, again, with artificial peptides 273 00:22:33,000 --> 00:22:37,000 sometimes they were able to see these reflections. 274 00:22:37,000 --> 00:22:42,000 And so that was where things stood for a while. And then one of these 275 00:22:42,000 --> 00:22:46,000 secondary structures, a very, very important one known as 276 00:22:46,000 --> 00:22:51,000 the alpha helix was deduced by Linus Pauling. Some of you have heard of 277 00:22:51,000 --> 00:22:55,000 him. He was a famous chemistry at Caltech. He got the Nobel prize. 278 00:22:55,000 --> 00:23:00,000 He also got famous later in his career because he championed the use 279 00:23:00,000 --> 00:23:04,000 of vitamin C to cure every ill known to mankind, including 280 00:23:04,000 --> 00:23:09,000 the common cold. Although there's some merit to what 281 00:23:09,000 --> 00:23:14,000 Linus stated, he probably overstated some of those later findings, 282 00:23:14,000 --> 00:23:19,000 but his contributions to the underlying chemistry and 283 00:23:19,000 --> 00:23:24,000 biochemistry of proteins was amazing. And he was the one that figured out 284 00:23:24,000 --> 00:23:29,000 the structure that explained the 5. angstrom repeat. 285 00:23:29,000 --> 00:23:33,000 And it was kind of an interesting story. He was in Oxford, 286 00:23:33,000 --> 00:23:38,000 England. And he got sick. I think it was some time in the 287 00:23:38,000 --> 00:23:42,000 winter. And he got bored reading detective books after a while so he 288 00:23:42,000 --> 00:23:47,000 thought he'd try and figure out the structure of proteins that gave rise 289 00:23:47,000 --> 00:23:51,000 to this characteristic repeat. So he made a simplifying assumption. 290 00:23:51,000 --> 00:23:56,000 He decided he'd forget all the side chains and just focus on this 291 00:23:56,000 --> 00:24:01,000 peptide backbone just with the peptide bond. 292 00:24:01,000 --> 00:24:06,000 And he was a chemist. And he knew, what I just told you, 293 00:24:06,000 --> 00:24:11,000 that this had a partial double bond character so it couldn't rotate. 294 00:24:11,000 --> 00:24:16,000 And he reasoned that this was held together by, since these things 295 00:24:16,000 --> 00:24:21,000 could form hydrogen bonds that this was probably forming a hydrogen bond 296 00:24:21,000 --> 00:24:26,000 with a carboxyl group of some other amino acid and this was probably 297 00:24:26,000 --> 00:24:31,000 forming a hydrogen bond with an amino group of a different 298 00:24:31,000 --> 00:24:40,000 amino acid. And so what he did was he made a 299 00:24:40,000 --> 00:24:53,000 sort of chain like this and he started to pleat it at the alpha 300 00:24:53,000 --> 00:25:06,000 carbon, which is the one that has the side chain on it, 301 00:25:06,000 --> 00:25:19,000 and was finding to trying to structure that would let him do this. 302 00:25:19,000 --> 00:25:32,000 And basically what he found was that if he made a helix that looks 303 00:25:32,000 --> 00:25:45,000 something like this, right-handed helix, and he could get 304 00:25:45,000 --> 00:25:58,000 a repeat structure that allowed him to form a hydrogen bond. you can build a tremendous amount of diversity even within that helical 305 00:25:58,000 --> 00:25:53,000 And the repeating unit was 5. angstroms and 3.7, excuse me, 306 00:25:53,000 --> 00:25:49,000 amino acids per turn. And it's a right-handed helix. 307 00:25:49,000 --> 00:25:44,000 It's the same sort of thing if you're trying to turn in a screw. 308 00:25:44,000 --> 00:25:40,000 It's got that kind of structure. And this shows you a little movie of 309 00:25:40,000 --> 00:25:35,000 an alpha helix. You can see this is just showing is if you put on the side chains, and you can put them on in any order, 310 00:25:35,000 --> 00:25:31,000 the backbone. So this is the part you can look right down 311 00:25:31,000 --> 00:25:31,000 the end of it. See how you can look right though? And you can see how the hydrogen bonds are formed by turning this 312 00:25:36,000 --> 00:25:40,000 thing into this regular structure. And the neat thing about this then 313 00:25:50,000 --> 00:25:55,000 structure. I think I can stop this. I just want to show you one thing, 314 00:25:55,000 --> 00:26:00,000 if I can manage to this when it comes around again. Stop it there. 315 00:26:00,000 --> 00:26:03,000 One of the things you can see, now we're looking down the helix. 316 00:26:03,000 --> 00:26:06,000 And although you won't recognize the structures of all the amino 317 00:26:06,000 --> 00:26:10,000 acids right away, you may be able to see in this 318 00:26:10,000 --> 00:26:13,000 particular one. Here are a couple of aromatic rings 319 00:26:13,000 --> 00:26:17,000 off on this side. So this side of the helix wouldn't 320 00:26:17,000 --> 00:26:20,000 like to see water, and over here are a bunch of charged 321 00:26:20,000 --> 00:26:24,000 and polar amino acids. So you could see how you could 322 00:26:24,000 --> 00:26:27,000 build into a helix like that, a surface, one part that wouldn't 323 00:26:27,000 --> 00:26:31,000 like to interact with water and another part that would. 324 00:26:31,000 --> 00:26:35,000 So that was an extremely important contribution. And there are alpha 325 00:26:35,000 --> 00:26:40,000 helices in almost all proteins. They'll be in little chunks coming 326 00:26:40,000 --> 00:26:45,000 down an amino acids chain. But they'll take up that structure. 327 00:26:45,000 --> 00:26:50,000 And, as you can see, it's driven by these hydrogen bonds that we've been 328 00:26:50,000 --> 00:26:54,000 talking about. There turns out then to be a second 329 00:26:54,000 --> 00:26:59,000 type of secondary structure that's important. It's called 330 00:26:59,000 --> 00:27:05,000 a beta sheet. And in this case you can either line 331 00:27:05,000 --> 00:27:11,000 up two polypeptide chains running in the same orientation, 332 00:27:11,000 --> 00:27:17,000 amino to carboxyl, amino to carboxyl, or you can run them in opposite 333 00:27:17,000 --> 00:27:23,000 orientations, amino to carboxyl, amino to carboxyl in the other way. 334 00:27:23,000 --> 00:27:29,000 The latter one is called an anti-parallel beta sheet. 335 00:27:29,000 --> 00:27:34,000 And if you line things up this way you'll see you can find hydrogen 336 00:27:34,000 --> 00:27:39,000 bonds between the chains like that. So this allows two things to form 337 00:27:39,000 --> 00:27:44,000 in this way and gives a sort of sheet-like structure. 338 00:27:44,000 --> 00:27:49,000 Whereas, that alpha helix has this tight coil like this. 339 00:27:49,000 --> 00:27:54,000 So over here I think we have a movie of a beta sheet. 340 00:27:54,000 --> 00:27:59,000 And you'll see again you can build up more than one. 341 00:27:59,000 --> 00:28:03,000 Because if you look up here you can see how you are all set up to form 342 00:28:03,000 --> 00:28:08,000 more hydrogen bonds out in that kind of way. And, as I said, 343 00:28:08,000 --> 00:28:13,000 you can do this same trick putting the polypeptide chains so they have 344 00:28:13,000 --> 00:28:17,000 the same polarity. And so you can approximate, 345 00:28:17,000 --> 00:28:22,000 look at the structure of most proteins then by depicting them 346 00:28:22,000 --> 00:28:27,000 either as alpha helices, which you'll see in these diagrams. 347 00:28:27,000 --> 00:28:32,000 You've already seen a few in the examples given. 348 00:28:32,000 --> 00:28:36,000 They'll look like this. Or a beta sheet which are indicated 349 00:28:36,000 --> 00:28:40,000 as these flat arrows. So here's a little piece of a 350 00:28:40,000 --> 00:28:44,000 protein made up of alpha helix, these beta sheets. What this is, 351 00:28:44,000 --> 00:28:49,000 actually, is a piece of the BRCA-1 gene. That's the familial 352 00:28:49,000 --> 00:28:53,000 susceptibility to breast cancer. The gene that causes that is called 353 00:28:53,000 --> 00:28:57,000 BRCA-1. And it has a special interaction domain called 354 00:28:57,000 --> 00:29:01,000 the BRC T domain. This is the structure. 355 00:29:01,000 --> 00:29:05,000 And the only point I'm trying to make, it's of a protein that's 356 00:29:05,000 --> 00:29:08,000 involved in preventing you from getting breast cancer. 357 00:29:08,000 --> 00:29:12,000 If you get a mutation in it, or particularly in this region, 358 00:29:12,000 --> 00:29:15,000 for example, you can end up with an increased susceptibility to breast 359 00:29:15,000 --> 00:29:18,000 cancer. But what is it? It's an alpha helices beta sheet. 360 00:29:18,000 --> 00:29:22,000 There's green fluorescent protein. You've seen that a few times. 361 00:29:22,000 --> 00:29:25,000 Maybe now you'll recognize it's mostly made of beta sheets. 362 00:29:25,000 --> 00:29:29,000 There's a little bit of alpha helix down there, a little 363 00:29:29,000 --> 00:29:32,000 bit right there. And that has the property that we've 364 00:29:32,000 --> 00:29:36,000 talked about of fluorescing. This is a protein I'll tell you 365 00:29:36,000 --> 00:29:40,000 later on that recognizes mismatches in DNA, and you get a susceptibility 366 00:29:40,000 --> 00:29:44,000 to cancer if it breaks. The only thing you notice here are 367 00:29:44,000 --> 00:29:47,000 a lot of alpha helices in it. And hopefully already your eye can 368 00:29:47,000 --> 00:29:51,000 begin to pick these out. This is an enzyme. What it does is 369 00:29:51,000 --> 00:29:55,000 it's got a catalytic ability to cleave other polypeptide chains. 370 00:29:55,000 --> 00:29:59,000 The functions of these don't matter. But you can see once again alpha 371 00:29:59,000 --> 00:30:03,000 helices beta sheets. Here's another one. 372 00:30:03,000 --> 00:30:07,000 It looks just about the same, alpha helices, beta sheets, except 373 00:30:07,000 --> 00:30:12,000 in this case this is the human gene known as, the protein encoded by 374 00:30:12,000 --> 00:30:16,000 human genes called RAS. That's an oncogene. That's a gene 375 00:30:16,000 --> 00:30:21,000 that if it mutates in a particular way will cause the cell that has 376 00:30:21,000 --> 00:30:26,000 that to move a step down the pathway to cancer. So what I've done is put 377 00:30:26,000 --> 00:30:30,000 up a whole lot of structures that have some alpha helices, 378 00:30:30,000 --> 00:30:35,000 some beta sheets. But you can get the idea that you 379 00:30:35,000 --> 00:30:40,000 can get very, very different biological activities from just 380 00:30:40,000 --> 00:30:45,000 depending on how you arrange those. OK. So there are a couple of other 381 00:30:45,000 --> 00:30:50,000 then forces that I need to tell you about if we're going to go all the 382 00:30:50,000 --> 00:30:55,000 way to understanding the 3-dimensional structure of proteins. 383 00:30:55,000 --> 00:31:00,000 What we can get to from that is alpha helices beta sheets. 384 00:31:00,000 --> 00:31:04,000 But you saw there were loops, there were other interactions that I 385 00:31:04,000 --> 00:31:09,000 haven't accounted for in showing you those 3-dimensional structures. 386 00:31:09,000 --> 00:31:14,000 So one of them are ionic bonds. This is the third class of force. 387 00:31:14,000 --> 00:31:19,000 This is an extreme case of electron sharing where one atom gets all of 388 00:31:19,000 --> 00:31:24,000 the electrons. So aspartate, which I had up on the 389 00:31:24,000 --> 00:31:28,000 board, aspartic acid looks like that, but under physiological conditions 390 00:31:28,000 --> 00:31:33,000 the oxygen will get all the electrons and you'll have 391 00:31:33,000 --> 00:31:39,000 a hydrogen on it. And a consequence of that then is 392 00:31:39,000 --> 00:31:45,000 that if you have a polypeptide chain that over here has an aspartate and 393 00:31:45,000 --> 00:31:52,000 over here has a lysine, which is the four methylene groups, 394 00:31:52,000 --> 00:31:59,000 and the positively charged thing here, you can get an ionic bond 395 00:31:59,000 --> 00:32:05,000 between those two amino acids that can be very far apart on 396 00:32:05,000 --> 00:32:12,000 the polypeptide chain. There may be a lot of amino acids in 397 00:32:12,000 --> 00:32:18,000 between, but what they then do is bring these two points together and 398 00:32:18,000 --> 00:32:24,000 hold them like that. The next class of force is kind of 399 00:32:24,000 --> 00:32:30,000 tricky. You may have heard of it in chemistry. It's referred to as van 400 00:32:30,000 --> 00:32:34,000 der Waals interaction. And the basis of this, 401 00:32:34,000 --> 00:32:38,000 without going into it too much, is even a nonpolar bond -- 402 00:32:38,000 --> 00:32:48,000 -- can have a transient polarity. 403 00:32:48,000 --> 00:33:00,000 And this then induces -- 404 00:33:00,000 --> 00:33:05,000 -- a transient polarity in a nearby bond. And it has to be a really 405 00:33:05,000 --> 00:33:11,000 nearby bond. So about 0. to 0.4 nanometers. Remember, 406 00:33:11,000 --> 00:33:17,000 covalent bonds are roughly half that distance or something. 407 00:33:17,000 --> 00:33:23,000 So it's got to be a very, very close interaction. It's weak. 408 00:33:23,000 --> 00:33:29,000 It's only one-third to one-quarter of a hydrogen bond, 409 00:33:29,000 --> 00:33:35,000 which you may recall is about one-twentieth of a covalent bond. 410 00:33:35,000 --> 00:33:38,000 But there can be many, many of them if the surfaces fit 411 00:33:38,000 --> 00:33:42,000 together really, really tightly. So if you have a 412 00:33:42,000 --> 00:33:45,000 protein fold, so there's a surface here, and then it folds up in such a 413 00:33:45,000 --> 00:33:49,000 way that there's a surface here, then you can get a lot of van der 414 00:33:49,000 --> 00:33:52,000 Waals interactions down here. Now, I've never had a really good 415 00:33:52,000 --> 00:33:56,000 way of explaining this. But today, part of these activities 416 00:33:56,000 --> 00:34:00,000 of this Hughes Professorship, I've set up some seminars on 417 00:34:00,000 --> 00:34:04,000 teaching. And I've invited a guy from Berkley 418 00:34:04,000 --> 00:34:09,000 named Robert Full who is talking in 68.180 at 4:00 PM. 419 00:34:09,000 --> 00:34:14,000 And I borrowed some things from him this morning. And we're just going 420 00:34:14,000 --> 00:34:19,000 to take a quick tour because I want to show you this. 421 00:34:19,000 --> 00:34:24,000 He works on, well, he does a lot of things. 422 00:34:24,000 --> 00:34:30,000 He works on biomotion and how animals work. 423 00:34:30,000 --> 00:34:37,000 But one of the things he works on, let's see if we can get this guy to 424 00:34:37,000 --> 00:34:44,000 go here. Oops. How do I figure out how to get it 425 00:34:44,000 --> 00:34:51,000 to play here? Hang on a second. I just discovered that the 426 00:34:51,000 --> 00:34:59,000 PowerPoint is not really terribly effective. 427 00:34:59,000 --> 00:35:04,000 So this isn't working as nicely as I would like. OK. 428 00:35:04,000 --> 00:35:09,000 Let's try this. Just a minute. Where are we? Here we go. OK. 429 00:35:09,000 --> 00:35:15,000 Let's see if I can get this to go. So he studies a bunch of things, 430 00:35:15,000 --> 00:35:20,000 but he did an undergrad project library studying geckos. 431 00:35:20,000 --> 00:35:26,000 And here this is a transparent surface. And he's studying how the 432 00:35:26,000 --> 00:35:30,000 geckos climb up and down the thing. And they were making measurements. 433 00:35:30,000 --> 00:35:34,000 And they found they couldn't account for why this was such an 434 00:35:34,000 --> 00:35:37,000 efficient organism. It used much less energy than most 435 00:35:37,000 --> 00:35:41,000 things, so they started looking into how it adhered to the surface. 436 00:35:41,000 --> 00:35:44,000 It can go up a vertical wall, as you can see here. 437 00:35:44,000 --> 00:35:47,000 And so they were able to look underneath and they could see, 438 00:35:47,000 --> 00:35:51,000 see how it sort of peels off the surface? And this was a robot that 439 00:35:51,000 --> 00:35:54,000 they eventually built that's not using the same molecular bases but 440 00:35:54,000 --> 00:35:58,000 uses this peeling thing. And they can get a robot that 441 00:35:58,000 --> 00:36:02,000 climbs up a wall. But that's not what we're going to 442 00:36:02,000 --> 00:36:07,000 talk about here. We're going to instead, 443 00:36:07,000 --> 00:36:12,000 I hope, go back to here. And you can see that all of the geckos have 444 00:36:12,000 --> 00:36:17,000 these sort of bizarre toes, and so they started looking to see 445 00:36:17,000 --> 00:36:22,000 what the underlying principle of this was. And they saw it has these 446 00:36:22,000 --> 00:36:28,000 setae. And they got looking in greater detail and blew it up. 447 00:36:28,000 --> 00:36:33,000 And then they found that there were, as they started looking there were 448 00:36:33,000 --> 00:36:38,000 these little hairs. And that's a 900 fold magnification. 449 00:36:38,000 --> 00:36:43,000 And once they got looking in more detail they found the ends were 450 00:36:43,000 --> 00:36:48,000 split so that they were, the very ends are about 200 451 00:36:48,000 --> 00:36:53,000 nanometers roughly at the end of this. And so a gecko has about a 452 00:36:53,000 --> 00:36:59,000 billion of these on its feet. And what it turns out it does -- 453 00:36:59,000 --> 00:37:02,000 And just to see, here's a human hair. 454 00:37:02,000 --> 00:37:06,000 You see how it splits down? Now, this is made of keratin, 455 00:37:06,000 --> 00:37:10,000 the molecule I just mentioned that was used, alpha helices, 456 00:37:10,000 --> 00:37:13,000 but it's very, very fine. And what it can do, it can make van 457 00:37:13,000 --> 00:37:17,000 der Waals interactions. This is an animal that sticks to 458 00:37:17,000 --> 00:37:21,000 the wall by van der Waals interactions. And the peeling away 459 00:37:21,000 --> 00:37:25,000 allows it to break those bonds. But, as you can see, they're 460 00:37:25,000 --> 00:37:29,000 enormously important. He's got here a micrograph. 461 00:37:29,000 --> 00:37:33,000 They're measuring the force, and the force is just huge. This is 462 00:37:33,000 --> 00:37:37,000 the end of the thing, the frayed end sticking to a surface. 463 00:37:37,000 --> 00:37:42,000 And for those of you who didn't think biology any relevance to you, 464 00:37:42,000 --> 00:37:46,000 Bob was telling me about this. They followed up, he's an engineer as 465 00:37:46,000 --> 00:37:51,000 well and builds interdisciplinary teams, and they've measured this 466 00:37:51,000 --> 00:37:55,000 stuff. But this is turning into what appears to look like it's going 467 00:37:55,000 --> 00:38:00,000 to be a $30 to $50 billion industry as all sorts of things are -- 468 00:38:00,000 --> 00:38:04,000 They're beginning to realize it can hold car parts together, 469 00:38:04,000 --> 00:38:08,000 it can go in space shuttles, Post-it notes. And here's a little 470 00:38:08,000 --> 00:38:12,000 Band-Aid they made. They own the patent on this 471 00:38:12,000 --> 00:38:16,000 self-cleaning dry adhesive. It doesn't have to be made out of 472 00:38:16,000 --> 00:38:20,000 gecko stuff. It could be made out of all sorts of things. 473 00:38:20,000 --> 00:38:25,000 But, anyway, here's an example of where not only are van der Waals 474 00:38:25,000 --> 00:38:29,000 forces very important, but where somebody who started a 475 00:38:29,000 --> 00:38:33,000 very simply aspect of biology worrying about the efficiency of how 476 00:38:33,000 --> 00:38:37,000 geckos ran and pushed it all the way down to the molecular level 477 00:38:37,000 --> 00:38:41,000 understood a principal that's going to make somebody a very 478 00:38:41,000 --> 00:38:45,000 large amount of money. OK. The last, 479 00:38:45,000 --> 00:38:48,000 and if anybody wants to come, he's an amazing speaker. Perhaps 480 00:38:48,000 --> 00:38:52,000 one of the most exciting speakers I've ever heard. 481 00:38:52,000 --> 00:38:55,000 68-180, 4:00 PM if you want to go. He'll have more of that sort of 482 00:38:55,000 --> 00:39:00,000 stuff to show you then. OK. So the last force here, 483 00:39:00,000 --> 00:39:07,000 it's not really a force, but what we'll call hydrophobic 484 00:39:07,000 --> 00:39:14,000 effects. And what I mean by this is that the principle of this is that 485 00:39:14,000 --> 00:39:22,000 amino acids that don't like to interact with water, so -- 486 00:39:22,000 --> 00:39:33,000 So hydrophobic amino acids. 487 00:39:33,000 --> 00:39:37,000 These are ones like lucien and phenylalanine. 488 00:39:37,000 --> 00:39:41,000 Well, I showed you the water the other day and how it was forming 489 00:39:41,000 --> 00:39:45,000 hydrogen bonds between the molecules. So if you're going to stick another 490 00:39:45,000 --> 00:39:48,000 molecule in there, you're going to break a bunch of 491 00:39:48,000 --> 00:39:52,000 bonds. And if you're not charged or polar you cannot make new bonds with 492 00:39:52,000 --> 00:39:56,000 the water. And so what happens, if you put these together, just like 493 00:39:56,000 --> 00:40:00,000 if you put oil together it will all bundle up and it will minimize its 494 00:40:00,000 --> 00:40:04,000 interactions with water. And that's what proteins do. 495 00:40:04,000 --> 00:40:08,000 Here's the structure of a protein all folded up in 3-dimensional space. 496 00:40:08,000 --> 00:40:13,000 And you can see at the core of the protein how there are these many 497 00:40:13,000 --> 00:40:17,000 hydrophobic amino acids that are interacting. And let me just, 498 00:40:17,000 --> 00:40:22,000 I'm going to close by showing you one more little movie. 499 00:40:22,000 --> 00:40:26,000 And the new version of PowerPoint doesn't do this well so I'm just 500 00:40:26,000 --> 00:40:31,000 going to get out of this for a second here. 501 00:40:31,000 --> 00:40:39,000 This is a really cool movie I saw. 502 00:40:39,000 --> 00:40:47,000 I want to show you a DNA repair protein sticking to a piece of helix. 503 00:40:47,000 --> 00:40:55,000 Can you hit the lights somebody there? So this is a lesion on a 504 00:40:55,000 --> 00:41:03,000 piece of, see the double helix here? 505 00:41:03,000 --> 00:41:06,000 And what I especially liked about this is this is sort of a Star Wars 506 00:41:06,000 --> 00:41:10,000 movie. You're going to fly down the major groove of a double helix. 507 00:41:10,000 --> 00:41:14,000 And you can see where this particular protein folded up in 508 00:41:14,000 --> 00:41:18,000 3-dimensional space is reaching down into that helix. 509 00:41:18,000 --> 00:41:22,000 So this is sort of putting together the two things that I've been 510 00:41:22,000 --> 00:41:26,000 telling you about. This blue is a DNA repair protein. 511 00:41:26,000 --> 00:41:30,000 Oopsy daisy. A DNA repair protein that's able to 512 00:41:30,000 --> 00:41:34,000 find a lesion in the DNA. And here's the double helix that's 513 00:41:34,000 --> 00:41:39,000 the two chains held together by hydrogen bonds. 514 00:41:39,000 --> 00:41:43,000 And then, as you can see, there's a groove on each side. 515 00:41:43,000 --> 00:41:46,000 And the protein is searching down into that groove --