The Dynamic Genome and Transposable Elements
Transcript of Part 1: Introduction to Transposable Elements
00:00:00.06 My name is Sue Wessler. I'm a professor of Genetics at the University of California, Riverside, 00:00:04.14 and my lab studies what is the subject of the talks today: and those are transposable elements. 00:00:10.14 The title of the general lecture is The Dynamic Genome and the title 00:00:15.06 of this first presentation is Introduction to Transposable Elements. Here I will talk about the discovery 00:00:21.17 of transposable elements, and how this seemingly trivial discovery 00:00:25.23 led to what is now recognized as a revolution in biology. So I start showing pictures 00:00:33.11 that are integral to the talk. The first is a picture of Barbara McClintock who 00:00:40.12 is the discoverer of transposable elements, and I'll tell you more about her as the talk 00:00:44.04 goes on, and the other is the corn kernels which are also integral to the discovery. 00:00:51.07 So I've divided the talk into three parts. The first is a description of the discovery 00:00:58.29 of transposable elements by Barbara McClintock. The second is a little more detailed 00:01:04.12 on how transposable elements actually move and increase their copy number 00:01:09.17 in the genome. And the third is just how abundant transposable elements are in genomes, 00:01:15.01 and this is really the part that made transposable elements more than just a trivial 00:01:23.23 discovery and that has led to transposable elements being viewed as the major 00:01:29.24 component of the genomes of higher organisms. Here's a picture of Barbara McClintock 00:01:36.00 when she was a graduate student. It's a picture of the lab of R.A. Emerson 00:01:41.23 who is the father of maize genetics. This picture was taken at Cornell 00:01:47.23 University in 1929 and in the next talk actually we will come back and revisit 00:01:54.09 this particular area of Cornell. Here's a picture of McClintock and you can see 00:02:01.24 that she wasn't wearing what women normally wore back then - that's a skirt - she wore knickers. 00:02:07.14 I had the good fortune to know her later in her life and I will talk about that a little later. 00:02:13.06 The focus of their lab was corn and corn genetics, and one of the reasons 00:02:20.26 it was such a wonderful organism to study genetics is it had an abundance 00:02:24.24 of interesting visible characteristics - traits. As we found out years later, 00:02:31.29 one of the reasons for this diversity is that it has a remarkably diverse genome. So McClintock 00:02:40.28 asked a very simple question, and this question is really at the heart of this talk 00:02:46.20 and the next talk. And that is "why are these corn kernels spotted?" Now many of you may recognize 00:02:54.08 this corn although it doesn't look like the corn you eat as the corn 00:02:57.29 that you see during Thanksgiving. It's usually hanging up in supermarkets. 00:03:01.16 It's Indian corn. It's known as Indian corn and unlike the corn we eat which is yellow, this corn is highly pigmented. 00:03:08.22 But what's unusual about this particular corn cob here are the spotted 00:03:16.20 corn kernels. McClintock was interested in what is the genetic mechanism responsible 00:03:23.18 for this spotted kernel phenotype. Here's a picture. A lot of McClintock's 00:03:30.24 work because she went on to be world famous is available online and you can actually 00:03:36.20 access her notebooks. This is a picture of her notebook with her typing 00:03:41.09 at the bottom describing these kernels and the genetic basis for their unusual phenotypes. 00:03:46.11 The one thing you'll notice is that the kernels are incredibly detailed. 00:03:51.17 That is one of the reasons that she was able to figure out so much about the behavior 00:03:58.24 of the transposable elements that I'll talk to you about because of the wonderful resolution 00:04:03.24 of the phenotypes in the kernels. I'm going to cut to the chase 00:04:08.09 and tell you McClintock's solution to the spotted corn kernel question and that is that 00:04:15.26 she discovered that spotted corn kernels were caused by a new type of genetic 00:04:21.18 element - a new mutation - and I've diagrammed it here. If we start with the gene, 00:04:26.23 the gene is mutant but not due to a basepair change or deletion, it's due to actually an insertion 00:04:35.10 of a piece of DNA into the gene and this insertion inactivates 00:04:41.11 the gene. But unlike other mutations such as, as I said before, basepair changes or deletions this mutation 00:04:49.05 is reversible. The way it's reversible is that the piece of DNA - the TE or transposable 00:04:56.24 element and you'll see this abbreviation throughout "TE". The TE can insert into the gene 00:05:03.09 and it can then excise from the gene. When it inserts we have a mutant phenotype. 00:05:08.13 When it excises we revert to a wildtype phenotype or normal phenotype. 00:05:13.09 What I've drawn here is a spotted corn kernel. 00:05:17.00 And I've tried to explain how the different sectors in the kernel arise. 00:05:23.07 What you see at the top is a mutant 00:05:27.09 gene and so we can imagine that this gene is a gene responsible for pigment biosynthesis 00:05:33.10 so that if the gene is mutant there's no pigment - the kernel is yellow. 00:05:37.23 If the gene is wildtype, normal, the kernel is purple. To explain a spotted kernel 00:05:44.04 we have to have a reversible phenotype, a reversible mechanism. 00:05:48.16 So what you see is that the colorless areas are due to the gene at the top. 00:05:56.06 That is cells that have the gene at the top that is a gene that is disrupted by a transposable element 00:06:01.11 so it cannot produce the products necessary for pigment biosynthesis 00:06:06.21 and so we end up with the yellow unpigmented areas. However, during kernel development 00:06:12.05 the transposable element can excise from the gene and when it does that it restores expression of the gene 00:06:19.01 and we end up with the spotted kernels and so because kernel development is very regular 00:06:25.14 - that is that unlike in animal systems 00:06:28.03 plant cells when they divide they don't move around they divide and they're literally cemented in place 00:06:33.07 - we end up with these sectors. A large sector is due to a cell 00:06:40.18 in which the transposable element has excised from the gene and all of the mitotic progeny 00:06:46.22 then can express the gene so we end up with a sector much like a clone on a Petri dish of bacteria. 00:06:54.11 As I said before, the rest of the kernel are from cells 00:06:59.13 that still have the transposable element in the gene. 00:07:02.03 McClintock was able to use the behavior of transposable elements and the detailed resolution 00:07:15.01 that the corn kernels facilitated to understand something about the behavior of the element 00:07:23.11 and that's shown in this slide. 00:07:25.02 What we have here at the top is a gene that's pigmented 00:07:28.28 and it's pigmented because the color gene is wildtype. 00:07:32.15 That's one allele: a wildtype gene. We have another allele where we have what I call an 'NTE'. 00:07:40.04 It stands for non-autonomous TE and I'll explain that in a second. 00:07:44.07 What you're going to see here is that there's two different types of transposable elements. 00:07:47.26 One is autonomous, one is non-autonomous and these slides will hopefully clarify what I mean by that. 00:07:53.21 In this case we have a transposable element sitting in a gene, 00:07:57.28 but the transposable element, which is non-autonomous, is not able to move on its own. 00:08:02.01 However, that transposable element can move if there's a second transposable element in the genome 00:08:10.21 and that's shown here as the 'TE' and this is elsewhere in the genome. 00:08:14.02 It could be on a different chromosome. It is not near this color gene, 00:08:18.03 but that transposable element activates the non-autonomous element 00:08:23.07 and causes it to excise from the gene and we end up with a spotted kernel. 00:08:28.27 In the absence of that autonomous element as you see above it, 00:08:32.24 the colorless kernel, the transposable element cannot move. 00:08:35.19 That's why we call it non-autonomous. 00:08:37.29 And finally the last situation is we can have autonomous elements inserting into genes and that's shown here. 00:08:45.19 We have the autonomous element inserting into a gene and because the autonomous element 00:08:49.26 makes everything that is needed for transposition that kernel will be spotted. 00:08:55.11 To review: McClintock not only discovered transposable elements 00:09:00.23 but she also discovered that there are different types of transposable elements 00:09:06.02 and in this case we have the autonomous element and that's defined as an element 00:09:10.07 that provides everything needed for transposition 00:09:14.00 and non-autonomous elements which can only move or transpose in the presence 00:09:20.11 of the autonomous element that is if the autonomous element is in the genome simultaneously. 00:09:24.26 McClintock made her discoveries initially in the 1940s - a very long time ago 00:09:34.00 - but it took a long time for the scientific world to catch up with her. 00:09:38.27 That's why I say here that she was well ahead of her time. In the 40 or 50 years after their discovery, 00:09:45.08 transposable elements which were initially only recognized to be present in maize 00:09:51.00 were found in many organisms in fact in virtually all eukaryotes. 00:09:56.08 In the 1950s transposable elements were discovered in Drosophila fruit fly. 00:10:01.15 In the 1960s, they were discovered in bacteria, in E. coli. 00:10:05.11 And in the 1970s they were discovered in the human genome 00:10:10.01 as a cause of some mutations in the human genome, and we'll talk a lot about that, more about that later. 00:10:14.23 So here is one of my favorite pictures. 00:10:18.04 It is a picture of a transposable element in action in a rose in the Napa Valley. 00:10:25.04 And another picture from my colleague Tom Gerats is a picture of a petunia flower. 00:10:32.12 And the reason that I show you this is that if you look in gardens, rose gardens or other gardens, 00:10:38.11 you will notice these phenotypes. 00:10:40.06 They are not just patterns, which you see with a lot of flowers. 00:10:42.19 They are actually sectors and in case these sectors are completely analogous 00:10:47.21 to the spots on the corn kernels that I showed you before. 00:10:50.04 So McClintock as I said, it took 40 years really for the world, the scientific world, 00:10:58.12 to recognize that her discovery in corn, in maize, was true for most eukaryotes. 00:11:06.26 And in fact what McClintock discovered was that there was more in the genome than just genes. 00:11:16.21 She discovered a new component of the genome. 00:11:18.06 And I've drawn that here as a chromosome before McClintock. 00:11:24.00 showing these rectangular boxes, and those are representations of genes. 00:11:27.27 So before McClintock, people thought, when they thought about it at all, 00:11:32.03 that the genome, that the chromosomes, were essentially genes sort of lined up like beads on a string. 00:11:38.01 After McClintock, it was recognized that there was another component in the genome. 00:11:43.27 And that component as I have drawn as black ellipses here were the transposable elements. 00:11:48.14 So McClintock recognized that these transposable elements were moving around were not coming from 00:11:56.28 the environment. They are not viruses that are infecting the organism. 00:11:59.26 They actually are residents of the genome, and these we now know are residents of most eukaryotic genomes. 00:12:08.14 So for this discovery McClintock was awarded the Nobel Prize in Medicine or Physiology in 1983. 00:12:18.03 Now Nobel Prizes are frequently awarded many years after the discovery. 00:12:25.13 40 years is a very long time, even for Nobel Prizes. 00:12:31.01 In this really is for the reason I said before. She was really ahead of her time. 00:12:34.19 The other thing is that Nobel Prizes are frequently awarded for up to three people. 00:12:39.00 She was awarded the Nobel Prize by herself, 00:12:43.07 and this really recognizes that this was her discovery. 00:12:47.27 So here is a picture of her. She lived from 1902 to 1992. 00:12:53.19 I had the wonderful fortune of knowing her for the last 10 years of her life, 00:12:57.12 and continuing on with some of her discoveries, and I will be talking about that in the next talk. 00:13:04.07 She also commented on how her life really encompassed the entire history of genetics. 00:13:15.25 So Mendel's laws were rediscovered in 1902, the early part of the 1900s, which was the year of her birth. 00:13:23.05 And she lived long enough first of all to be recognized for her great discoveries, but second of all, 00:13:30.16 to actually see the world enter the genomics age. 00:13:33.13 So the age of DNA sequencing, and that is what we will talk about in a minute. 00:13:36.26 So what I am going to do now is tell you a little bit about how transposable elements move. 00:13:44.05 So we are going to move from the genetics to the molecular biology. 00:13:46.11 So here is a diagram of what a generic transposable element looks like. 00:13:57.17 And so they are very simple genetic systems. 00:14:00.23 This is a piece of DNA. It varies from a couple of hundred nucleotides to several thousand basepairs. 00:14:07.21 Transposable elements are.... this is an autonomous element, 00:14:10.27 so remember this is the element that can move on its own. 00:14:13.26 It encodes everything needed to move itself and to move a non-autonomous element. 00:14:18.19 So when it says everything needed, that is a single protein that it encodes, and that is called transposase. 00:14:24.24 And I will tell you in a minute a little bit more about what transposase does. 00:14:27.25 The element is flanked by special sequences which are called terminal inverted repeats. 00:14:35.08 These are sequences that are the same sequence forward and then flipped over backwards. 00:14:39.27 And I will show you in a minute why, what the functional significance of that is. 00:14:45.12 And the whole element is flanked by a target site duplication. A 'TSD'. 00:14:51.20 And I will show you in a minute how that is derived. 00:14:53.21 So I mentioned that there are autonomous elements and non-autonomous elements. 00:14:57.28 Non-autonomous elements which cannot move on its own. 00:15:01.13 They cannot move on their own. 00:15:03.09 And they require an autonomous element to provide the stuff needed to move them, 00:15:11.01 and you can see here that what we have is a non-autonomous element is sometimes, 00:15:15.15 but not always, a defective version of an autonomous element. 00:15:21.08 So what I have drawn here is a deletion that has been sustained which prevents the non-autonomous element 00:15:29.29 from making transposase. 00:15:33.05 The transposase that is made by the autonomous element, as we will see in a second, 00:15:36.29 can influence both the movement of the autonomous element and the non-autonomous element. 00:15:42.25 So that is shown in this slide. 00:15:45.11 So what I've shown here is the transposase, I am sorry, the autonomous element, which is at the top, 00:15:51.13 encodes a single protein, and that protein is a transposase. And what that transposase does initially 00:15:56.14 is it binds to the ends, to the terminal inverted repeats, 00:15:59.24 of both the autonomous element and the non-autonomous element. 00:16:04.23 So one of the functions that the transposase has is as a DNA binding protein. 00:16:09.29 Ok, so this cartoon sort of will take you through the steps in the transposition 00:16:17.18 of a transposable element. 00:16:20.05 So I've shown here that this is what we saw in the previous slide, but in a more abbreviated form. 00:16:25.01 We have the transposase proteins bound to the ends of a transposable element 00:16:32.16 Those transposase molecules come together and form a dimer. 00:16:38.05 They then cleave the transposable element out of what is called the donor DNA, 00:16:45.09 out of the rest of the DNA, and that entire complex as you see here then can insert somewhere else. 00:16:53.19 So what we end up with is a transposable element in a new location in the genome. 00:16:58.12 So I want to define another term and that is a transposable element family. 00:17:06.12 And this we will see a lot more in the next talk. 00:17:09.27 So a transposable element family, as you saw before, has autonomous elements and non-autonomous elements, 00:17:16.18 and there can be lots and lots of members of the family. 00:17:21.06 So we can have one ore more autonomous elements, 00:17:23.26 one or more... many, many non-autonomous elements in the genome. 00:17:28.15 And as you see here the structure of the non-autonomous element can vary. 00:17:31.14 Sometimes it will be a simple deletion of the transposase region. 00:17:36.18 Sometimes it will be more extensive, and sometimes there may be none of the... 00:17:39.22 the only thing it may share with the autonomous element 00:17:44.00 are the terminal inverted repeats and the length of the target site duplication. 00:17:49.28 So what I am going to do in this slide is to show you how the target site duplication arises. 00:17:56.11 So what you see at the top is going to be a new insertion site of a transposable element. 00:18:02.06 So this is a piece of DNA where the transposase is going to bind. 00:18:08.28 It is going to... we talked before about the transposase binding to the ends of the transposon. 00:18:14.01 Now that is not... this is different from that. This is the target site. 00:18:16.20 This is where the transposable element is going to insert. 00:18:20.06 The transposase has a lot of different functions built into this single protein. 00:18:26.05 One of the functions is to cleave the target site. 00:18:31.05 And it cleaves it in a way that is much like restriction endonucleases. 00:18:34.23 It makes a staggered cut, so you see that here. 00:18:37.08 It cuts on the two strands of the DNA, at essentially the same sequence, and when those strands come apart 00:18:43.27 you see that we are left with these overhangs. 00:18:48.28 These sequence overhangs. 00:18:50.18 It is into this region that the transposable element inserts. 00:18:54.02 Then we are left with these gaps on the side. Those gaps are filled in by host enzymes. 00:19:01.04 And because of this reaction, the transposable element, which is shown in green, 00:19:05.18 is then flanked by a repeat, a target site duplication, 00:19:11.08 which is a... and the length of that duplication in this case, 00:19:16.09 we show that the transposable element is cleaving 5 basepairs. It is a staggered cut of 5 basepairs. 00:19:22.10 The repeat sequence will be 5 basepairs. 00:19:25.03 Different transposases have characteristic staggered cuts. 00:19:30.29 So some transposases will cut three basepairs apart. Some will cut 8 basepairs apart. 00:19:36.08 And the resulting transposable element will then have a target site duplication 00:19:41.02 that is the length of the staggered cut. 00:19:44.26 So again here to remind you, here is our transposable element family. 00:19:53.00 And this is to show you that the families, like all families, like human families, 00:19:57.19 can be different, can look different. 00:20:00.12 What we see here is that there could be multiple autonomous elements in the genome. 00:20:04.17 There could be many, many, many non-autonomous elements in the genome. 00:20:07.27 And we will see more in the second talk. 00:20:10.29 The other thing that differs is that genomes can and do have multiple families of transposable elements. 00:20:21.08 So I've shown you here at the top the family we've been working with 00:20:25.17 with where the transposase, the blue family, 00:20:27.20 where the transposase is produced and binds to all the yellow terminal inverted repeats. 00:20:35.12 At the same time the genome could also have a different family. 00:20:39.07 And I've shown the green family at the bottom. 00:20:42.13 The transposase from this family binds specifically to the purple terminal inverted repeats. 00:20:48.13 So in this way there can be multiple families that co-exist in a genome 00:20:52.29 that have really nothing to do with each other. 00:20:55.26 What I want to do now is in a few slides take you through... What I haven't explained to you 00:21:03.04 is... I've told you that there are multiple copies of transposable elements in the genome. 00:21:07.18 But I haven't explained how a transposable element can increase its copy number. 00:21:11.29 Because in fact what I have told you is sort of the opposite. 00:21:14.11 What I have told you is that a transposable element at one place 00:21:17.00 excises and moves someplace else. And I am not really great at math, 00:21:22.01 but I can figure out that that won't increase the copy number. 00:21:25.10 You start with one. You move from one site; you move to another. 00:21:28.02 So I want to show you in the next couple of slides is, just very simply, 00:21:32.04 I'll show you a schematic on how by doing that a transposable element can actually increase its copy number. 00:21:37.21 Because the copy number of transposable elements will be a major part of the talk, the rest of this talk 00:21:43.20 and the talk that follows. 00:21:45.16 So what I have drawn here is at the top I have a transposable element at a particular site 00:21:51.05 in the genome. That region is now ... the DNA is being replicated. 00:21:56.06 And what we see is the familiar replication fork. So here are the sister chromatids. OK, 00:22:03.22 so what happens, this is the same thing at the top, we are replicating. 00:22:08.12 The transposable element is going to move from one of the sister chromatids after replication 00:22:14.11 to another site, to the other sister chromatid. 00:22:17.12 And that is shown at the bottom here. 00:22:20.00 So I've redrawn that at the top and what happens after this is kind of neat. 00:22:30.19 The site, the empty site here, sometimes it remains empty, 00:22:36.17 sometimes the host will use the transposable element on the sister chromatid 00:22:42.20 to copy it into that empty site, and that is what you see at the bottom here. 00:22:46.14 So when these chromosomes... when replication is finished, 00:22:50.22 and we end up with two double stranded daughter strands, 00:22:56.11 what we have is the top strand has one transposable element, 00:22:59.15 and the bottom strand has two transposable elements. 00:23:02.20 So we've gone from a situation where we had two transposable elements, 00:23:05.06 I am sorry, one transposable element, 00:23:06.25 to one that now we have two. 00:23:09.24 These chromatids will separate and they will go into separate cells. 00:23:13.12 So one cell will have two transposons, and one will have one. 00:23:16.06 And I think in the next slide what I have done is I have summarized all of the steps. 00:23:21.11 So we start out with a single transposable element. 00:23:24.00 There is replication. There is transposition that occurs from one sister chromatid to the other. 00:23:31.00 There's repair of the empty site using the transposon from the sister chromatid. 00:23:37.01 And there is separation of the chromatids, and we end up with an increase in one transposon in one of the cells. 00:23:43.08 That is one mechanism. There is another mechanism which I have just drawn as a shortcut here. 00:23:48.18 And that is you see at the top just what we started with before. We have a transposable element. 00:23:53.28 We have replication. We then have transposition not from a replicated site into another replicated site, 00:24:01.00 but instead from a replicated site into an unreplicated site ahead of the replication fork. 00:24:07.27 What happens then is again we will have the completion of replication, 00:24:15.09 and we end up with a separation of strands, 00:24:18.08 and we end up again with a gain in the number of transposable elements. 00:24:23.13 So I have told you so far I have focused on one... what is now known to be one 00:24:31.23 of two classes of transposable elements that are in the genome. 00:24:35.07 The elements that were discovered by McClintock and that are responsible for the unstable, 00:24:41.01 for the spotted kernels, the sectored flowers, 00:24:45.12 those are caused by the element type that I have shown you here 00:24:51.02 with a transposase with terminal inverted repeats, 00:24:54.00 but there in fact is another class of transposable elements that I am not really focusing on in this talk. 00:25:01.17 And I am only going to mention it briefly. 00:25:05.07 But these are in fact incredibly abundant transposable elements. 00:25:09.00 They are called retrotransposons. 00:25:13.03 And these are called now Class 1 transposons. 00:25:17.05 Whereas the DNA transposons that McClintock studied are called Class 2 transposons. 00:25:23.17 Now the retrotransposon is characterized by terminal inverted repeats, 00:25:31.07 long terminal inverted repeats, where the ends of the element are not inverted repeats like the DNA transposon, 00:25:38.02 they are direct repeats which are shown here. 00:25:41.04 They also have a target site duplication because 00:25:44.13 the insertion occurs in the same way as the insertion of DNA transposons. 00:25:48.10 And now I am calling them DNA transposons and RNA transposons 00:25:51.18 and the reason is they're named for the intermediate in transposition. 00:25:57.16 So a DNA transposon excises from one site as a DNA element and moves elsewhere. 00:26:03.20 In contrast, an RNA transposon or retrotransposon, the intermediate is RNA, 00:26:09.15 and I've summarized that in the next slide. 00:26:12.27 So here I have shown you a retrotransposon sitting inserted into a chromosome somewhere. 00:26:25.23 The element is transcribed much like a gene. 00:26:29.15 That RNA then is converted into... it is copied into DNA, into a DNA copy. 00:26:39.10 And this is copied by an enzyme called reverse transcriptase, 00:26:42.11 and it is encoded by the retrotransposon. 00:26:46.00 That is then converted into a double stranded DNA molecule. 00:26:51.13 And that double stranded DNA molecule inserts elsewhere in the genome. 00:26:57.05 So this is a lot easier than the DNA transposition mechanism that I showed you before. 00:27:01.06 In essence you could think of a retrotransposon as like a printing press. 00:27:05.21 It makes RNAs and each of those RNAs potentially could be converted 00:27:10.09 into a double stranded DNA, and that double stranded DNA can insert elsewhere in the genome. 00:27:16.13 So a single element because there can be many, many transcripts that come from a single element, 00:27:23.07 a single element could potentially lead to hundreds and hundreds of new 00:27:28.08 integrations, of new copies in the genome. 00:27:30.26 So I've told you about the discovery of transposable elements. 00:27:36.19 I've told you about, something about how elements move and increase their copy number 00:27:42.03 Now I want to sort of wow you with something that I think 00:27:45.13 has been one of the major findings of this era of genomics, 00:27:51.04 and that is just how many transposable elements there are in the genomes of higher organisms. 00:27:56.28 So, when we started sequencing genomes, 00:28:03.05 and when I say we I am using the general "we" of the scientific community. 00:28:06.12 It turns out that the largest component of genomes are derived from transposable elements. 00:28:13.03 50% of the human genome, of the chimp genome, of the mouse genome, 00:28:18.19 more or less fifty percent are derived from transposable element sequences. 00:28:23.24 Plants, especially flowering plants have even higher proportions of their genomes that are transposable elements. 00:28:31.19 The maize genome, over 75% of the genome is derived from transposable elements. 00:28:37.16 The barley genome, and we will be talking more about these creatures in the next talk. 00:28:45.29 The barley genome is almost 85% transposable elements. 00:28:49.17 And even more remarkable, the iris genome is 98% transposable elements. 00:28:53.29 And I don't know about you, but if I look at an iris plant, I mean they are beautiful, 00:28:57.14 there is nothing that would tell us that their genomes are just largely transposable elements. 00:29:06.13 I want to give you a feel for how many transposable elements there are in the human genome. 00:29:12.07 So our genome is comprised of 2.5 billion basepairs. 00:29:20.13 Let's call them A, G, C, of T. Let's say that's 2.5 billion letters, like A, B, C, and D. 00:29:26.16 So if they are letters, let's start filling up some books. 00:29:32.05 So this is equivalent to about a thousand textbooks of a thousand pages each. No pictures. 00:29:38.28 Only 20 to 40 of those 1,000 textbooks contain all of the genes necessary 00:29:52.13 to encode the proteins that make us up. 00:29:56.00 Five hundred of the 1000 textbooks contain sequences that are derived from transposable elements. 00:30:01.28 So it is pretty stunning. 00:30:04.00 To give you and idea of just how many transposable elements there are and where they are 00:30:11.23 I am showing you an example of a typical human gene. 00:30:14.26 And what you see are the green boxes which are the exons, 00:30:18.07 coding regions, and the blue areas which are the introns. 00:30:23.06 Let's look and see where there are transposable elements in this gene. 00:30:26.22 Because there are transposable elements in 70-80 percent of our genes contain transposable elements. 00:30:32.08 So this shows you all the different places that this genes has transposable elements. 00:30:38.14 What you'll notice, it is kind of hard to tell in this slide, but that mostly the non-coding regions, 00:30:44.26 the non-exonic regions contain some transposable elements. 00:30:49.14 In fact some human genes have almost a hundred transposable elements in their introns. 00:30:54.27 So transposable elements are really everywhere. 00:30:58.19 So what I want to show you in the next slide is how transposable elements 00:31:03.23 can diversify a group of very, very closely related organisms. 00:31:08.10 And for this I am going to go back to plants, and I am going to show you 00:31:13.20 a group of organisms that we are very familiar with - the cereal grasses. 00:31:17.00 You may not know I came from New York City, 00:31:19.13 so I didn’t realize that all of these were in fact members of the grass clade. 00:31:24.08 They are rice, sorghum, maize, and barley are some of the most important organisms 00:31:32.03 on this planet for human calories. 00:31:35.09 And what you see here, we've talked about maize a lot. 00:31:39.26 Maize... the maize genome size is 2500 megabases. 00:31:43.09 That's about the same size as the human genome. 00:31:46.12 However, what you see is that relatively closely related organisms, 00:31:51.16 such as the rice genome has a genome size that is almost ten times smaller, 350 megabases. 00:31:58.05 Sorghum is twice the size of that, 700 megabases. Barley is 5000 megabases. 00:32:03.26 These genomes have gone through a remarkable expansion. 00:32:07.18 An explosion. 00:32:08.15 And what's responsible for that largely is the increase in the genome size due to transposable elements. 00:32:15.08 How do organisms function with that much stuff? 00:32:20.23 There is actually three major reasons for the success of organisms 00:32:26.06 despite being crowded with that many transposable elements. 00:32:29.13 The first thing is that most of the transposable elements in the genome 00:32:33.11 are dead. And when I say dead, I mean they can't move. 00:32:36.18 They don't move anymore. They probably haven't moved for a long time. 00:32:40.03 And they are dead because they are mutated. They contain mutations. 00:32:44.14 So every single generation mutations are introduced into our DNA. 00:32:49.11 Mutations that occur in genes will lead to problems for the organism, and they are selected against. 00:32:56.20 However, mutations that occur in transposable elements just accumulate. 00:32:59.24 It's not a problem to the organism at all. 00:33:02.01 So we end up with most of the transposons, the vast majority of the 50% of our genome 00:33:07.19 that are transposable elements are not able to move around anymore. 00:33:10.25 And will never move around. They will just sort of, as we say, senesce. 00:33:15.15 The sequences will mutate and mutate 00:33:17.13 until there is really no trace that they ever were derived from a transposable element. 00:33:20.20 The second way that organisms survive, it's really how transposable elements survive, 00:33:28.18 is that transposable elements have evolved mechanisms that allow them to insert into places in the genome 00:33:36.21 that won't harm the organism. 00:33:39.24 So for example, one safe haven might be into another transposable element. 00:33:44.22 So if a transposable element inserts there, it is not inserting... it is not causing 00:33:49.01 a mutation in any of the genes necessary for the organism to survive. 00:33:53.24 And we will talk a lot more about that in the second talk. 00:33:58.09 The final thing which I am just going to allude to briefly here 00:34:00.22 is that the host has a way to fight transposable elements, and it is a very sophisticated way. 00:34:06.10 The host silences transposable elements using epigenetic mechanisms and inactivates them. 00:34:14.15 And I've... I'll show you on the next slide. 00:34:18.10 So this is just an example of a region of the barley chromosome. 00:34:23.00 And what you see are... remember I said that 85% of the barley genome is derived from transposable elements. 00:34:30.12 And this is how a region of the genome might look. 00:34:33.25 So here we have a few genes, the little blue boxes over there. 00:34:36.14 And what you see stacked up here are all the transposable elements. 00:34:40.12 They are the vast majority of this particular region of the gene. 00:34:43.24 So, but if you look at a flat version of the DNA, 00:34:50.02 if you actually look at the three dimensional structure, the chromatin, 00:34:53.09 the way the genome exists, what you find is that the region where these transposable elements are clustered 00:35:02.25 in fact is a tightly compacted region of the genome. 00:35:06.24 It is what's called heterochromatin. 00:35:08.22 There's very little, nothing really... This is a host response. 00:35:16.11 It condenses the chromatin and prevents the transposons 00:35:20.28 from making the transcripts and proteins needed for it to move. 00:35:25.18 So they are said to be silenced. 00:35:28.01 This in addition to the fact that these transposable elements are accumulating mutations. 00:35:31.20 In contrast, the region where the host needs the genes to be expressed, 00:35:38.02 those are euchromatic regions. They are less condensed. 00:35:42.21 So the genome, the chromosome, is composed of regions that are very, very densely compacted, 00:35:49.26 and that is generally where the transposable elements are, 00:35:52.15 and regions that are much less compacted so that the host can access 00:35:56.08 those to make the gene products needed for its development in life. 00:36:00.26 So like many things with transposable elements, McClintock was ahead of her time. 00:36:07.05 She didn't only discover transposable elements, 00:36:09.24 but she proposed that they had a role in generating diversity. 00:36:15.16 And this scenario had the following components. 00:36:19.27 That transposable elements in the genome usually do not move around, 00:36:24.00 because if they did move around, they would cause mutations. 00:36:26.22 That "stress" conditions may activate transposable elements. 00:36:32.00 Now when I say stress conditions I am thinking more... I am not talking about like driving in rush hour traffic, 00:36:36.17 I am talking more about climate change, that might be one thing. 00:36:40.26 This is a scenario. This is not proven by any means. 00:36:44.12 This was her ideas 20 or 30 years ago. 00:36:48.05 So genomes have a stash of transposable elements. Most are inactive, some are active. 00:36:54.29 And the host is keeping them inactive by these epigenetic mechanisms, but if something happens to the host, 00:37:03.27 some of these, or the host population, some of these transposable elements can be activated to move around. 00:37:09.16 The significance of that is that the movement of transposable elements will generate genetic diversity. 00:37:17.03 And it will do this by increasing the frequency of mutation. 00:37:20.11 So what we end up with is a population that is now more diverse because of the movement of transposable elements. 00:37:26.10 There is new mutations in that population, and it is possible that some of those new mutations may be adaptive. 00:37:34.10 May help the population survive this dramatic change in climate, or whatever. 00:37:40.04 So I want to end by essentially saying to you that the way I think of transposable elements 00:37:49.01 is that they shake up the genome. And the genome is inherently conservative. 00:37:54.26 So they shake up an otherwise conservative genome in ways that we are jut beginning to understand 00:37:59.25 and ways that I will go into in the next talk.