In mammals, as typified by various rodent species, the sequences comprising each satellite show appreciable divergence between tandem repeats. Researchers can recognize common short sequences by their preponderance among the oligonucleotide fragments produced by chemical or enzymatic treatment. However, the predominant short sequence usually accounts for only a small minority of the copies. The other short sequences are related to the predominant sequence by a variety of substitutions, deletions, and insertions.
However, a series of these variants of the short unit can constitute a longer repeating unit that is itself repeated in tandem with some variation. Thus, mammalian satellite DNAs consist of a hierarchy of repeating units that can be detected by reassociation analyses or restriction enzyme digestion.
When any satellite DNA is digested with an enzyme that has a recognition site in its repeating unit, one fragment will be obtained for every repeating unit in which the site occurs. In fact, when the DNA of a eukaryotic genome is digested with a restriction enzyme, most of it gives a general smear due to the random distribution of cleavage sites. However, satellite DNA generates sharp bands because a large number of fragments of identical or almost identical size are created by digestion at restriction sites that lie a regular distance apart.
Determining the sequence of satellite DNA can be difficult. For example, researchers can cut the region into fragments with restriction endonucleases and attempt to obtain a sequence directly. However, if there is appreciable divergence between individual repeating units, different nucleotides will be present at the same position in different repeats, so the sequencing gels will not clearly identify the sequence. If the divergence is not too great—say, within about 2%—it might be possible to determine an average repeating sequence.
Individual segments of the satellite can be inserted into plasmids for cloning. A difficulty is that the satellite sequences tend to be excised from the chimeric plasmid by recombination in the bacterial host. However, when the cloning succeeds it is possible to determine the sequence of the cloned segment unambiguously. Although this gives the actual sequence of a repeating unit or units, we would need to have many individual such sequences to reconstruct the type of divergence typical of the satellite as a whole.
Using either sequencing approach, the information we can gain is limited to the distance that can be analyzed on one set of sequence gels. The repetition of divergent tandem copies makes it difficult to reconstruct longer sequences by obtaining overlaps between individual restriction fragments.
The satellite DNA of the mouse Mus musculus is digested by the enzyme EcoRII into a series of bands, including a predominant monomeric fragment of 234 bp. This sequence must be repeated with few variations throughout the 60% to 70% of the satellite that is digested into the monomeric band. Researchers can analyze this sequence in terms of its successively smaller constituent repeating units.
The sequence can be described in terms of two half-repeats. By writing the 234-bp sequence so that the first 117 bp are aligned with the second 117 bp, we see that the two halves are quite similar. They differ at 22 positions, corresponding to 19% divergence. This means that the current 234-bp repeating unit must have been generated at some time in the past by duplicating a 117-bp repeating unit, after which differences accumulated between the duplicates.
The repeating unit of mouse satellite DNA therefore contains two half-repeats that reveal extensive sequence identity.
Within the 117-bp unit we can recognize two further subunits. Each of these is a quarter-repeat relative to the whole satellite. The four quarter-repeats can be aligned to reveal the homologies between them. We see that the divergence between the four quarter-repeats has increased to 23 out of 58 positions, or 40%. The first three quarter-repeats are somewhat more similar, and a large proportion of the divergence is due to changes in the fourth quarter-repeat.
The alignment of quarter-repeats identifies homologies between the first and second halves of each half-repeat.
Looking within the quarter-repeats, we find that each consists of two related subunits (one-eighth-repeats), referred to as the α and β sequences. The α sequences all have an insertion of a C, and the β sequences all have an insertion of a trinucleotide sequence relative to a common consensus sequence. This suggests that the quarter-repeat originated by the duplication of a sequence like the consensus sequence, after which changes occurred to generate the components we now see as α and β. Further changes then took place between tandemly repeated αβ sequences to generate the individual quarter- and half-repeats that exist today. Among the one-eighth-repeats, the present divergence is 19/31 = 61%.
The alignment of eighth-repeats shows that each quarter-repeat consists of an α and a β half. The consensus sequence gives the most common base at each position. The “ancestral” sequence represents a sequence very closely related to the consensus sequence that could have been the predecessor to the α and β units. Because the satellite sequence is continuous, it can be treated as a circular permutation for the purpose of deducing the consensus sequence.
The consensus sequence can be analyzed directly by writing the satellite sequence as a 9-bp repeat. In this way, the current satellite sequence can be treated as derivatives of a 9-bp sequence. Three variants of this sequence can be recognized in the satellite:
G A A A A A C G T
G A A A A A T G A
G A A A A A A C T
The origin of the satellite could well lie in an amplification of one of these three nonamers (9-bp units). The overall consensus sequence of the present satellite is effectively an amalgam of the three 9-bp repeats.
The existence of an overall consensus sequence is demonstrated by writing the satellite sequence as a 9-bp repeat.
The average sequence of the monomeric fragment of the mouse satellite DNA explains its properties. The longest repeating unit of 234 bp is identified by restriction digestion. The unit of reassociation between single strands of denatured satellite DNA is probably the 117-bp half-repeat, because the 234-bp fragments can anneal both in register and in half-register. In the latter case, the first half-repeat of one strand renatures with the second half-repeat of the other.
So far, we have treated the present satellite as though it consisted of identical copies of the 234-bp repeating unit. Although this unit accounts for the majority of the satellite, variants of it are also present. Some are scattered randomly throughout the satellite, whereas others are clustered.
The existence of variants is implied by the description of the starting material for the sequence analysis as the “monomeric” fragment. When the satellite is digested by an enzyme that has one cleavage site in the 234-bp sequence, it also generates dimers, trimers, and tetramers relative to the 234-bp length. They arise when a repeating unit has lost the enzyme cleavage site as the result of mutation.
The monomeric 234-bp unit is generated when two adjacent repeats each have the recognition site. A dimer occurs when one unit has lost the site, a trimer is generated when two adjacent units have lost the site, and so on. With some restriction enzymes, most of the satellite is cleaved into members of this repeating series. The declining number of dimers, trimers, and higher multimers shows that there is a random distribution of repeats in which the enzyme’s recognition site has been eliminated by mutation.
Digestion of mouse satellite DNA with EcoRII identifies a series of repeating units that are multimers of 234 bp and also a minor series that includes half-repeats.
Other restriction enzymes show a different type of behavior with the satellite DNA. They continue to generate the same series of bands, but digest only a small proportion of the DNA, typically 5% to 10%. This implies that a certain region of the satellite contains a concentration of repeating units with this particular restriction site. Presumably the series of repeats in this domain are all derived from an ancestral variant that possessed this recognition site, although some members have subsequently lost it by mutation.
A satellite DNA suffers unequal recombination. This has additional consequences when there is internal repetition within the repeating unit. Let us return to a cluster consisting of “ab” repeats. Suppose that the “a” and “b” components of the repeating unit are themselves sufficiently similar to allow them to pair. Then the two clusters can align in half-register, with the “a” sequence of one aligned with the “b” sequence of the other. How frequently this occurs depends on the similarity between the two halves of the repeating unit. In mouse satellite DNA, reassociation between denatured satellite DNA strands in vitro commonly occurs in the half-register.
When a recombination event occurs out of register, it changes the length of the repeating units involved in the reaction:
xababababababababababababababababy
xababababababababababababababababy
xabababababababababababababababababy
×
↓
+
xababababababababbabababababababy
In the upper recombinant cluster, an “ab” unit has been replaced by an “aab” unit. In the lower cluster, an “ab” unit has been replaced by a “b” unit.
This type of event explains a feature of the restriction digest of mouse satellite DNA. A fainter series of bands appears at lengths of 0.5, 1.5, 2.5, and 3.5 repeating units, in addition to the stronger integral-length repeats. Suppose that in the preceding example, “ab” represents the 234-bp repeat of mouse satellite DNA, generated by cleavage at a site in the “b” segment. The “a” and “b” segments correspond to the 117-bp half-repeats.
In the upper recombinant cluster, the “aab” unit generates a fragment 1.5 times the usual repeating length. In the lower recombinant cluster, the “b” unit generates a fragment half the usual length. The multiple fragments in the half-repeat series are generated in the same way as longer fragments in the integral series when some repeating units have lost the restriction site by mutation.
Turning the argument around, the identification of the half-repeat series on the gel shows that the 234-bp repeating unit consists of two half-repeats that are closely related enough to pair occasionally during recombination. Also visible are faint bands corresponding to 0.25- and 0.75-repeat spacings. These are generated in the same way as the 0.5-repeat spacings when recombination occurs between clusters aligned in quarter-register. The decreased similarity between quarter-repeats compared with half-repeats explains the reduced frequency of the 0.25- and 0.75-bands relative to the 0.5-bands.
References:
- Lewin’s GENES XII. Jones & Bartlett Learning.
- Molecular Biology of the Gene. Pearson Education
- Human Molecular Genetics. Garland Science
















