|
CENTROMERES:
A Journey to the Center of the Chromosome
Christine Mlot*
The Arabidopsis genome project is the first to give a detailed picture
of the centromeres in a higher eukaryote
As a postdoc at Stanford University in 1994, Daphne Preuss was examining
mutagenized pollen grains under a microscope when she saw it: Amid all
the dots of lone pollen, four grains were stuck together, tracing the shape
of a tetrahedron. Having written her Ph.D. thesis on yeast, where such
tetrads are standard and have been the foundation for its genetic analysis,
Preuss knew she was looking at something powerful. "I immediately knew
this [mutant] was the key to doing all kinds of genetic analysis" in plants,
she recalls. "Life would be different."
That chance finding launched her career as a plant biologist.
Some 6 months after she found it, an electron micrograph of the mutant
pollen was on the cover of Science (3 June 1994), and Preuss was
soon on her way to the University of Chicago, where she directs a lab that
runs in large part on the power of her mutant find, dubbed quartet.
The lemon-yellow pollen grains in which she spotted quartet
were from the mustard plant Arabidopsis thaliana. What was unusual
was that the four gametes were joined. Typically during meiosis in a plant
or animal, the two chromosomes within a cell join; recombine, or exchange
genetic material; then divide and separate twice into four haploid cells--the
gametes. Each gamete, whether pollen or sperm, contains half the genetic
complement. But in this newfound Arabidopsis mutant, the standard
diploid cell produces four adjoined haploid cells--a tetrad, as in yeast.
By analyzing these four cells instead of random gametes, geneticists can
chart recombination events with unprecedented precision. Preuss realized
that this four-in-one mutant could reveal what happens during meiosis in
plants as it had in yeast. It would also enable her to define the centromeres,
which have been defined in yeast but which remain a black box in plants
and animals.
The centromere is a crucial stretch of DNA buried in the knotty
terrain at the center of the chromosome. It plays a key role in meiosis,
pairing up parental chromosomes and hitching them to protein motors that
pull the chromosomes apart before cells divide. The dense, central region
of the chromosome containing the centromere is readily visible under a
microscope. Yet only in yeast have researchers been able to identify the
exact DNA sequence of the centromere.
Using the Arabidopsis tetrad mutants, Preuss has established
where the centromeric region starts and stops on each of the five chromosomes,
a first for a complex eukaryote. Now, by building "minichromosomes," she
and her colleagues are on their way to pinpointing where and how in that
region the proteins attach in meiosis. The research is "blazing trails,"
says Kelly Dawe of the University of Georgia, Athens, who is developing
such minichromosomes in maize.
Preuss's lab and quartet have also been indispensable to
the Arabidopsis genome sequencing project, started in 1996 with
the goal of deciphering the plant's 120-million-base-pair sequence. The
fine-scale genetic map her group developed by using pollen tetrads boosted
the unprecedented accuracy and completeness of the sequence of this model
organism. Not only did the map enable Preuss to define the centromeric
region, but it also enabled the six sequencing groups to assign unknown
fragments of DNA, especially from the centromeric region, to their rightful
places on the chromosomes. "We wouldn't have been able to have done it
without her and [postdoc] Greg Copenhaver," says W. Richard McCombie of
the sequencing group at Cold Spring Harbor Laboratory in New York.
Into the chromosome centers
Under a microscope, the cinched region of the centromere is easily
one of the most distinguishing features of the threadlike chromosomes.
Indeed, cytologists captured the first images of the centromere in the
late 1800s. But for all their centrality--literally and figuratively--in
dividing cells, centromeres have escaped much dissection. "The centromere
has been remarkably elusive to pin down," says Brian Charlesworth, an evolutionary
biologist at the University of Edinburgh in the United Kingdom.
For decades both cell biologists and geneticists have attacked
the problem of how chromosomes segregate and the role the centromere plays
in this process. By the 1940s geneticists were mapping the locations of
the dense centromeric regions in yeast and other fungi. This was possible
because these organisms packaged their gametes in tetrads, which enabled
geneticists to track recombination among the four cells. The centromeres
always stayed put on the chromosomes they started out with, in contrast
to other recognizable markers. Thus, the centromeres became the baseline
for measuring distance to markers on the chromosome arms. From that point
on, tetrad analysis became a stock in trade for geneticists, especially
as yeast became the favored model system.
In 1980, using tetrad analysis, researchers narrowed the location
of the centromere to about a 4000-base stretch of DNA in budding yeast,
the first organism to achieve that landmark at that level of resolution.
Eventually, as molecular technologies improved, yeast's functional centromere--the
precise bases involved in hitching chromosomes to the proteins--was whittled
down to a 125-base stretch.
Meanwhile, during the 1960s and '70s, cell biologists had developed
staining and other techniques that visually highlighted the centromeric
region on the larger chromosomes of plant and animal cells. These techniques
also highlighted the corresponding protein structure the centromere meshes
with. By that time, researchers were beginning to realize that eukaryotes
had devised many ways to segregate their chromosomes--in other words, one
centromere did not fit all. But except in yeast, the exact DNA sequence
of the centromeres remained elusive, escaping detection even in the massive
projects to sequence the genomes of fruit flies and humans.
The problem is that the chromosome centers are jungles of difficult
loops and repetitions, or heterochromatin, in contrast to the smooth runs
of readable DNA, or euchromatin, on the chromosome arms. As sequencers--who
decipher the genetic code of short fragments and then reassemble them in
correct order--approach the central regions of the chromosomes, they run
into long stretches of nothing but repeated bases, such as ATAT ... AT.
These stretches of repetitive DNA are next to impossible to piece together,
as they contain few landmarks to orient them. The centromeric region is
"the part of the genome that's ignored when it comes to estimating time
to 'complete' genome projects," says Gary Karpen, who studies the centromere
in Drosophila at the Salk Institute for Biological Studies in La
Jolla, California.
In Arabidopsis, Preuss and her co-workers were able to
find the right place for these "orphaned" stretches by analzying the results
of 1000 crosses using the mutant pollen, each cross producing four plants,
one from each cell of the tetrad. With tetrads, "we can literally redraw
what happened in meiosis," she says. They were able to find recognizable
markers linked to these orphan stretches and then examine how often the
markers stayed put with the centromeres in the tetrads generated in meiosis.
The more often a marker separated from its centromere, the farther down
the chromosome arm it was. The more often it stayed with its centromere,
the closer in it was. The frequency of these separations, or recombinations,
within a foursome was used to calculate distance between markers and centromeres.
In this fashion the researchers generated successively finer scale maps
of the genome, starting out with roughly one marker for every three megabases
and finishing in the region of the centromere with one marker for every
10 kilobases. The frequency of recombination between a marker and its centromere
also enabled the Preuss group to delineate where on the map the centromeric
region starts and stops on all five chromosomes (Science, 24 December
1999, p. 2468). When they
found no pattern of crossing over in the tetrads, the researchers knew
that they had found markers in the genomic terra incognita of the centromeric
region itself.
Among the five chromosomes, these genetically defined centromeric
regions vary in length from 1.4 megabases to 1.9 megabases, says Preuss.
That's about 7% of the entire genome--a far more precise definition of
the Arabidopsis centromeric regions than earlier estimates of more
than 40% of the genome, says Preuss. Inside these regions as well as flanking
them, Preuss and her team found more repetitive sequences, most notably,
recognizable sequences of 180 base pairs repeated hundreds of times, on
all five chromosomes.
Surprising nuggets also turned up inside the centromeric regions,
most notably, a significant number of genes. "That was one of the big interesting
things," says Preuss, because the centromere had long been considered relatively
barren territory. The analysis has located some 200 genes in the Arabidopsis
centromeric regions, at least 50 of which are expressed. About 40 of these
genes appear only once in the genome sequence. "These are bona fide genes
that would have been left out" without delving into the centromeric region,
says Preuss.
Creating chromosomes
Having defined the centromeric regions and sequenced most of them,
the researchers still need to find the functional part and figure out how
it works. Given the diversity of centromeres in different organisms, there
isn't a universal code to look for.
To find the functional centromere, Preuss's lab is developing
a new tool-- experimental minichromosomes that are a stripped-down version
of an Arabidopsis chromosome. They contain all the essential parts:
the centromere; telomeric DNA from the chromosome ends; genes of interest
or indicator genes, such as green fluorescent protein; and elements to
ferry the package into cells. When assembled, these minichromosomes should
function in plants alongside the other chromosomes.
Postdoc Kevin C. Keith in Preuss's lab is now testing pieces of
DNA from the Arabidopsis centromeric regions to see which ones work
in cell division--the hallmark of a functional centromere. Keith is combing
through all the bases in the centromeric region, including the 180-base
repeats. The process is akin to going through jars of bolts in a hardware
store to find the right one, in this case, a stretch of DNA that holds
the minichromosome to the cell's protein motors. To do so, Keith is methodically
inserting sequences of fewer than 100 kilobases into the minichromosomes
to test which ones work as the centromere.
Such minichromosomes are in the works for other organisms, including
humans. Apart from their use as a tool to explore chromosomal functioning,
they have an applied side in genetic engineering as well. Researchers believe
they will provide a controlled means of "stacking" large numbers of genes--say,
for pathogen resistance-- into an organism that could also be engineered
to be eliminated when necessary.
One of Preuss's next interests is to use the minichromosomes to
study why chromosomes don't always segregate properly and end up with two
or no chromatids in a gamete instead of just one. If the gamete is used
in fertilization, the resulting offspring will have an abnormal number
of chromosomes, a condition known as aneuploidy. It happens between 1%
and 2% of the time in Arabidopsis--about the same frequency as in
yeast--but far more often in human meioses. The development of minichromosomes
over the next 5 years "will open up tremendous resources" for exploring
such phenomena, she says.
Preuss isn't the only one who thinks so. This past summer she
became a Howard Hughes Medical Institute investigator, only the third plant
biologist to be so recognized. To commemorate the occasion, Preuss's lab
presented her with a framed collage of the pollen tetrads and the data
they've generated. At the center is a photo of Howard Hughes holding a
fistful of Arabidopsis. It's in bloom.
Christine Mlot is a science writer
in Madison, Wisconsin. |