Plant Cell Applied Biosystems SYBR(R) Cells-to-CT(TM) Kits
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via ISI Web of Science (127)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.
Agricola
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.
Plant Cell, Vol. 12, 381-392, March 2000, Copyright © 2000, American Society of Plant Physiologists

The Complete Sequence of 340 kb of DNA around the Rice Adh1–Adh2 Region Reveals Interrupted Colinearity with Maize Chromosome 4

Renato Tarchini1,a, Phyllis Biddlea, Robin Winelandb, Scott Tingeya, and Antoni Rafalskia
a DuPont Agricultural Biotechnology–Genomics, Delaware Technology Park, Suite 200, 1 Innovation Way, P.O. Box 6104, Newark, Delaware 19714-6104
b Pioneer Hi-Bred International, Inc., 7300 N.W. 62nd Avenue, P.O. Box 1004, Johnston, Iowa 50131-1004

Correspondence to: Antoni Rafalski, j-antoni.rafalski{at}usa.dupont.com (E-mail), 302-631-2607 (fax)


* ABSTRACT
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

A 2.3-centimorgan (cM) segment of rice chromosome 11 consisting of 340 kb of DNA sequence around the alcohol dehydrogenase Adh1 and Adh2 loci was completely sequenced, revealing the presence of 33 putative genes, including several apparently involved in disease resistance. Fourteen of the genes were confirmed by identifying the corresponding transcripts. Five genes, spanning 1.9 cM of the region, cross-hybridized with maize genomic DNA and were genetically mapped in maize, revealing a stretch of colinearity with maize chromosome 4. The Adh1 gene marked one significant interruption. This gene mapped to maize chromosome 1, indicating a possible translocation of Adh1 after the evolutionary divergence leading to maize and sorghum. Several other genes, most notably genes similar to known disease resistance genes, showed no cross-hybridization with maize genomic DNA, suggesting sequence divergence or absence of these sequences in maize, which is in contrast to several other well-conserved genes, including Adh1 and Adh2. These findings indicate that the use of rice as the model system for other cereals may sometimes be complicated by the presence of rapidly evolving gene families and microtranslocations. Seven retrotransposons and eight transposons were identified in this rice segment, including a Tc1/Mariner–like element, which is new to rice. In contrast to maize, retroelements are less frequent in rice. Only 14.4% of this genome segment consist of retroelements. Miniature inverted repeat transposable elements were found to be the most frequently occurring class of repetitive elements, accounting for 18.8% of the total repetitive DNA.


* INTRODUCTION
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

Rice has become the model system for the molecular biology of grasses because its genome is amenable to analysis. The rice genome is relatively small (450 Mb; Arumuganathan and Earle 1991 Down), but its gene content is comparable to that of other grasses (Ahn and Tanksley 1993 Down). More than 45,000 rice expressed sequence tags (ESTs) are publicly available, and a dense EST-based restriction fragment length polymorphism map (Harushima et al. 1998 Down) has been established and connected to a physical map (Sasaki 1998 Down). One of the objectives of our research reported here was to determine the sequence composition of the rice genome on a sizable contiguous sample of DNA; we also wanted to evaluate how its properties might affect a large-scale DNA-sequencing assembly process. The content, composition, and organization of the repetitive DNA fraction determine the assembly success of a shotgun DNA-sequencing process of rice bacterial artificial chromosomes (BACs).

To what extent the information coming from the sequencing of the entire rice genome can be applied to other Gramineae species remains to be established. Comparative mapping has revealed the existence of a high degree of conservation in gene repertoire and order among the grass genomes (Ahn and Tanksley 1993 Down; Devos et al. 1994 Down; Kurata et al. 1994 Down; Gale and Devos 1998 Down). This has allowed the construction of a general grass consensus map (Moore et al. 1995 Down; Gale and Devos 1998 Down) in which different grass genomes are described in terms of ancestral "rice linkage blocks." Bennetzen and co-workers have shown that microscale colinearity can be found in homologous regions of closely related species such as maize, sorghum, and rice (M. Chen et al. 1997 Down; Bennetzen et al. 1998 Down). However, exceptions to these findings have also been described (Bennetzen et al. 1998 Down; Tikhonov et al. 1999 Down). In particular, whereas the regions surrounding the alcohol dehydrogenase Adh1 genes of maize and sorghum were found to be colinear, the corresponding regions of rice showed a lack of conservation (Tikhonov et al. 1999 Down).

Our goal was to establish whether use of the rice genome would be feasible as a surrogate genome for map-based cloning of maize genes. To use rice as the surrogate genome, one would first genetically map a trait in maize and then use flanking maize markers to isolate the corresponding genetic interval in rice. Candidate genes would be identified from the rice BAC contig, and corresponding genes in maize would be isolated and analyzed. A similar approach was used by Kilian et al. 1995 Down for barley. Complete sequencing of a several hundred kilobase interval of the rice genome and investigation of the syntenous region in maize allowed us to predict some of the difficulties that could be encountered in using this approach. The basis for the reported lack of sequence conservation between the regions surrounding the Adh1 genes of maize and rice was investigated further, and the composition of transposable elements in this genome segment was established.


* RESULTS
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

Physical Mapping of a BAC Contig in the Adh1 Region
A set of minimally overlapping clones consisting of BAC85C11, BAC178G5, and BAC62F3 covering the Adh1 region was assembled by hybridization with the Adh1 probe; the set was extended by polymerase chain reaction (PCR) identification of additional BACs, using primers corresponding to the ends of the Adh1-containing BACs (Fig 1B). Additional BAC subclones were used to complete the contig (see Methods).



View larger version (29K):
[in this window]
[in a new window]
 
Figure 1. Genetic and Physical Map of Rice Chromosome 11S around the Adh1 and Adh2 Loci and Genetic Map of the Corresponding Regions of Maize Chromosomes 1 and 4.

(A) Genetic map of rice 11S. Map positions of markers are shown in centimorgans (http://www.dna.affrc.go.jp:84/publicdata/geneticmap98/chr11pre.html).

(B) Physical map of the 339,485-bp segment of rice 11S. Genes indicated in red have been genetically mapped on maize chromosome 4, as shown in (C). The RZ53 map position is from Ahn and Tanksley 1993 Down. Genes shown in yellow do not cross-hybridize with maize genomic DNA. Blue indicates the Adh1 gene; its position on the genetic map of maize chromosome 1 has been inferred from published information (MaizeDB; http://www.agron.missouri.edu/). The gene indicated in white has not been hybridized with maize genomic DNA. MAPK, mitogen-activated protein kinase; PIK, phosphatidylinositol 4-kinase.

(C) Segments of maize chromosomes 1 and 4 containing maize homologs of rice chromosome 11 genes identified in (B).

Sequence of the Adh1 Region
The sequence of this region consists of 339,485 bp of DNA and is equivalent to ~2.3 centimorgans (cM) of rice chromosome 11S, as determined from the published relative map positions of restriction fragment length polymorphism markers C410 (GenBank accession number D15288), C477 (GenBank accession number D15339), C496 (GenBank accession numbers D15347 and D22600), and R682 (GenBank accession number D23967) on the 1998 version of the Rice Genome Research Program genetic map of chromosome 11, as shown in Fig 1A and Fig 1B (see also http://www.dna.affrc.go.jp:84/publicdata/geneticmap98/chr11pre.html). The distance between C410 and R682 is 330,670 bp. Those previously mapped markers correspond to gene 33 (DUPR11.33), function unknown, and gene 1 (DUPR11.1), which is Adh2, in our sequence (Table 1).

 
View this table:
[in this window]
[in a new window]
 
Table 1. Identified and Predicted Genes

The physical to genetic distance ratio is, on average, 147 kb of DNA per cM. This estimate is only approximately half of the expected average of 295 kb/cM, calculated by assuming a total genetic map length of 1522 cM (http://www.dna.affrc.go.jp:82/) and a 450-Mb genome size for rice (Arumuganathan and Earle 1991 Down).

Gene Repertoire
We identified 33 putative genes (Table 1 and Fig 2), with an average of one gene per 10.3 kb of DNA. Thirteen of these genes were confirmed by the identification of corresponding cDNA clones, and one was confirmed by the isolation of a rapid amplification of cDNA ends (RACE)–PCR product. Nineteen of the genes are homology-based or GENSCAN predictions for which no transcripts were identified.



View larger version (39K):
[in this window]
[in a new window]
 
Figure 2. Gene and Repetitive Element Map of the 339,485-bp Segment of Rice Chromosome 11S.

Positions of genes reported in Table 1 and transposable/repetitive elements reported in Table 2 are shown. Arrows indicate direction of transcription. MITEs (Wessler et al. 1995 Down) are indicated with colored stars. Different classes of MITEs are shown in different colors (R. Tarchini, unpublished results). NBS, nucleotide binding site.

Adh1 and Adh2 are the only known genes of rice present in the region. Twelve of the genes are similar in structure to other plant genes with known function. Three major classes of genes, the products of which may be related to disease resistance, were found. These include a protein kinase–like protein that contains a leucine-rich repeat (LRR) and is similar to Xa-21 (DUPR11.16), a nucleotide binding site– containing protein similar to a lettuce disease resistance protein (DUPR11.30), and three gene products similar to S-domain receptor–like kinases (DUPR11.18, DUPR11.19, and DUPR11.20; Table 1). No function could be attributed to 18 of the putative genes. We found evidence of transcription for six of them: matching ESTs were found in five instances, and a RACE product was isolated for the remaining one (Table 1). Two of the transcriptionally active genes (REP54 and REP43) are members of a family that is represented by 13 different copies. Nine of those copies have the potential to encode functional proteins; the others are apparently pseudogenes.

Repetitive DNA
The region contains numerous class I (retrotransposon) and class II (transposon) mobile genetic elements (Grandbastien 1992 Down), miniature inverted repeat transposable elements (MITEs; Wessler et al. 1995 Down) (Table 2 and Fig 2), and simple sequence repeats (Weber and May 1989 Down). Four Ty1-Copia–like and three Ty3-Gypsy–like retrotransposons were found. Most of them are not highly similar to any well-characterized rice retrotransposons. The identified structural similarities are to partial sequences of repetitive DNA in rice or are limited to the highly conserved reverse transcriptase domain. Some of the elements show nucleotide sequence similarities to retrotransposons of other species. Copia C resembles the wheat Tar1 element (Matsuoka and Tsunewaki 1997 Down), and Copia D is similar to BARE-1, a Copia-like retroelement of barley (Manninen and Schulman 1993 Down). Gypsy C is highly similar to retrosat2 of rice (GenBank accession number AF111709). It also resembles the maize retrotransposon Tekay (GenBank accession number AF050455), the dea1 element from the evolutionarily more distant pineapple (Ananas comosus) (Thomson et al. 1998 Down), and the Lilium henryi del retrotransposon (Smyth et al. 1989 Down). With the possible exception of Gypsy C, all of the elements are most probably inactive, given the presence of frameshifts, missense mutations, or deletions and rearrangements.

 
View this table:
[in this window]
[in a new window]
 
Table 2. Transposons and Retrotransposons Identified in This Study

No long interspersed nuclear elements were found by searching for the reverse transcriptase domain in the vicinity of oligo(dA/dT) sequences. All reverse transcriptase domains found were within long terminal repeat (LTR)–containing elements described above.

Six of the eight identified transposons can be ascribed to the CACTA family of DNA transposable elements (Gierl et al. 1989 Down), based on the presence of the characteristic CACTA motif in the terminal inverted repeats. The two major rice elements of this group are unique in sequence, but both show similarities to the coding portions of the maize Enhancer/Suppressor mutator (En/Spm) system (Pereira et al. 1986 Down) and the snapdragon Tam1 element (Nacken et al. 1991 Down) (Table 2). More than 95% of the En/Spm A nucleotides are identical to those of RIM2, a rice transposon protein–like cDNA induced by Magnaporthe grisea (GenBank accession number AF121139). CACTA A, CACTA B, CACTA C, and CACTA D are highly homologous to each other and can be considered members of the same family of defective transposable elements. The remaining two transposable elements are a small, presumably defective, novel transposable element and a Tc1/Mariner–like element (Hartl et al. 1997 Down), providing evidence for the presence of this family of transposable elements in rice.

MITEs are the numerically most abundant class of repetitive DNA elements found in this region (Fig 2). We were able to define a total of 78 MITEs (an average of one every 4.36 kb) and to classify them according to their overall sequence similarities (R. Tarchini, unpublished results). Of these, 66 were assigned to 14 different families. Each family contains at least two and at most 13 members.

Among the simple sequence repeats containing at least six repeats, AT/TA repeats were the most abundant (11), followed by AG/TC (10) and AC/GT (four), confirming previous observations of relative simple sequence repeat abundance reported in the literature (Lagerkrantz et al. 1993 Down). Ten of the 11 AT/TA repeats are concentrated in a 94-kb interval (nucleotides 210,277 to 304,540).

Genetic Mapping with Rice Probes in Maize
Thirteen loci from the sequenced region were tested for their sequence conservation and genetic map position in maize (Fig 1C). Five rice genes hybridized well with maize genomic DNA: Adh1, Adh2, RZ53, phosphatidylinositol 4-kinase (PIK), and mitogen-activated protein kinase (MAPK) genes. Three probes, Adh2, PIK, and MAPK, identified useful polymorphism on the available maize mapping populations, ALEB9 and DRAG2, each of which contained 86 individuals (see Methods). Adh2 was mapped to chromosome 4 in the DRAG2 population, 10.8 cM downstream of umc31a. PIK and MAPK were mapped as 8.7 and 9.2 cM, respectively, downstream of umc31a in the ALEB9 population. In all cases, the loci were placed on the map with LOD scores >20 (Lander et al. 1987 Down). However, because of the modest size of the mapping population, the absolute order of the markers on maize chromosome 4 must be considered provisional, and differences from the order of MAPK, PIK, RZ53, and Adh2 loci observed in rice cannot be excluded. In rice, Adh1 (GenBank accession numbers D15347 and D22600) and Adh2 (GenBank accession number D23967) map to chromosome 11S position 30.3, unknown sequence DUPR11.31 (GenBank accession number D15339) maps to chromosome 11S position 28.4, and unknown sequence DUPR11.33 (GenBank accession number D15288) maps to chromosome 11S position 28.0 (Fig 1A and Fig 1B; http://www.dna affrc.go.jp:84/publicdata/geneticmap98/chr11pre.html).


* DISCUSSION
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

Genomic Organization
Analysis of the sequence of 339,485 bp of the rice Adh1 region revealed the presence of 33 putative genes corresponding to one gene per 10.3 kb. This is ~1.4 times higher than the average of one gene every 14 kb, calculated by assuming a total of 30,000 genes, and approximately one-half of the reported gene density for Arabidopsis (4.6 kb per gene; Bevan et al. 1998 Down).

The comparison of the physical interval with the genetic distance shows that the frequency of recombination in this region is twice the average for rice. The ratio of physical to genetic distance in this region is 147 kb/cM, as compared with the rice genome average of 295 kb/cM. This result may be explained by a local variation in the recombination frequency or by an error in estimating the genetic distance.

Gene sequences comprise a total of 101,923 bp of annotated features (including exons, introns, and 5' and 3' untranslated regions for which ESTs were available) corresponding to 30% of the 399,485 bp that were sequenced. An interesting feature of the region is the frequent occurrence of gene duplication. Examples include two alcohol dehydrogenases, two monooxygenases, three serine/threonine kinase–like genes, and a family of 13 genes of unknown function clustered in a 150-kb interval. All of the duplicated genes are closely spaced and occur mostly in the same orientation (Table 1 and Fig 2). Unequal crossing-over followed by sequence divergence is a likely explanation, as has been proposed for the evolution of multigene families (Nei et al. 1997 Down; Zhang et al. 1988 Down). An exception is a small inversion involving the region containing REP82 and one of the two copies of the flavine monooxygenase–like (FMOA) gene.

The identification of genes was aided by the availability of EST collections. Eleven of the 33 candidate genes were confirmed by the identification of homologous ESTs in our collection of >120,000 rice ESTs. Eight of the 11 rice genes homologous to DuPont rice ESTs also have corresponding GenBank sequences (Table 1). Two predicted genes (a Myb-like gene and an unknown gene) correspond to the publicly available ESTs, but there is no corresponding EST in our collection.

The main repetitive DNA fraction present in the region had sequence similarities to different classes of mobile genetic elements. Except for the non-LTR retrotransposons, all major classes of known transposable elements are represented. Excluding gene families from the calculation, the repetitive fraction comprises 28.46% of the region sequence, of which half is represented by retrotransposons. MITEs alone constitute 18.8% of the repetitive DNA (on average, one MITE per 4.36 kb; 5.34% of the total DNA) and are the third most abundant components after transposons (8.7% of the total DNA). In contrast, the retrotransposon fraction in maize accounts for 60% of the genome, and MITEs appear to be rare (M. Morgante, personal communication). This observation provides additional evidence for the proposal (SanMiguel et al. 1996 Down) that retrotransposon proliferation contributed greatly to the genome expansion observed in maize.

All of the transposons and retrotransposons except the MITEs appear to be dispersed, with no obvious clustering. This is in contrast to what was observed with maize (SanMiguel et al. 1996 Down). For maize, nested retroelements are the main component of the intergenic regions. This difference may be adequately explained by differences in the density of retroelements. Also, as SanMiguel et al. 1996 Down hypothesized, several successive invasions of different retroelement families may have contributed to the complexity of the genomic organization in maize. In rice, most of the homologs to mobile genetic elements reside between genes. Insertions in putative coding regions can be seen in two cases (REP69 and REP84; Fig 2). In both instances, the inserted element is a retrotransposon and the disrupted gene is a member of the REP gene family. This indicates that duplications leading to the expansion of that gene family are antecedent to the insertion of the retroelements. The insertion of Gypsy B between Kinase A and B might provide further evidence in favor of this view. MITE-to-MITE distances follow a gamma distribution, with skewness of 1/8, indicating a statistically significant clustering of MITEs in the region examined here. The data available were insufficient to determine possible association of MITEs with coding sequences, as has been proposed for maize (Wessler et al. 1995 Down).

Data from the comparative analysis of the orthologous Adh1 regions of maize and sorghum suggest that the dramatic increase in size of the maize Adh1 region dates back only 3 million years and that almost all of the retrotransposon insertions occurred in the last 6 million years (SanMiguel et al. 1998 Down). This is long after the germlines of maize and sorghum diverged.

Retrotransposons with LTRs, such as the rice Gypsy A and Gypsy C, increased the difficulty of assembly of the genomic sequence, a problem that is likely to be encountered in the future. Cataloging these elements may allow automated highlighting of problematic regions.

Gene Repertoire
The presence of a cluster of genes structurally similar to disease resistance genes is an interesting feature of the region. One example is an LRR protein kinase–like gene showing similarity to the rice Xa-21 (Wang et al. 1996 Down) gene product. The gene encodes a putative 1099–amino acid protein with 25 LRRs, which is followed by a potential transmembrane domain and a serine/threonine kinase–like domain. Although the similarity to Xa-21 at the protein level is <50% and no significant nucleotide similarity is detectable, the presence of a single intron in the same position in the two genes suggests that they might be paralogs. Another example is a single-copy gene encoding a protein containing a nucleotide binding site and showing similarity to a lettuce resistance protein candidate (GenBank accession number AF017751; Meyers et al. 1998 Down) and to an isolog of Arabidopsis disease resistance protein RPM1 (GenBank accession number U95973). Although the gene is actively transcribed, it contains a frameshift mutation and therefore might represent a transcribed pseudogene, presumably arising from a very recent mutation.

Three members of a family of putative serine/threonine receptor–like kinases, similar to S-domain receptor–like protein kinases and to Arabidopsis STE-like receptor kinases (Walker 1993 Down), are also present and form a tight cluster spanning ~20 kb of DNA sequence. Nucleotide sequence identity among the three genes is >85%, and one of these elements (STE Kinase A) contains a single open reading frame capable of encoding an 820–amino acid protein. The other two members of the cluster are apparent pseudogenes, as indicated by the presence of frameshifts and stop codons prematurely interrupting the open reading frame. The function of both the Arabidopsis genes and the rice genes reported here remains to be established, but a role in signal transduction that possibly involves a pathogen response may be hypothesized on the basis of the structural features.

Components of signal transduction pathways, MAPK, PIK, and a Myb-like factor are also present, as are sequences similar to enzymes involved in oxidative burst (peroxidase) or in detoxification processes (dimethylaniline monooxygenases) (Hammond-Kosack and Jones 1997 Down; Yang et al. 1997 Down).

The association between the Adh1 region and the locus controlling the resistance to M. grisea race a (Pi-a) has been reported by Goto et al. 1981 Down. According to these studies, the Pi-a locus is located 1 cM from Adh1, placing it directly in the interval sequenced. The double haploid line used for the construction of the BAC library derives from the crossing of rice varieties sensitive to race a of M. grisea. Therefore, if the Pi-a locus is included in the sequenced region, then it will be represented by its sensitive allele.

Interestingly, a homolog of NifS was found. NifS, one of the components of the major Nif gene cluster of Azotobacter vinelandii, is required for the activity of the bacterial nitrogenases (Kennedy and Dean 1992 Down). NifS homologs have been isolated from different organisms, including non-nitrogen-fixing bacteria (S. Chen et al. 1997 Down) and higher eukaryotes, including mammals (Nakai et al. 1998 Down), but to our knowledge no plant homolog has been reported.

Synteny in the Adh2 Homologous Regions of Rice and Maize
The relationship between rice and maize genetic maps in the vicinity of Adh1Adh2 was established by mapping rice genes derived from this region in two different maize populations.

Thirteen different rice probes were used to map three homologous maize loci in bin 3 of maize chromosome 4. This group included maize loci homologous to the rice Adh2, PIK, and MAPK genes. The association of the RZ53-homologous locus of maize with Adh2 is based on a published report (Ahn and Tanksley 1993 Down). The mapping results (Fig 1) indicate microcolinearity between the rice Adh1–Adh2 region on chromosome 11 and the maize chromosome 4 segment around the Adh2 locus. A notable exception to this synteny is seen at the Adh1 locus, which is discussed later. Eight of the rice probes failed to hybridize with maize genomic DNA on DNA gel blots at high and medium stringency, and the extent of gene order conservation for these genes remains to be assessed. We expect that the homologs of these rice genes are present in maize, but the degree of sequence conservation is less. The lack of DNA sequence conservation of many genes in this region represents a factor that will limit studies based on utilizing genome colinearity between these two species.

In general, two classes of DNA sequences may be distinguished. One corresponds to conserved genes encoding basic metabolic functions. This class includes Adh1, which shows as much as 94 to 100% sequence conservation between rice and maize. The other class, which shows less sequence conservation, as measured by a lack of interspecific DNA hybridization, is likely to include genes with a more specialized function and genes that have been subjected to divergent selection. Genes structurally similar to disease resistance genes are an example of this latter class of DNA sequence (Meyers et al. 1998 Down). Leister et al. 1998 Down postulates that reorganization of disease resistance genes during cereal evolution was rapid. Sequencing of the maize chromosome 4 region between the Adh2 and the MAPK genes would allow further insights into the divergence of these disease resistance–like genes between maize and rice.

As noted earlier, a significant exception to the colinearity between the rice chromosome 11 and the maize chromosome 4 occurs at the Adh1 locus. Whereas in rice, Adh1 and Adh2 are only 35 kb apart, in maize and sorghum these two loci are on different chromosomes (Adh1 is on maize chromosome 1 and sorghum linkage group C) (Paterson et al. 1995 Down). Nakajima and co-workers observed a lack of colinearity of chromosome segments surrounding maize and rice Adh1 loci (Tikhonov et al. 1999 Down). A sequence-based analysis of the phylogenetic relationship shows that maize and rice Adh1 as well as maize and rice Adh2 are pairs of orthologs (results not shown). Gaut et al. 1999 Down proposed recently that Adh duplicated into Adh1 and Adh2 before the radiation of the grasses occurred ~65 million years ago. These observations are reconciled by proposing a translocation of the Adh1 locus to a different chromosome in the lineage leading to maize and sorghum. The proposed translocation occurred before the divergence of maize and sorghum, as indicated by the overall colinearity observed in the Adh1 regions of these species (Bennetzen et al. 1998 Down; Tikhonov et al. 1999 Down). The boundaries of the Adh1 translocated region can be circumscribed to the small physical interval (34 kb) between the Adh2 and the RZ53 genes. Interestingly, the maize genome effectively doubled in size after its evolutionary separation from sorghum, but the Adh1 and Adh2 loci are still single copy in maize, as evidenced by hybridization patterns and genetic mapping (data not shown).

These data indicate that gene duplication, followed by divergent selection at different rates, and small translocations involving single genes play a role in the evolution of cereal genomes. On the background of overall colinearity, small rearrangements were also identified between maize and sorghum in the Adh1 region, extending beyond the borders of the proposed Adh1 translocation (Tikhonov et al. 1999 Down). The frequency and molecular nature of such events will become clearer once larger segments of both rice and maize genomic DNA sequence become available. These evolutionary events also complicate the use of rice as a system for synteny-based gene isolation or identification in other grass genomes. Nonetheless, rice, with its relatively simple genome organization, remains attractive for comparative genome studies with grasses.


* METHODS
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

Physical Mapping
The rice bacterial artificial chromosome (BAC) library from a double haploid line (YT14) derived from a cross between Oryza sativa ssp japonica cv Yashiro-mochi and Tsuyuake was kindly provided by B. Valent and K.-S. Wu (DuPont Co., Wilmington, DE) (Wu et al. 1996 Down). The BAC library was gridded onto fourteen 8 x 12-cm filters in a 4 x 4 pattern by using an HDR 96-Pin tool for Beckman Biomek 1000 (Beckman Instruments, Fullerton, CA). Filters were produced and processed under the conditions recommended by Olsen et al. 1993 Down.

BAC85C11, BAC92H8, BAC166F9, and BAC196E1 were isolated by hybridization with a 1.3-kb rice alcohol dehydrogenase Adh1 probe generated by amplification of rice genomic DNA with primers OSADH1.C.for (5'-GGAAGCCCATTTACCATTT-3') and OSADH1.C. rev (5'-GCCCAGGATACACAGAAGA-3').

The hybridization probes were labeled with 32P-dCTP by using the RadPrime labeling kit (Life Technologies, Rockville, MD) according to the manufacturer's instructions.

Hybridization was performed in a solution of 1 M NaCl, 50 mM Tris-HCl, pH 7.5, 1% SDS, and 5% dextran sulfate at 65°C overnight with a final wash in 0.1 x SSPE (1 x SSPE is 150 mM NaCl, 10 mM NaH2PO4, and 1 mM EDTA, pH 7.4), and 0.1% SDS at 65°C.

The following primers were used to test the isolated BAC clones for the presence of the Adh2 locus: OSADH2.5.for (5'-GAGAGAAAAGGCATCCATCC-3'), OSADH2.5.rev (5'-AGGGCGGTGTAGAGGATCTT-3'), OSADH2.CD.for (5'-GGTGTGTGTGTGGTTTCTGC-3'), OSADH2.CD.rev (5'-AGTCCACCGTTGGTCATCTC-3'), OSADH2.AG.for (5'-GAGTCTCCGCTGCGTCAT-3'), and OSADH2.AG.rev (5'-TCTCATCCATTTTTTGCTTTCA-3'). The primers, which were designed on the basis of the rice Adh2 locus sequence (GenBank accession number M36469), were used in all possible forward and reverse combinations.

BAC DNA was extracted by using an alkaline lysis procedure followed by cesium chloride gradient purification (Sambrook et al. 1989 Down). The DNA was digested with NotI and HindIII restriction endonucleases, and the DNA fragments that were obtained were compared by using gel electrophoresis to establish the extent of overlap of different BAC clones. BAC end sequences were generated by direct dye terminator sequencing with M13 universal primers.

BAC167C6, BAC178G5, and BAC62F3 were successively isolated by screening pooled BAC clones with polymerase chain reaction (PCR) primers designed from the end sequences of, respectively, BAC85C11, BAC167C6, and BAC178G5.

DNA Sequencing and Assembly
A set of minimally overlapping clones consisting of BAC85C11, BAC178G5, and BAC62F3 were chosen for complete sequencing (Fig 1B). Smaller insert size subclones were sequenced to link BAC85C11 to the Adh2 locus (pAdh2A, pAdh2B, and pAdh2C from BAC166F9) and to BAC178G5 (pBAC85link from BAC167C6).

BAC85C11, BAC178G5, and BAC62F3 were sequenced by using a shotgun approach (Bodenteich et al. 1993 Down). Cesium chloride–purified BAC DNA was sheared by nebulization (Roe et al. 1996 Down). End repair was performed by using Pfu DNA polymerase (Stratagene, La Jolla, CA) treatment according to the manufacturer's directions. DNA fragments were size-fractionated and cloned into the SmaI site of pUC18 (Amersham Pharmacia Biotech, Piscataway, NJ). After transformation into Escherichia coli DH10B electrocompetent cells (Life Technologies, Rockville, MD), recombinant clones were randomly picked. DNA templates for sequencing were isolated by using a 96-well alkaline lysis miniprep kit (Advanced Genetic Technologies Corp, Gaithersburg, MD). Sequencing reactions were performed by using the ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction kit with FS AmpliTaq DNA polymerase (PE Applied Biosystems, Foster City, CA) and analyzed on ABI 377 (PE Applied Biosystems) sequencing gels.

The sequence data were assembled by using PHRED/PHRAP software (Green 1996 Down).

Contigs were extended and joined by two successive rounds of primer walking. Primers were designed with the program PRIMO (Li et al. 1997 Down) in the version for Sun (Sun Microsystems, Inc., Palo Alto, CA). Remaining gaps were filled by sequencing PCR products bridging the ends of the existing contigs. Regions without adequate sequencing depth or with assembly ambiguities were subcloned and resequenced.

Clones pAdh2A, pAdh2B, pAdh2C, and pBAC85link were sequenced by using the Prism Primer Island Transposition kit (PE Applied Biosystems).

Sequence Analysis
Homology searches against public and private (DuPont) databases were used to identify candidate genes in the region. The final sequence of the region was divided into 3-kb overlapping fragments and searched for nucleic or protein homologies by using the BLAST program (Altschul et al. 1997 Down). The GENSCAN program (Burge and Karlin 1997 Down) was used for gene predictions. Programs from the Genetics Computer Group (Madison, WI) were used for sequence comparison and to identify motifs in the sequenced region. The DOTTER program (Sonnhammer and Durbin 1995 Down) was used to identify and classify repeat families and miniature inverted repeat transposable elements (MITEs) (Wessler et al. 1995 Down). Lasergene software (DNAStar, Inc., Madison, WI) was used for sequence similarity analysis.

Comparative Mapping
Eight expressed sequence tag (EST) clones and five genomic subclones containing gene regions of interest were used as restriction fragment length polymorphism probes on maize genomic DNA to determine the conservation and map position of these clones and subclones in maize. The ESTs included rr1.pk0002.b4 (Adh2), rls2.pk0022.f10 (PIK), and rlr24.pk0099.c2 (MAPK). High-stringency hybridization and washing conditions were used (final wash in 0.1 x SSPE and 0.1% SDS at 65°C). Probes that did not hybridize at high stringency were hybridized at lower stringency, but this did not improve the results. Maize populations ALEB9 (86 individuals; pooled F3 progeny from an R67 x P38 cross) and DRAG2 (86 individuals; pooled F3 progeny from an ED0 x MW0 cross) were used for mapping purposes. The resulting segregation data were used to place the genes on the maize genetic map with Mapmaker 3.0b (Lander et al. 1987 Down). The order of markers on maize chromosome 4 was inferred on the basis of their relative positions with respect to umc31.


* FOOTNOTES

1 Current address: Plant Research International, P.O. Box 16 6700 AA, Wageningen, The Netherlands. *


* ACKNOWLEDGMENTS

We thank Barbara Valent and Kunsheng Wu for sharing the rice BAC library, Mike Hanafey and Romeo Hubner for bioinformatics assistance, Michele Morgante for many stimulating discussions, Maureen Dolan for accommodating our sequencing needs and for comments on the manuscript, Bruce Roe for his advice on genomic sequencing, and Barbara Mazur for her support. We also thank the anonymous reviewers for suggesting improvements to the manuscript.

Received October 18, 1999; accepted January 17, 2000.


* REFERENCES
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

Ahn, S., and Tanksley, S.D. (1993) Comparative linkage maps of the rice and the maize genomes. Proc. Natl. Acad. Sci. USA 90:7980-7984[Abstract/Free Full Text].

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402[Abstract/Free Full Text].

Arumuganathan, K., and Earle, E.D. (1991) Nuclear DNA content of some important plant species. Plant Mol. Biol. Rep. 9:208-218.

Bennetzen, J.L., SanMiguel, P., Chen, M., Tikhonov, A., Francki, M., and Avramova, Z. (1998) Grass genomes. Proc. Natl. Acad. Sci. USA 95:1975-1978[Abstract/Free Full Text].

Bevan, M. et al. (1998) Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana.. Nature 391:485-488[CrossRef][Medline].

Bodenteich, A., Chissoe, S., Wang, Y.F., and Roe, B.A. (1993) Shot-gun cloning as the strategy of choice to generate templates for high-throughput dideoxynucleotide sequencing. In Venter J.C., ed. Automated DNA Sequencing and Analysis Techniques. London, Academic Press. 42–50.pp.

Burge, C., and Karlin, S. (1997) Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268:78-94[CrossRef][ISI][Medline].

Chen, M., SanMiguel, P., de Oliveira, A.C., Woo, S.S., Zhang, H., Wing, R.A., and Bennetzen, J.L. (1997) Microcolinearity in sh2-homologous regions of the maize, rice, and sorghum genomes. Proc. Natl. Acad. Sci. USA 94:3431-3435[Abstract/Free Full Text].

Chen, S., Zheng, L., Dean, D.R., and Zalkin, H. (1997) Role of NifS in maturation of glutamine phosphoribosylpyrophosphate amidotransferase. J. Bacteriol. 179:7587-7590[Abstract/Free Full Text].

Devos, K.M., Chao, S., Li, Q.Y., Simonetti, M.C., and Gale, M.D. (1994) Relationship between chromosome 9 of maize and wheat homeologous group 7 chromosomes. Genetics 138:1287-1292[Abstract].

Gale, M.D., and Devos, K.M. (1998) Comparative genetics in the grasses. Proc. Natl. Acad. Sci. USA 95:1971-1974[Abstract/Free Full Text].

Gaut, B.S., Peek, A.S., Morton, B.R., and Clegg, M.T. (1999) Patterns of genetic diversification within the Adh1 gene family in the grasses (Poaceae). Mol. Biol. Evol. 16:1086-1097[Abstract].

Gierl, A., Saedler, A., and Peterson, P.A. (1989) Maize transposable elements. Annu. Rev. Genet. 23:71-85[CrossRef][Medline].

Goto, I., Jaw, Y.L., and Baluch, A.A. (1981) Genetic studies on resistance of rice plant to blast fungus. IV. Linkage analysis of four genes, Pi-a, Pi-k, Pi-z and Pi-i. Ann. Phytopathol. Soc. Jpn. 47:252-254.

Grandbastien, M.A. (1992) Retroelements in higher plants. Trends Genet. 8:103-108[Medline].

Green, P. (1996) Towards completely automated sequence assembly. DOE Human Genome Program Contractor-Grantee Workshop V, 157. Washington, DC, U.S. Department of Energy, Office of Science, Office of Biological and Environmental Research.

Hammond-Kosack, K.E., and Jones, J.D.G. (1997) Plant disease resistance genes. Annu. Rev. Plant Physiol. Plant Mol. Biol. 48:1-39[CrossRef][ISI].

Hartl, D.L., Lohe, A.R., and Lozovskaya, E.R. (1997) Modern thoughts on an ancyent marinere: Function, evolution, regulation. Annu. Rev. Genet. 31:337-358[CrossRef][ISI][Medline].

Harushima, Y. et al. (1998) A high-density rice genetic linkage map with 2275 markers using a single F2 population. Genetics 148:479-494[Abstract/Free Full Text].

Kennedy, C., and Dean, D. (1992) The nifU, nifS and nifV gene products are required for activity of all three nitrogenases of Azotobacter vinelandii.. Mol. Gen. Genet. 231:494-498[CrossRef][ISI][Medline].

Kilian, A., Kudrna, D.A., Kleinhofs, A., Yano, M., Kurata, N., Steffenson, B., and Sasaki, T. (1995) Rice–barley synteny and its application to saturation mapping of the barley Rpg1 region. Nucleic Acids Res. 23:2729-2733[Abstract/Free Full Text].

Kurata, N., Moore, G., Nagamyra, Y., Foote, T., Yano, M., Minobe, Y., and Gale, M. (1994) Conservation of genome structure between rice and wheat. Bio/Technology 12:276-278[CrossRef].

Lagerkrantz, U., Ellegren, H., and Andersson, L. (1993) The abundance of various polymorphic microsatellite motifs differs between plants and animals. Nucleic Acids Res. 21:1111-1115[Abstract/Free Full Text].

Lander, E., Green, P., Abrahamson, J., Barlow, A., Daley, M., Lincoln, S., and Newburg, L. (1987) MAPMAKER: An interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics 1:174-181[CrossRef][Medline].

Leister, D., Kurth, J., Laurie, D.A., Yano, M., Sasaki, T., Devos, K., Graner, A., and Schulze-Lefert, P. (1998) Rapid reorganization of resistance gene homologues in cereal genomes. Proc. Natl. Acad. Sci. USA 95:370-375[Abstract/Free Full Text].

Li, P., Kupfer, K.C., Davies, C.J., Burbee, D., Evans, G.A., and Garner, H.R. (1997) PRIMO: A primer design program that applies base quality statistics for automated large-scale DNA sequencing. Genomics 40:476-485[CrossRef][ISI][Medline].

Manninen, I., and Schulman, A.H. (1993) BARE-1, a Copia-like retroelement in barley (Hordeum vulgare L.). Plant Mol. Biol. 22:829-846[CrossRef][ISI][Medline].

Matsuoka, Y., and Tsunewaki, K. (1997) Presence of wheat retrotransposons in Gramineae species and the origin of wheat retrotransposon families. Genes Genet. Syst. 72:335-343[CrossRef][ISI][Medline].

Meyers, B.C., Shen, K.A., Rohani, P., Gaut, B.S., and Michelmore, R.W. (1998) Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection. Plant Cell 11:1833-1846.

Moore, G., Devos, K.M., Wang, Z., and Gale, M.D. (1995) Cereal genome evolution. Grasses, line up and form a circle. Curr. Biol. 5:737-739[CrossRef][ISI][Medline].

Nacken, W.K.F., Piotrowiak, R., Saedler, H., and Sommer, H. (1991) The transposable element Tam1 from Antirrhinum majus shows structural homology to the maize transposon En/Spm and has no sequence specificity of insertion. Mol. Gen. Genet. 228:201-208[CrossRef][ISI][Medline].

Nakai, Y., Yoshihara, Y., Hayashi, H., and Kagamiyama, H. (1998) cDNA cloning and characterization of mouse NifS-like protein, m-Nfs1: Mitochondrial localization of eukaryotic NifS-like proteins. FEBS Lett. 433:143-148[CrossRef][ISI][Medline].

Nei, M., Gu, X., and Sitnikova, T. (1997) Evolution by the birth-and-death process in multigene families of the vertebrate immune system. Proc. Natl. Acad. Sci. USA 94:7799-7806[Abstract/Free Full Text].

Olsen, A.S., Combs, J., Garcia, E., Elliott, J., Amemiya, C., de Jong, P., and Threadgill, G. (1993) Automated production of high density cosmid and YAC colony filters using a robotic workstation. BioTechniques 14:116-123[Medline].

Paterson, A.H., Lin, Y.R., Li, Z., Schertz, K.F., Doebley, J.F., Pinson, S.R.M., Liu, S.C., Stansel, J.W., and Irvine, J.E. (1995) Convergent domestication of cereal crops by independent mutations at corresponding genetic loci. Science 269:1714-1718[Abstract/Free Full Text].

Pereira, A., Cuypers, H., Gierl, A., Schwarz-Sommer, Z., and Saedler, H. (1986) Molecular analysis of the En/Spm transposable element system of Zea mays.. EMBO J. 5:835-841[ISI][Medline].

Roe, B.A., Crabtree, J.S., and Khan, A.S. (1996) DNA Isolation and Sequencing. New York, John Wiley and Sons.

Sambrook, J., Fritsch, E.F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual. 2nd ed Cold Spring Harbor, NY, Cold Spring Harbor Laboratory Press.

SanMiguel, P., Tikhonov, A., Jin, Y.K., Motchoulskaia, N., Zakharov, D., Melake-Berhan, A., Springer, P.S., Edwards, K.J., Lee, M., Avramova, Z., and Bennetzen, J.L. (1996) Nested retrotransposons in the intergenic regions of the maize genome. Science 274:765-768[Abstract/Free Full Text].

SanMiguel, P., Gaut, B.S., Tikhonov, A., Nakajima, Y., and Bennetzen, J.L. (1998) The paleontology of intergene retrotransposons of maize. Nat. Genet. 20:43-45[CrossRef][ISI][Medline].

Sasaki, T. (1998) The rice genome project in Japan. Proc. Natl. Acad. Sci. USA 95:2027-2028[Abstract/Free Full Text].

Smyth, D.R., Kalitsis, P., Joseph, J.L., and Sentry, J.W. (1989) Plant retrotransposon from Lilium henryi related to Ty3 of yeast and the Gypsy group of Drosophila.. Proc. Natl. Acad. Sci. USA 86:5013-5019.

Sonnhammer, E.L., and Durbin, R. (1995) A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene 167:GC1-GC10[CrossRef][ISI][Medline].

Thomson, K.G., Thomas, J.E., and Dietzgen, R.G. (1998) Retrotransposon-like sequences integrated into the genome of pineapple, Ananas comosus.. Plant Mol. Biol. 38:461-465[CrossRef][ISI][Medline].

Tikhonov, A.P., SanMiguel, P.J., Nakajima, Y., Gorenstein, N.M., Bennetzen, J.L., and Avramova, Z. (1999) Colinearity and its exceptions in orthologous adh regions of maize and sorghum. Proc. Natl. Acad. Sci. USA 96:7409-7414[Abstract/Free Full Text].

Walker, J.C. (1993) Receptor-like protein kinase genes of Arabidopsis thaliana.. Plant J. 3:451-456[CrossRef][ISI][Medline].

Wang, G.-L., Song, W.-Y., Ruan, D.-L., Sideris, S., and Ronald, P.C. (1996) The cloned gene, Xa21, confers resistance to multiple Xanthomonas oryzae pv. oryzae isolates in transgenic plants. Mol. Plant-Microbe Interact. 9:850-855[ISI][Medline].

Weber, J., and May, P.E. (1989) Abundant class of human DNA polymorphisms which can be typed using the polymerase chain reaction. Am. J. Hum. Genet. 44:388-396[ISI][Medline].

Wessler, S.R., Bureau, T.E., and White, S.E. (1995) LTR-retrotransposons and MITEs: Important players in the evolution of plant genomes. Curr. Opin. Genet. Dev. 5:814-821[CrossRef][ISI][Medline].

Wu, K.S., Martinez, C., Lentini, Z., Thome, J., Chumley, F.G., Scolnik, P.A., and Valent, B. (1996). Cloning a blast resistance gene by chromosome walking. In Rice Genetics III. Proceedings of the Third International Rice Genetics Symposium, October 16–20, 1995, G.S. Khush, ed (Manila, The Philippines: International Rice Research Institute), pp. 669–674.

Yang, Y., Shah, J., and Klessig, D.F. (1997) Signal perception and transduction in plant defense responses. Genes Dev. 11:1621-1639[Free Full Text].

Zhang, J., Rosenberg, H.F., and Nei, M. (1988) Positive Darwinian selection after gene duplication in primate ribonuclease genes. Proc. Natl. Acad. Sci. USA 95:3708-3713[Abstract/Free Full Text].


* ACCESSION NUMBER
*TOP
*ABSTRACT
*INTRODUCTION
*RESULTS
*DISCUSSION
*METHODS
*REFERENCES
*ACCESSION NUMBER

The GenBank accession number for the sequence DUPR11 reported here is AF172282.





This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via CrossRef
Right arrow Citing Articles via ISI Web of Science (127)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.
Agricola
Right arrow Articles by Tarchini, R.
Right arrow Articles by Rafalski, A.


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
ASPB Publications THE PLANT CELL PLANT PHYSIOLOGY
Copyright © 2000 by the American Society of Plant Biologists