• 沒有找到結果。

蝴蝶蘭葉綠體基因體之研究

N/A
N/A
Protected

Academic year: 2021

Share "蝴蝶蘭葉綠體基因體之研究"

Copied!
11
0
0

加載中.... (立即查看全文)

全文

(1)

行政院國家科學委員會專題研究計畫 成果報告

蝴蝶蘭葉綠體基因體之研究

計畫類別: 個別型計畫

計畫編號: NSC92-2317-B-006-004-

執行期間: 92 年 08 月 01 日至 93 年 07 月 31 日 執行單位: 國立成功大學生物科技研究所

計畫主持人: 張清俊

計畫參與人員: 林咸嘉,林惠茹

報告類型: 精簡報告

處理方式: 本計畫涉及專利或其他智慧財產權,1 年後可公開查詢

中 華 民 國 93 年 11 月 2 日

(2)

行政院國家科學委員會補助專題研究計畫 ■ 成 果 報 告

□期中進度報告

蝴蝶蘭葉綠体基因体之研究

計畫類別:▓ 個別型計畫 □ 整合型計畫

計畫編號: NSC92-2317-B006-004

執行期間:九十 二年 八 月 一日 至 九十三 年 七 月 三十一 日

計畫主持人: 張 清 俊

共同主持人:

計畫參與人員: 張清俊,林咸嘉,林惠茹

成果報告類型(依經費核定清單規定繳交):▆精簡報告 □完整報告

本成果報告包括以下應繳交之附件:

□赴國外出差或研習心得報告一份

□赴大陸地區出差或研習心得報告一份

□出席國際學術會議心得報告及發表之論文各一份

□國際合作研究計畫國外研究報告書一份

處理方式:除產學合作研究計畫、提升產業技術及人才培育研究計畫、列

管計畫及下列情形者外,得立即公開查詢

□涉及專利或其他智慧財產權,□一年□二年後可公開查詢

執行單位: 國立成功大學生物科技研究所

中 華 民 國 九十三 年 十 月 三十 日

(3)

計畫名稱:蝴蝶蘭葉綠體基因體之研究 計畫編號: NSC92-2317-B006-004

執行期限: 九十二年八月一日至九十三年七月三十一日

主持人姓名: 張 清 俊 執行機構:成功大學生物科技研究所 一、中文摘要

台灣阿嬤蝴蝶蘭葉綠體基因體已定序、比對、註解完成。基因体總長度 148,962 bp,由一個大小 25,731 bp 的反向重覆區域,將其區隔成 85,957 bp 和 11,543 的兩個不同片段。有 108 個不同功能的基因散布於葉綠體基因体中,包括 74 個蛋白質基因,4 個 rRNA 基因,30 個 tRNA 基因。此外,也解析到 27 個未知的 ORF。我們觀察到在全部 11 個 NADH dehydrogenase 的蛋白質單元基因中,有 6 個在蝴蝶蘭葉綠體中巳喪失掉了。與其它物種來做比對,顯示蝴蝶蘭與其他高等 植物葉綠體基因組之序列有很高的相似性。

関鍵詞: 台灣阿嬤蝴蝶蘭、葉綠體基因體

Abstract

The complete nucleotide sequence of the plastid genome of Phalaenopsis Orchid (Phalaenopsis amabilis var. Formosa) has been determined. The circular double stranded DNA of 148,962 bp contains a pair of inverted repeats of 25,731 bp, which are separated by small and large single copy regions of 11,543 and 85,957 bp, respectively. The plastid genome contains 74 proteins genes, 4 ribosomal RNA, 30 tRNA genes. Unlike other plastid genome of angiosperm which have complete set of genes for 11 subunits of NADH dehydrogenase, we observed complete lack of ndhA, ndhH and ndhF genes, and lost of functional ndhE, ndhG and ndhI genes in the plastid genome of Phalaenopsis orchid. It suggests that most of ndh genes have been transfer to nucleus in the process of evolution.

Key words: Phalaenopsis amabilis var. Formosa, plastid genome

(4)

二、緣由與目的(Introduction)

Since the first two complete plastid genome sequences were reported for tobacco (Shinozaki et al., 1986)and liverwort (Ohyama et al., 1986) in 1986, currently over 40 complete sequences representing all of the major lineages of the plant kingdom are available in the NCBI genomic database (http://www.ncbi.nlm.nih.gov). The availability of the plastid genomic sequences have been greatly enhancing our opportunity to explore the chloroplast biogenesis as well as study the phyllogenetic relationship among green plants. The plastid genome of land plants are highly conserved in structure and organization (Sugiura, 1995). Furthermore, complete plastid genomic information have provided a sound basis to genetically engineering the chloroplasts for various applications in plant biotechnology (Daniell et al., 2002).

With complete sequence of plastid genome from major land plants lineages such as a liverwort Marchantia (Ohyama et al., 1986),two ferns Adiantum and Psilotum (Wakasugi et al., 2000; Wolf et al., 2003), black pine of gymnosperm (Wakasugi et al., 1994), and angiosperms such as tobacco (Shinozaki et al., 1986), spinach (Schmitz-Linneweber et al, 2001), oenothera (Hupfer et al., 2000), arabidopsis (Sato et al., 1999), lotus (Kato et al., 2000), medicago (Lin et al., 2001), rice (Hiratsuka et al., 1989), maize (Maier et al., 1995), wheat (Ogihara et al., 2002), the phylogenetic relationship of land plants have been deduced, and the monocot-dicot divergence have been dated back to less than 150 million years ago (Chaw, et. al., 2004). Nevertheless, the origin and early evolution of angiosperms is still poorly understood. Nymphaea and Amborella both are vesselless plants which were regarded as the deepest branching among angiosperms (Parkinson et al., 1999).

However, with recent complete plastomic sequence of Nymphaea alba and Amborella trichopoda revealed that they are not basal angiosperm (Goremykin et. al., 2003;

2004); in contrast, grass family of monocot lineage is the closest phylogenetic relationship to gymnosperm (Goremykin et. al., 2004). Since the availability of the complete plastomic sequence of monocot plants is only limited to three closely related grasses, rice, maize and wheat, the divergent point between gynopsperm and angiosperm are still need further investigated. Apparently, with increasing the number of complete plastomic sequences from diverse family of monocot plant, the phylogenetic origin of angiosperm will begin to emerge.

Orchidaceae is one of the largest (about 30,000 species) and most complex families of monocots. In addition to classical morphological systematic classification within orchidaceae, a number of studies employ molecular markers from nucleus, mitochondria or plastid genes to classify family/subfamilies, tribes, subtribes, genus and even species within orchidaceae (Cameron et al., 1999; Cameron, 2004).

Phalaenopsis, a monopodial epiphytic orchid, belongs to tribe vandeae in orchidaceae

(5)

family. Phalaenopsis orchid is a fascinating and important ornamental flower in floral industry. There are 63 species of wild Phalaenopsis orchid in the world distribute between the 23o latitude of both hemispheres, ranging throughout southern Taiwan, the Philippines, New Guinea, northern Australia, Indonesia, Malaysia, Indo-China, and to the hills of the Himalayas (Christenson, 2001). Two native species of Phalaenopsis have been reported in Taiwan, P. equestris and P. amabilis var. formosa.

They are frequently used as elite parent in commercial breeding. Due to the lack of proper protection, these two species are near extinction in the wild. Fortunately, they have been artificially propagated and produced in large quantities. As a first step to genetically engineering Phalaenopsis orchid chloroplast genome, we complete the plastid genome sequence of P. amabilis var. Formasa. In addition, the complete chloroplast genome data of P. amabilis var. Formasa will contribute to a better understanding of photosynthetic crassulacean acid mechanism (CAM) of orchidaceae plants, and could also provide molecular systematic information for the species of orchidaceae family.

三、研究方法(Materials And Methods)

Leaves of Phalaenopsis amabilis var. Formosa were obtained from four leaves stage of seedlings. Intact chloroplast was fractionated with step percoll (40-80%) gradient as described (Palmer, 1986). Chloroplast DNA was further isolated using CTAB-based protocol as described (Stewart et al., 1993). Chloroplast DNA was randomly fragmented into 2 to 3 kb in size with hydroshear, and clone into pBluescriptSK vector. After transformation of XL1 blue competent cell (Stratagene, La Jolla, USA), shot-gun clones were picked up and propagated on microtiter plates.

DNA were isolated and used as templates for sequencing analysis.

The sequencing reaction was performed by using the BigDye terminator cycle sequencing kit (Applied Biosystems, USA) according to the protocol recommended by manufactures. The DNA sequencer used was ABI 3700 of Applied Biosystems.

The sequence data from both end of each shot-gun clone were accumulated, trimmed, aligned and assembled using Phred-Phrap programs (Phil Green, University of Washington, Seattle, USA). Database searches were done with the BLAST algorithm provided by the National Center for Biotechnology Information (NCBI, USA.). For assignment of tRNA genes, the tRNAscan program was applied for prediction.

四、結果和討論 (Results And Disscussion)

The complete circular chloroplast DNA of Phalaenopsis orchid is 148,962 bp long and can be separated into two regions by a pairs of inverted repeat region (IR) of 25,731 bp. The large single copy region and small single copy region are

(6)

85,957 bp and 11,543 bp, respectively. The overall G+C content of the genome is 36.6%, close to that of other plants in angiosperms (38-39%). We also observed that the structure and organization of orchid plastid genome is highly conserved with that of tobacco, in contrast to extensive rearrangement of chloroplast large single copy (LSC) region due to inversion was reported within graminean family (Hiratsuka et. al., 1989).

The genes contained in Phalaenopsis orchid chloroplast DNA are listed in Table 1. We identified 108 genes (not included gene duplicated in the inverted repeat) of known function scattered around plastid genome. Those genes include 4 ribosomal RNA genes; 30 transfer RNA genes; 72 protein-coding genes.

Seventeen genes including 6 tRNA genes and 11 protein-coding genes contain one or two introns. The intron-containing gene in orchid is the same as tobacco.

Three genes such as rpoC1, clpP and rps12 contain two introns, in contrast to lost of introns in rpoC1 and clpP genes in plastid genome of graminean family of monocot plants. Orchid plastid genome contain a complete set of 30 tRNA genes for protein synthesis, the same as that of other higher plants in plastid genome.

Six tRNA genes contain intron. Previous report in the chloroplast genome of marsh orchid, the intron of trnL-UAA contain a minisatellite repeat locus (Cafasso, et. al., 2001); and the minisatellite repeat reveal some degree of variation during evolution among close-related orchid lineage (Cozzolino et. al., 2003). However, we found that the intron of plastid trnL-UAA gene of Phalaenopsis amabilis var.

Formosa did not have minisatellite repeat present. All 11 subunits of NADH dehydrogenase genes are present in the plastid genome of angiosperm, but completely lost in the pinus of gymnosperm (Wakasugi et. al., 1994). Surprsingly, we observed that intact ndhB, ndhC, ndhD, ndhJ and ndhK genes as well as truncated sequence of ndhE, ndhG and ndhI genes present in the chloroplast genome; but lack of ndhA, ndhF and ndhH genes in the plastid genome of orchid.

Apparently, ndhE, ndhG and ndhI are psudogenes, since entire sequence have been deleted 60% for ndhE gene, 45% for ndhG gene and 85% for ndhI gene at the 5’-terminal end as compared with other land plants. It suggest that most of ndh genes in plastid genome of orchid may have transfer to nucleus in the evolutionary process. Independently lost of functional protein-coding genes were observed among vascular plants. For instance, four ribosomal protein genes are either lost or as psudogene from plastid genomes of some plant species. The first, rpl21, is lost in all plant species except in Marcantia, Psilotum and Adiantum. The second, rpl22, is lost in the plastid genome of legume plant such as Lotus and Medicago of Fabaceae family. The third, rpl23, is present in all species except spinach as a psudogene. The forth gene, rps16 is lost in the chloroplast genome of

(7)

Marchantia, Psilotum, Pinus and Medicago. accD gene is either lost or as pseudogenes in rice, wheat and maize of poaceae family. infA is either lost or as pseudogene in the chloroplast genome of Arabidopsis, Lotus, Medicago, Atropa and tobacco. psaM gene is only present in the Marchantia, Psilotum and Pinus chloroplast genome. In addition, chlB, chlL, chLN is only present in the chloroplast genome of Marchantia and Pinus.

The orchid chloroplast genome contains potential open reading frames (ORFs) for which no known function has been assigned. Some of them are conserved among plastomes referred to as hypothetical chloroplast reading frames (ycf), suggesting that they encode functional polypeptides, for instance, ycf1 and ycf2 are essential for cell survival in tobacco (Drescher et al., 2000). The orchid plastid genome possesses ycf1 and ycf2, in contrast, them are lost in that of grass family of monocot plants. In addition, we have identified 27 putative ORF with a threshold of 225 bp (Table 1). Using the same threshold of 225 bp, 18 ORF were identified in plastid genome of Chlamydomonase, but no RNA transcripts were detected with RNA filter hybridization (Mude, 2002).

(8)

Table 1: List of gene contents in orchid plastid genome

RNA genes

Ribosomal RNA genes rrn23b, rrn16b, rrn5b, rrn4.5b

Transfer RNA genes trnA(UGC)ab, trnC(GCA), trnD(GUC), trnE(UUC), trnF(GAA), trnG(GCC), trnG(UCC)a trnH(GUG)b, trnI(CAU), trnI(GAU)ab, trnK(UUU)a, trnL(CAA)b, trnL(UAG), trnL(UAA)a, trnM(CAU)b, trnfM(CAU), trnN(GUU)b, trnP(UGG), trnQ(UUG), trnR(UCU), trnR(ACG)b, trnS(GCU), trnS(UGA), trnS(GGA), trnT(GGU), trnT(UGU), trnV(GAC)b, trnV(UAC)a, trnW(CCA), trnY(GUA)

Polypeptide genes

Ribosomal protein genes Small subunit

Large subunit

rps2, rps3, rps4, rps7b, rps8, rps11, rps12ab, rps14, rps15, rps16a, rps18, rps19b rpl2abd, rpl14, rpl16a, rpl20, rpl22, rpl23b, rpl32, rpl33, rpl36

Transcription/translation apparatus genes RNA polymerase

Translation factor

rpoA, rpoB, rpoC1a, rpoC2 infA

Photosynthesis Photosystem I Photosystem II

Cytochrome b6/f ATP synthase

NADH dehydrogenase Rubiscoc

psaA, psaB, psaC, psaI, psaJ, ycf3a, ycf4

psbA, psbB, psbC, psbD, psbE, psbF, psbG, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ (ycf9)

petA, petBa, petDa, petG, petL, petN atpA, atpB, atpE, atpFa, atpH, atpI

ndhBab, ndhC, ndhDd,ψ-ndhEe, ψ-ndhGe, ψ-ndhIe, ndhJ, ndhK rbcL

Miscellaneous proteins accD, clpPa, matK, cemA, ccsA Conserved proteins ycf1, ycf2b

Putative ORFs ORF87A(3166g) ORF77A(15391fg)b ORF80(31670g) ORF87C(93770g) ORF131(5755 fg)b ORF79(16318fg)b ORF98(52593g) ORF85(99063f) ORF91(6756fg)b ORF81B(17935fg)b ORF87B(53845g) ORF99(100018g) ORF170(10690fg)b ORF90(22728fg)b ORF106(63649g) ORF79(99789f) ORF86A(10947 fg)b ORF119(23949fg)b ORF96(80598f) ORF81C(103872g) ORF81A(13415fg)b ORF86B(29187fg)b ORF103(81773g) ORF77B(142291f) ORF115(13269fg)b ORF114(30091g) ORF88(91657f)

a. Intron containing gene; b. Two copies due to inverted repeat; c. Ribulose-1, 5-bisphosphate carboxylase/oxygenase.

d. Initiation codon creates by RNA editing; e. Pseudo gene(ψ); f. ORF located in forward strand;

g. ORF located in complementary strand. *The position of start codon of putative ORF is given in parenthesis.

(9)

五、參考文獻(References)

Burrows, P. A., L. A. Sazanov, et al. (1998). "Identification of a functional respiratory complex in chloroplasts through analysis of tobacco mutants containing disrupted plastid ndh genes." Embo J 17(4): 868-76.

Cafasso, D., G. Pellegrino, et al. (2001). "Characterization of a minisatellite repeat locus in the chloroplast genome of Orchis palustris (Orchidaceae)." Curr Genet 39(5-6):

394-8.

Cameron, K. M. (2004). "Utility of plastid psaB gene sequences for investigating intrafamilial relationships within Orchidaceae." Mol Phylogenet Evol 31(3):

1157-80.

Chaw, S. M., C. C. Chang, et al. (2004). "Dating the monocot-dicot divergence and the origin of core eudicots using whole chloroplast genomes." J Mol Evol 58(4):

424-41.

Chu, K. H., J. Qi, et al. (2004). "Origin and phylogeny of chloroplasts revealed by a simple correlation analysis of complete genomes." Mol Biol Evol 21(1): 200-6.

Cozzolino, S., D. Cafasso, et al. (2003). "Molecular evolution of a plastid tandem repeat locus in an orchid lineage." J Mol Evol 57 Suppl 1: S41-9.

Daniell, H., M. S. Khan, et al. (2002). "Milestones in chloroplast genetic engineering: an environmentally friendly era in biotechnology." Trends Plant Sci 7(2): 84-91.

Drescher, A., S. Ruf, et al. (2000). "The two largest chloroplast genome-encoded open reading frames of higher plants are essential genes." Plant J 22(2): 97-104.

Goremykin, V. V., K. I. Hirsch-Ernst, et al. (2003). "Analysis of the Amborella trichopoda chloroplast genome sequence suggests that amborella is not a basal angiosperm."

Mol Biol Evol 20(9): 1499-505.

Goremykin, V. V., K. I. Hirsch-Ernst, et al. (2004). "The chloroplast genome of

Nymphaea alba: whole-genome analyses and the problem of identifying the most basal angiosperm." Mol Biol Evol 21(7): 1445-54.

Hiratsuka, J., H. Shimada, et al. (1989). "The complete sequence of the rice (Oryza sativa) chloroplast genome: intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals."

Mol Gen Genet 217(2-3): 185-94.

Hupfer, H., M. Swiatek, et al. (2000). "Complete nucleotide sequence of the Oenothera elata plastid chromosome, representing plastome I of the five distinguishable euoenothera plastomes." Mol Gen Genet 263(4): 581-5.

Kato, T., T. Kaneko, et al. (2000). "Complete structure of the chloroplast genome of a legume, Lotus japonicus." DNA Res 7(6): 323-30.

Lin, S., H. Wu, et al. "Medicago truncatula Variety Jema Long A-17 Chloroplast, Complete Sequence." Unpublished.

(10)

Maier, R. M., K. Neckermann, et al. (1995). "Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic

information by transcript editing." J Mol Biol 251(5): 614-28.

Maul, J. E., J. W. Lilly, et al. (2002). "The Chlamydomonas reinhardtii plastid

chromosome: islands of genes in a sea of repeats." Plant Cell 14(11): 2659-79.

Moon, E., T. H. Kao, et al. (1987). "Rice chloroplast DNA molecules are heterogeneous as revealed by DNA sequences of a cluster of genes." Nucleic Acids Res 15(2):

611-30.

Ogihara, Y., K. Isono, et al. (2002). "Structural features of a wheat plastome as revealed by complete sequencing of chloroplast DNA." Mol Genet Genomics 266(5):

740-6.

Ohyama, K. (1996). "Chloroplast and mitochondrial genomes from a liverwort, Marchantia polymorpha--gene organization and molecular evolution." Biosci Biotechnol Biochem 60(1): 16-24.

Ohyama, K., H. Fukuzawa, et al. (1988). "Structure and organization of Marchantia polymorpha chloroplast genome. I. Cloning and gene identification." J Mol Biol 203(2): 281-98.

Palmer, J. D. (1986). Isolation and structural analysis of chloroplast DNA. In: Methods in Enzymology, Academic Press, New York.

Parkinson, C. L., K. L. Adams, et al. (1999). "Multigene analyses identify the three earliest lineages of extant flowering plants." Curr Biol 9(24): 1485-8.

Sato, S., Y. Nakamura, et al. (1999). "Complete structure of the chloroplast genome of Arabidopsis thaliana." DNA Res 6(5): 283-90.

Schmitz-Linneweber, C., R. M. Maier, et al. (2001). "The plastid chromosome of spinach (Spinacia oleracea): complete nucleotide sequence and gene organization." Plant Mol Biol 45(3): 307-15.

Shinozaki, K., M. Ohme, et al. (1986). "The complete nucleotide sequence of tobacco chloroplast genome:its gene organization and expression." EMBO J 5:

2043-2049.

Stewart, C. N., Jr. and L. E. Via (1993). "A rapid CTAB DNA isolation technique useful for RAPD fingerprinting and other PCR applications." Biotechniques 14(5): 748-50.

Sugiura, M. (1995). "The chloroplast genome." Essays Biochem 30: 49-57.

Wakasugi, T., J. Tsudzuki, et al. (1994). "Loss of all ndh genes as determined by sequencing the entire chloroplast genome of the black pine Pinus thunbergii."

Proc Natl Acad Sci U S A 91(21): 9794-8.

Wolf, P. G., C. A. Rowe, et al. (2003). "Complete nucleotide sequence of the chloroplast genome from a leptosporangiate fern, Adiantum capillus-veneris L." DNA Res

(11)

10(2): 59-65.

六、計劃成果自評

已依計劃進度完成蝴蝶蘭葉綠體基因體之定序、基因註解的工 作。並繪製完成整個葉綠體基因體之圖譜,正在撰寫論文,以供發表於學 術期刊。

參考文獻

相關文件

An additional senior teacher post, to be offset by a post in the rank of CM or Assistant Primary School Master/Mistress (APSM) as appropriate, is provided to each primary

vs Functional grammar (i.e. organising grammar items according to the communicative functions) at the discourse level2. “…a bridge between

(b) reviewing the positioning of VPET in the higher education system in Hong Kong, exploring the merits of developing professional vocational qualifications at the degree

An additional senior teacher post, to be offset by a post in the rank of APSM, is provided to each primary special school/special school with primary section that operates six or

220V 50 Hz single phase A.C., variable stroke control, electrical components and cabling conformed to the latest B.S.S., earthing through 3 core supply cable.. and 2,300 r.p.m.,

Wang, Solving pseudomonotone variational inequalities and pseudocon- vex optimization problems using the projection neural network, IEEE Transactions on Neural Networks 17

In this paper, we build a new class of neural networks based on the smoothing method for NCP introduced by Haddou and Maheux [18] using some family F of smoothing functions.

Define instead the imaginary.. potential, magnetic field, lattice…) Dirac-BdG Hamiltonian:. with small, and matrix