基于转录组数据揭示4种兜兰的全基因组复制历史

doi:10.11983/CBB21100

[1]

王芳, 王淇, 赵曦阳 (2019). 低温胁迫下植物的表型及生理响应机制研究进展. 分子植物育种 17, 5144-5153.

[本文引用: 1]

[2]

汪浩, 张锐, 张娇, 沈慧, 戴锡玲, 严岳鸿 (2019). 转录组测序揭示翼盖蕨(Didymochlaena trancatula)的全基因组复制历史. 生物多样性 27, 1221-1227.

DOI:10.17520/biods.2019236 [本文引用: 1]

全基因组复制在动植物中普遍存在, 被认为是促进物种进化的重要动力之一。作为蕨类植物的单种科物种, 翼盖蕨(Didymochlaena trancatula)是真水龙骨类I的基部类群, 在蕨类中具有独特的演化地位。本研究基于高通量测序, 通过同义替换率(Ks)分析、相对定年分析揭示翼盖蕨的全基因组复制发生情况。Ks分析表明, 翼盖蕨至少经历了两次全基因组复制事件, 其中一次发生于59-62 million years ago (Mya), 另一次发生于90-94 Mya, 这两次全基因组复制事件分别和白垩纪第三纪的Cretaceous-Tertiary (C-T)大灭绝事件以及翼盖蕨的物种分化时间相吻合。进一步对两次全基因组复制保留的基因进行功能注释和富集分析, 结果显示与转录及代谢调控相关的基因优势被保留。翼盖蕨的全基因组复制事件可能促进了该物种的分化及其对极端环境的适应性。

[3]

王婷, 夏增强, 舒江平, 张娇, 王美娜, 陈建兵, 王慷林, 向建英, 严岳鸿 (2021). 全基因组复制事件的绝对定年揭示莲座蕨属植物的迟滞演化. 生物多样性 29, 722-734.

[本文引用: 1]

[4]

王筠竹, 陈跃, 秦德辉, 陈丽萍, 孙崇波 (2019). 兰科植物染色体研究现状及前景. 分子植物育种 17, 3717-3725.

[本文引用: 1]

[5]

王振怡, 王希胤 (2020). 染色体数目减少及B染色体产生的进化基因组学模型. 中国科学: 生命科学 50, 524-537.

[本文引用: 1]

[6]

杨有新, 王峰, 蔡加星, 喻景权, 周艳虹 (2014). 光质和光敏色素在植物逆境响应中的作用研究进展. 园艺学报 41, 1861-1872.

[本文引用: 1]

[7]

杨志娟

(2006). 兜兰属(Paphiopedilum)植物细胞学及其亲缘关系的研究. 硕士论文. 杨凌: 西北农林科技大学. pp. 12-36.

[本文引用: 1]

[8]

Adams

KL

, Wendel

JF

(2005). Polyploidy and genome evolution in plants. Curr Opin Plant Biol 8, 135-141.

DOI:10.1016/j.pbi.2005.01.001 URL [本文引用: 1]

[9]

Badouin

H

, Gouzy

J

, Grassa

CJ

, Murat

F

, Staton

SE

, Cottret

L

, Lelandais-Brière

C

, Owens

GL

, Carrère

S

, Mayjonade

B

, Legrand

L

, Gill

N

, Kane

NC

, Bowers

JE

, Hubner

S

, Bellec

A

, Bérard

A

, Bergès

H

, Blanchet

N

, Boniface

MC

, Brunel

D

, Catrice

O

, Chaidir

N

, Claudel

C

, Donnadieu

C

, Faraut

T

, Fievet

G

, Helmstetter

N

, King

M

, Knapp

SJ

, Lai

Z

, Le

Paslier MC

, Lippi

Y

, Lorenzon

L

, Mandel

JR

, Marage

G

, Marchand

G

, Marquand

E

, Bret-Mestries

E

, Morien

E

, Nambeesan

S

, Nguyen

T

, Pegot-Espagnet

P

, Pouilly

N

, Raftis

F

, Sallet

E

, Schiex

T

, Thomas

J

, Vandecasteele

C

, Varès

D

, Vear

F

, Vautrin

S

, Crespi

M

, Mangin

B

, Burke

JM

, Salse

J

, Muños

S

, Vincourt

P

, Rieseberg

LH

, Langlade

NB

(2017). The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution. Nature 546, 148-152.

DOI:10.1038/nature22380 URL [本文引用: 1]

[10]

Blanc

G

, Wolfe

KH

(2004). Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes. Plant Cell 16, 1667-1678.

DOI:10.1105/tpc.021345 URL [本文引用: 1]

[11]

Bolger

AM

, Lohse

M

, Usadel

B

(2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114-2120.

DOI:10.1093/bioinformatics/btu170 URL [本文引用: 1]

[12]

Brunner

I

, Herzog

C

, Dawes

MA

, Arend

M

, Sperisen

C

(2015). How tree roots respond to drought. Front Plant Sci 6, 547.

DOI:10.3389/fpls.2015.00547 PMID:26284083 [本文引用: 1]

The ongoing climate change is characterized by increased temperatures and altered precipitation patterns. In addition, there has been an increase in both the frequency and intensity of extreme climatic events such as drought. Episodes of drought induce a series of interconnected effects, all of which have the potential to alter the carbon balance of forest ecosystems profoundly at different scales of plant organization and ecosystem functioning. During recent years, considerable progress has been made in the understanding of how aboveground parts of trees respond to drought and how these responses affect carbon assimilation. In contrast, processes of belowground parts are relatively underrepresented in research on climate change. In this review, we describe current knowledge about responses of tree roots to drought. Tree roots are capable of responding to drought through a variety of strategies that enable them to avoid and tolerate stress. Responses include root biomass adjustments, anatomical alterations, and physiological acclimations. The molecular mechanisms underlying these responses are characterized to some extent, and involve stress signaling and the induction of numerous genes, leading to the activation of tolerance pathways. In addition, mycorrhizas seem to play important protective roles. The current knowledge compiled in this review supports the view that tree roots are well equipped to withstand drought situations and maintain morphological and physiological functions as long as possible. Further, the reviewed literature demonstrates the important role of tree roots in the functioning of forest ecosystems and highlights the need for more research in this emerging field.

[13]

Cai

J

, Liu

X

, Vanneste

K

, Proost

S

, Tsai

WC

, Liu

KW

, Chen

LJ

, He

Y

, Xu

Q

, Bian

C

, Zheng

ZJ

, Sun

FM

, Liu

WQ

, Hsiao

YY

, Pan

ZJ

, Hsu

CC

, Yang

YP

, Hsu

YC

, Chuang

YC

, Dievart

A

, Dufayard

JF

, Xu

X

, Wang

JY

, Wang

J

, Xiao

XJ

, Zhao

XM

, Du

R

, Zhang

GQ

, Wang

MN

, Su

YY

, Xie

GC

, Liu

GH

, Li

LQ

, Huang

LQ

, Luo

YB

, Chen

HH

, Van

de Peer Y

, Liu

ZJ

(2015). The genome sequence of the orchid Phalaenopsis equestris. Nat Genet 47, 65-72.

DOI:10.1038/ng.3149 URL [本文引用: 3]

[14]

Castresana

J

(2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol Biol Evol 17, 540-552.

PMID:10742046 [本文引用: 1]

The use of some multiple-sequence alignments in phylogenetic analysis, particularly those that are not very well conserved, requires the elimination of poorly aligned positions and divergent regions, since they may not be homologous or may have been saturated by multiple substitutions. A computerized method that eliminates such positions and at the same time tries to minimize the loss of informative sites is presented here. The method is based on the selection of blocks of positions that fulfill a simple set of requirements with respect to the number of contiguous conserved positions, lack of gaps, and high conservation of flanking positions, making the final alignment more suitable for phylogenetic analysis. To illustrate the efficiency of this method, alignments of 10 mitochondrial proteins from several completely sequenced mitochondrial genomes belonging to diverse eukaryotes were used as examples. The percentages of removed positions were higher in the most divergent alignments. After removing divergent segments, the amino acid composition of the different sequences was more uniform, and pairwise distances became much smaller. Phylogenetic trees show that topologies can be different after removing conserved blocks, particularly when there are several poorly resolved nodes. Strong support was found for the grouping of animals and fungi but not for the position of more basal eukaryotes. The use of a computerized method such as the one presented here reduces to a certain extent the necessity of manually editing multiple alignments, makes the automation of phylogenetic analysis of large data sets feasible, and facilitates the reproduction of the final alignment by other researchers.

[15]

Chase

MW

, Cameron

KM

, Freudenstein

JV

, Pridgeon

AM

, Salazar

G

, van

den Berg C

, Schuiteman

A

(2015). An updated classification of Orchidaceae. Bot J Linn Soc 177, 151-174.

DOI:10.1111/boj.12234 URL [本文引用: 1]

[16]

Chen

SC

, Liu

ZJ

, Zhu

GH

, Lang

KY

, Tsi

ZH

, Luo

YB

, Jin

XH

, Cribb

PJ

, Wood

JJ

, Gale

SW

, Ormerod

P

, Vermeulen

JJ

, Wood

HP

, Clayton

D

, Bell

A

(2009). Orchidaceae. In: Wu ZY, Raven PH, Hong DY, eds. Flora of China, Vol. 25. Beijing: Science Press. pp. 381-382.

[本文引用: 1]

[17]

Comai

L

(2005). The advantages and disadvantages of being polyploid. Nat Rev Genet 6, 836-846.

PMID:16304599 [本文引用: 1]

Polyploids - organisms that have multiple sets of chromosomes - are common in certain plant and animal taxa, and can be surprisingly stable. The evidence that has emerged from genome analyses also indicates that many other eukaryotic genomes have a polyploid ancestry, suggesting that both humans and most other eukaryotes have either benefited from or endured polyploidy. Studies of polyploids soon after their formation have revealed genetic and epigenetic interactions between redundant genes. These interactions can be related to the phenotypes and evolutionary fates of polyploids. Here, I consider the advantages and challenges of polyploidy, and its evolutionary potential.

[18]

Cox

AV

, Abdelnour

GJ

, Bennett

MD

, Leitch

IJ

(1998). Genome size and karyotype evolution in the slipper orchids (Cypripedioideae: Orchidaceae). Am J Bot 85, 681-687.

PMID:21684950 [本文引用: 1]

Nuclear DNA contents (4C) were estimated by Feulgen microdensitometry in 27 species of slipper orchids. These data and recent information concerning the molecular systematics of Cypripedioideae allow an interesting re-evaluation of karyotype and genome size variation among slipper orchids in a phylogenetic context. DNA amounts differed 5.7-fold, from 24.4 pg in Phragmipedium longifolium to 138.1 pg in Paphiopedilum wardii. The most derived clades of the conduplicate-leaved slipper orchids have undergone a radical process of genome fragmentation that is most parsimoniously explained by Robertsonian changes involving centric fission. This process seems to have occurred independently of genome size variation. However, it may reflect environmental or selective pressures favoring higher numbers of linkage groups in the karyotype.

[19]

Cox

AV

, Pridgeon

AM

, Albert

VA

, Chase

MW

(1997). Phylogenetics of the slipper orchids (Cypripedioideae, Or-chidaceae): nuclear rDNA ITS sequences. Plant Syst Evol 208, 197-223.

DOI:10.1007/BF00985442 URL [本文引用: 1]

[20]

Cui

LY

, Wall

PK

, Leebens-Mack

JH

, Lindsay

BG

, Soltis

DE

, Doyle

JJ

, Soltis

PS

, Carlson

JE

, Arumuganathan

K

, Barakat

A

, Albert

VA

, Ma

H

, dePamphilis

CW

(2006). Widespread genome duplications throughout the history of flowering plants. Genome Res 16, 738-749.

DOI:10.1101/gr.4825606 URL [本文引用: 1]

[21]

da Conceição

LP

, de Oliveira

ALPC

, Barbosa

LV

(2006). Characterization of the species Epidendrum cinnabarium salzm. (Epidendroideae: Orchidaceae) occurring in dunas do abaeté-salvador, ba-brasil. Cytologia 71, 125-129.

DOI:10.1508/cytologia.71.125 URL [本文引用: 1]

[22]

Darriba

D

, Taboada

GL

, Doallo

R

, Posada

D

(2011). ProtTest 3: fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164-1165.

DOI:10.1093/bioinformatics/btr088 PMID:21335321 [本文引用: 1]

We have implemented a high-performance computing (HPC) version of ProtTest that can be executed in parallel in multicore desktops and clusters. This version, called ProtTest 3, includes new features and extended capabilities.ProtTest 3 source code and binaries are freely available under GNU license for download from http://darwin.uvigo.es/software/prottest3, linked to a Mercurial repository at Bitbucket (https://bitbucket.org/).dposada@uvigo.esSupplementary data are available at Bioinformatics online.

[23]

Das

K

, Roychoudhury

A

(2014). Reactive oxygen species (ROS) and response of antioxidants as ROS-scavengers during environmental stress in plants. Front Environ Sci 2, 53.

[本文引用: 1]

[24]

De

Bodt S

, Maere

S

, Van

de Peer Y

(2005). Genome duplication and the origin of angiosperms. Trends Ecol Evol 20, 591-597.

DOI:10.1016/j.tree.2005.07.008 URL [本文引用: 3]

[25]

Edgar

RC

(2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32, 1792-1797.

DOI:10.1093/nar/gkh340 URL [本文引用: 2]

[26]

Emms

DM

, Kelly

S

(2019). OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol 20, 238.

DOI:10.1186/s13059-019-1832-y URL [本文引用: 1]

[27]

Enright

AJ

, Van

Dongen S

, Ouzounis

CA

(2002). An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res 30, 1575-1584.

PMID:11917018 [本文引用: 1]

Detection of protein families in large databases is one of the principal research objectives in structural and functional genomics. Protein family classification can significantly contribute to the delineation of functional diversity of homologous proteins, the prediction of function based on domain architecture or the presence of sequence motifs as well as comparative genomics, providing valuable evolutionary insights. We present a novel approach called TRIBE-MCL for rapid and accurate clustering of protein sequences into families. The method relies on the Markov cluster (MCL) algorithm for the assignment of proteins into families based on precomputed sequence similarity information. This novel approach does not suffer from the problems that normally hinder other protein sequence clustering algorithms, such as the presence of multi-domain proteins, promiscuous domains and fragmented proteins. The method has been rigorously tested and validated on a number of very large databases, including SwissProt, InterPro, SCOP and the draft human genome. Our results indicate that the method is ideally suited to the rapid and accurate detection of protein families on a large scale. The method has been used to detect and categorise protein families within the draft human genome and the resulting families have been used to annotate a large proportion of human proteins.

[28]

Fang

L

, Xu

X

, Li

J

, Zheng

F

, Li

MZ

, Yan

JW

, Li

Y

, Zhang

XH

, Li

L

, Ma

GH

, Zhang

AY

, Lv

FB

, Wu

KL

, Zeng

SJ

(2020). Transcriptome analysis provides insights into the non-methylated lignin synthesis in Paphiopedilum armeniacum seed. BMC Genomics 21, 524.

DOI:10.1186/s12864-020-06931-1 PMID:32727352 [本文引用: 2]

Paphiopedilum is an important genus of the orchid family Orchidaceae and has high horticultural value. The wild populations are under threat of extinction because of overcollection and habitat destruction. Mature seeds of most Paphiopedilum species are difficult to germinate, which severely restricts their germplasm conservation and commercial production. The factors inhibiting germination are largely unknown.In this study, large amounts of non-methylated lignin accumulated during seed maturation of Paphiopedilum armeniacum (P. armeniacum), which negatively correlates with the germination rate. The transcriptome profiles of P. armeniacum seed at different development stages were compared to explore the molecular clues for non-methylated lignin synthesis. Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis showed that a large number of genes associated with phenylpropanoid biosynthesis and phenylalanine metabolism during seed maturation were differentially expressed. Several key genes in the lignin biosynthetic pathway displayed different expression patterns during the lignification process. PAL, 4CL, HCT, and CSE upregulation was associated with C and H lignin accumulation. The expression of CCoAOMT, F5H, and COMT were maintained at a low level or down-regulated to inhibit the conversion to the typical G and S lignin. Quantitative real-time RT-PCR analysis confirmed the altered expression levels of these genes in seeds and vegetative tissues.This work demonstrated the plasticity of natural lignin polymer assembly in seed and provided a better understanding of the molecular mechanism of seed-specific lignification process.

[29]

Goldman

N

, Yang

Z

(1994). A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol Biol Evol 11, 725-736.

PMID:7968486 [本文引用: 1]

A codon-based model for the evolution of protein-coding DNA sequences is presented for use in phylogenetic estimation. A Markov process is used to describe substitutions between codons. Transition/transversion rate bias and codon usage bias are allowed in the model, and selective restraints at the protein level are accommodated using physicochemical distances between the amino acids coded for by the codons. Analyses of two data sets suggest that the new codon-based model can provide a better fit to data than can nucleotide-based models and can produce more reliable estimates of certain biologically important measures such as the transition/transversion rate ratio and the synonymous/nonsynonymous substitution rate ratio.

[30]

Govaerts

R

, Bernet

P

, Kratochvil

K

, Gerlach

G

, Carr

G

, Alrich

P

, Pridgeon

AM

, Pfahl

J

, Campacci

MA

, Baptista

DH

, Tigges

H

, Shaw

J

, Cribb

P

, George

A

, Kreuz

K

, Wood

J

(2021). World checklist of Orchidaceae. https://wcsp.science.kew.org/. 2021-05-08.

URL [本文引用: 1]

[31]

Gustafsson

ALS

, Verola

CF

, Antonelli

A

(2010). Reassessing the temporal evolution of orchids with new fossils and a Bayesian relaxed clock, with implications for the diversification of the rare South American genus Hoffmann seggella (Orchidaceae: Epidendroideae). BMC Evol Biol 10, 177.

DOI:10.1186/1471-2148-10-177 PMID:20546585 [本文引用: 1]

Background: The temporal origin and diversification of orchids (family Orchidaceae) has been subject to intense debate in the last decade. The description of the first reliable fossil in 2007 enabled a direct calibration of the orchid phylogeny, but little attention has been paid to the potential influence of dating methodology in obtaining reliable age estimates. Moreover, two new orchid fossils described in 2009 have not yet been incorporated in a molecular dating analysis. Here we compare the ages of major orchid clades estimated under two widely used methods, a Bayesian relaxed clock implemented in BEAST and Penalized Likelihood implemented in r8s. We then perform a new family-level analysis by integrating all 3 available fossils and using BEAST. To evaluate how the newly estimated ages may influence the evolutionary interpretation of a species-level phylogeny, we assess divergence times for the South American genus Hoffmannseggella (subfam. Epidendroideae), for which we present an almost complete phylogeny (40 out of 41 species sampled). Results: Our results provide additional support that all extant orchids shared a most recent common ancestor in the Late Cretaceous (similar to 77 million years ago, Ma). However, we estimate the crown age of the five orchid subfamilies to be generally (similar to 1-8 Ma) younger than previously calculated under the Penalized Likelihood algorithm and using a single internal fossil calibration. The crown age of Hoffmannseggella is estimated here at similar to 11 Ma, some 3 Ma more recently than estimated under Penalized Likelihood. Conclusions: Contrary to recent suggestions that orchid diversification began in a period of global warming, our results place the onset of diversification of the largest orchid subfamilies (Orchidoideae and Epidendroideae) in a period of global cooling subsequent to the Early Eocene Climatic Optimum. The diversification of Hoffmannseggella appears even more correlated to late Tertiary climatic fluctuations than previously suggested. With the incorporation of new fossils in the orchid phylogeny and the use of a method that is arguably more adequate given the present data, our results represent the most up-to-date estimate of divergence times in orchids.

[32]

Haas

BJ

, Papanicolaou

A

, Yassour

M

, Grabherr

M

, Blood

PD

, Bowden

J

, Couger

MB

, Eccles

D

, Li

B

, Lieber

M

, MacManes

MD

, Ott

M

, Orvis

J

, Pochet

N

, Strozzi

F

, Weeks

N

, Westerman

R

, William

T

, Dewey

CN

, Henschel

R

, LeDuc

RD

, Friedman

N

, Regev

A

(2013). De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat Protoc 8, 1494-1512.

[本文引用: 1]

[33]

Hasing

T

, Tang

HB

, Brym

M

, Khazi

F

, Huang

TF

, Chambers

AH

(2020). A phased Vanilla planifolia genome enables genetic improvement of flavour and production. Nat Food 1, 811-819.

DOI:10.1038/s43016-020-00197-2 URL [本文引用: 3]

[34]

Huang

CH

, Qi

XP

, Chen

DY

, Qi

J

, Ma

H

(2020). Recurrent genome duplication events likely contributed to both the ancient and recent rise of ferns. J Integr Plant Biol 62, 433-455.

DOI:10.1111/jipb.v62.4 URL [本文引用: 3]

[35]

Huerta-Cepas

J

, Forslund

K

, Coelho

LP

, Szklarczyk

D

, Jensen

LJ

, von

Mering C

, Bork

P

(2017). Fast genome- wide functional annotation through orthology assignment by eggNOG-mapper. Mol Biol Evol 34, 2115-2122.

DOI:10.1093/molbev/msx148 PMID:28460117 [本文引用: 1]

Orthology assignment is ideally suited for functional inference. However, because predicting orthology is computationally intensive at large scale, and most pipelines are relatively inaccessible (e.g., new assignments only available through database updates), less precise homology-based functional transfer is still the default for (meta-)genome annotation. We, therefore, developed eggNOG-mapper, a tool for functional annotation of large sets of sequences based on fast orthology assignments using precomputed clusters and phylogenies from the eggNOG database. To validate our method, we benchmarked Gene Ontology (GO) predictions against two widely used homology-based approaches: BLAST and InterProScan. Orthology filters applied to BLAST results reduced the rate of false positive assignments by 11%, and increased the ratio of experimentally validated terms recovered over all terms assigned per protein by 15%. Compared with InterProScan, eggNOG-mapper achieved similar proteome coverage and precision while predicting, on average, 41 more terms per protein and increasing the rate of experimentally validated terms recovered over total term assignments per protein by 35%. EggNOG-mapper predictions scored within the top-5 methods in the three GO categories using the CAFA2 NK-partial benchmark. Finally, we evaluated eggNOG-mapper for functional annotation of metagenomics data, yielding better performance than interProScan. eggNOG-mapper runs ∼15× faster than BLAST and at least 2.5× faster than InterProScan. The tool is available standalone and as an online service at http://eggnog-mapper.embl.de.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

[36]

Huerta-Cepas

J

, Szklarczyk

D

, Heller

D

, Hernández-Plaza

A

, Forslund

SK

, Cook

H

, Mende

DR

, Letunic

I

, Rattei

T

, Jensen

LJ

, von

Mering C

, Bork

P

(2019). eggNOG 5.0: a hierarchical, functionally and phylogenetically anno- tated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res 47, D309-D314.

[本文引用: 1]

[37]

Iorizzo

M

, Ellison

S

, Senalik

D

, Zeng

P

, Satapoomin

P

, Huang

JY

, Bowman

M

, Iovene

M

, Sanseverino

W

, Cavagnaro

P

, Yildiz

M

, Macko-Podgórni

A

, Moranska

E

, Grzebelus

E

, Grzebelus

D

, Ashrafi

H

, Zheng

ZJ

, Cheng

SF

, Spooner

D

, Van

Deynze A

, Simon

P

(2016). A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution. Nat Genet 48, 657-666.

DOI:10.1038/ng.3565 URL [本文引用: 1]

[38]

Kim

YK

, Jo

S

, Cheon

SH

, Joo

MJ

, Hong

JR

, Kwak

M

, Kim

KJ

(2020). Plastome evolution and phylogeny of Orchidaceae, with 24 new sequences. Front Plant Sci 11, 22.

DOI:10.3389/fpls.2020.00022 URL [本文引用: 1]

[39]

Klages

JP

, Salzmann

U

, Bickert

T

, Hillenbrand

CD

, Gohl

K

, Kuhn

G

, Bohaty

SM

, Titschack

J

, Müller

J

, Frederichs

T

, Bauersachs

T

, Ehrmann

W

, van

de Flierdt T

, Pereira

PS

, Larter

RD

, Lohmann

G

, Niezgodzki

I

, Uenzelmann-Neben

G

, Zundel

M

, Spiegel

C

, Mark

C

, Chew

D

, Francis

JE

, Nehrke

G

, Schwarz

F

, Smith

JA

, Freudenthal

T

, Esper

O

, Pälike

H

, Ronge

TA

, Dziadek

R

(2020). Temperate rainforests near the South Pole during peak Cretaceous warmth. Nature 580, 81-86.

DOI:10.1038/s41586-020-2148-5 URL [本文引用: 1]

[40]

Leinonen

R

, Sugawara

H

, Shumway

M

(2011). The sequence read archive. Nucleic Acids Res 39, D19-D21.

[本文引用: 1]

[41]

Leitch

IJ

, Kahandawala

I

, Suda

J

, Hanson

L

, Ingrouille

MJ

, Chase

MW

, Fay

MF

(2009). Genome size diversity in orchids: consequences and evolution. Ann Bot 104, 469-481.

DOI:10.1093/aob/mcp003 URL [本文引用: 1]

[42]

Li

D

, Yin

H

, Zhao

C

, Zhu

G

, Lǚ

F

(2014). Transcriptome analysis of tessellated and green leaves in Paphiopedilum orchids using Illumina paired-end sequencing and discovery simple sequence repeat markers. J Plant Biochem Physiol 2, 1000136.

[本文引用: 3]

[43]

Li

MH

, Zhang

GQ

, Lan

SR

, Liu

ZJ

,China Phylogeny Consortium (2016). A molecular phylogeny of Chinese orchids. J Syst Evol 54, 349-362.

DOI:10.1111/jse.12187 URL

[44]

Li

WZ

, Godzik

A

(2006). Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658-1659.

DOI:10.1093/bioinformatics/btl158 URL [本文引用: 1]

[45]

Mandáková

T

, Lysak

MA

(2018). Post-polyploid diploidization and diversification through dysploid changes. Curr Opin Plant Biol 42, 55-65.

DOI:S1369-5266(17)30189-9 PMID:29567623 [本文引用: 4]

Whole-genome duplications are widespread across land plant phylogenies and particularly frequent in ferns and angiosperms. Genome duplications spurred the evolution of key innovations associated with diversification in many angiosperm clades and lineages. Such diversifications are not initiated by genome doubling per se. Rather, differentiation of the primary polyploid populations through a range of processes results in post-polyploid genome diploidization. Structural diploidization gradually reverts the polyploid genome to one functionally diploid-like through chromosomal rearrangements which frequently result in dysploid changes. Dysploidies may lead to reproductive isolation among post-polyploid offspring and significantly contribute to speciation and cladogenetic events.Copyright © 2018 Elsevier Ltd. All rights reserved.

[46]

McInerney

FA

, Wing

SL

(2011). The paleocene-eocene thermal maximum: a perturbation of carbon cycle, climate, and biosphere with implications for the future. Annu Rev Earth Planet Sci 39, 489-516.

DOI:10.1146/earth.2011.39.issue-1 URL [本文引用: 1]

[47]

Ming

R

, VanBuren

R

, Wai

CM

, Tang

HB

, Schatz

MC

, Bowers

JE

, Lyons

E

, Wang

ML

, Chen

J

, Biggers

E

, Zhang

JS

, Huang

LX

, Zhang

LM

, Miao

WJ

, Zhang

J

, Ye

ZY

, Miao

CY

, Lin

ZC

, Wang

H

, Zhou

HY

, Yim

WC

, Priest

HD

, Zheng

CF

, Woodhouse

M

, Edger

PP

, Guyot

R

, Guo

HB

, Guo

H

, Zheng

GY

, Singh

R

, Sharma

A

, Min

XJ

, Zheng

Y

, Lee

H

, Gurtowski

J

, Sedlazeck

FJ

, Harkess

A

, McKain

MR

, Liao

ZY

, Fang

JP

, Liu

J

, Zhang

XD

, Zhang

Q

, Hu

WC

, Qin

Y

, Wang

K

, Chen

LY

, Shirley

N

, Lin

YR

, Liu

LY

, Hernandez

AG

, Wright

CL

, Bulone

V

, Tuskan

GA

, Heath

K

, Zee

F

, Moore

PH

, Sunkar

R

, Leebens-Mack

JH

, Mockler

T

, Bennetzen

JL

, Freeling

M

, Sankoff

D

, Paterson

AH

, Zhu

XG

, Yang

XH

, Smith

JAC

, Cushman

JC

, Paull

RE

, Yu

QY

(2015). The pineapple genome and the evolution of CAM photosynthesis. Nat Genet 47, 1435-1442.

DOI:10.1038/ng.3435 PMID:26523774 [本文引用: 1]

Pineapple (Ananas comosus (L.) Merr.) is the most economically valuable crop possessing crassulacean acid metabolism (CAM), a photosynthetic carbon assimilation pathway with high water-use efficiency, and the second most important tropical fruit. We sequenced the genomes of pineapple varieties F153 and MD2 and a wild pineapple relative, Ananas bracteatus accession CB5. The pineapple genome has one fewer ancient whole-genome duplication event than sequenced grass genomes and a conserved karyotype with seven chromosomes from before the ρ duplication event. The pineapple lineage has transitioned from C3 photosynthesis to CAM, with CAM-related genes exhibiting a diel expression pattern in photosynthetic tissues. CAM pathway genes were enriched with cis-regulatory elements associated with the regulation of circadian clock genes, providing the first cis-regulatory link between CAM and circadian clock regulation. Pineapple CAM photosynthesis evolved by the reconfiguration of pathways in C3 plants, through the regulatory neofunctionalization of preexisting genes and not through the acquisition of neofunctionalized genes via whole-genome or tandem gene duplication.

[48]

Murat

F

, Armero

A

, Pont

C

, Klopp

C

, Salse

J

(2017). Reconstructing the genome of the most recent common ancestor of flowering plants. Nat Genet 49, 490-496.

DOI:10.1038/ng.3813 [本文引用: 1]

We describe here the reconstruction of the genome of the most recent common ancestor (MRCA) of modern monocots and eudicots, accounting for 95% of extant angiosperms, with its potential repertoire of 22,899 ancestral genes conserved in present-day crops. The MRCA provides a starting point for deciphering the reticulated evolutionary plasticity between species (rapidly versus slowly evolving lineages), subgenomes (pre-versus post-duplication blocks), genomic compartments (stable versus labile loci), genes (ancestral versus species-specific genes) and functions (gained versus lost ontologies), the key mutational forces driving the success of polyploidy in crops. The estimation of the timing of angiosperm evolution, based on MRCA genes, suggested that this group emerged 214 million years ago during the late Triassic era, before the oldest recorded fossil. Finally, the MRCA constitutes a unique resource for scientists to dissect major agronomic traits in translational genomics studies extending from model species to crops.

[49]

Oberlander

KC

, Dreyer

LL

, Goldblatt

P

, Suda

J

, Linder

HP

(2016). Species-rich and polyploid-poor: insights into the evolutionary role of whole-genome duplication from the Cape flora biodiversity hotspot. Am J Bot 103, 1336-1347.

DOI:10.3732/ajb.1500474 PMID:27352831 [本文引用: 1]

Whole-genome duplication (WGD) in angiosperms has been hypothesized to be advantageous in unstable environments and/or to increase diversification rates, leading to radiations. Under the first hypothesis, floras in stable environments are predicted to have lower proportions of polyploids than highly, recently disturbed floras, whereas species-rich floras would be expected to have higher than expected proportions of polyploids under the second. The South African Cape flora is used to discriminate between these two hypotheses because it features a hyperdiverse flora predominantly generated by a limited number of radiations (Cape clades), against a backdrop of climatic and geological stability.We compiled all known chromosome counts for species in 21 clades present in the Cape (1653 species, including 24 Cape clades), inferred ploidy levels for these species by inspection or derived from the primary literature, and compared Cape to non-Cape ploidy levels in these clades (17,520 species) using G tests.The Cape flora has anomalously low proportions of polyploids compared with global levels. This pattern is consistently observed across nearly half the clades and across global latitudinal gradients, although individual lineages seem to be following different paths to low levels of WGD and to differing degrees.This pattern shows that the diversity of the Cape flora is the outcome of primarily diploid radiations and supports the hypothesis that WGD may be rare in stable environments.© 2016 Botanical Society of America.

[50]

One Thousand Plant Transcriptomes Initiative (2019). One thousand plant transcriptomes and the phylogenomics of green plants. Nature 574, 679-685.

DOI:10.1038/s41586-019-1693-2 URL [本文引用: 4]

[51]

Paterson

AH

, Bowers

JE

, Chapman

BA

(2004). Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics. Proc Natl Acad Sci USA 101, 9903-9908.

DOI:10.1073/pnas.0307901101 URL [本文引用: 1]

[52]

Ren

R

, Wang

HF

, Guo

CC

, Zhang

N

, Zeng

LP

, Chen

YM

, Ma

H

, Qi

J

(2018). Widespread whole genome duplications contribute to genome complexity and species diversity in angiosperms. Mol Plant 11, 414-428.

DOI:S1674-2052(18)30022-4 PMID:29317285 [本文引用: 2]

Gene duplications provide evolutionary potentials for generating novel functions, while polyploidization or whole genome duplication (WGD) doubles the chromosomes initially and results in hundreds to thousands of retained duplicates. WGDs are strongly supported by evidence commonly found in many species-rich lineages of eukaryotes, and thus are considered as a major driving force in species diversification. We performed comparative genomic and phylogenomic analyses of 59 public genomes/transcriptomes and 46 newly sequenced transcriptomes covering major lineages of angiosperms to detect large-scale gene duplication events by surveying tens of thousands of gene family trees. These analyses confirmed most of the previously reported WGDs and provided strong evidence for novel ones in many lineages. The detected WGDs supported a model of exponential gene loss during evolution with an estimated half-life of approximately 21.6 million years, and were correlated with both the emergence of lineages with high degrees of diversification and periods of global climate changes. The new datasets and analyses detected many novel WGDs widely spread during angiosperm evolution, uncovered preferential retention of gene functions in essential cellular metabolisms, and provided clues for the roles of WGD in promoting angiosperm radiation and enhancing their adaptation to environmental changes.Copyright © 2018 The Author. Published by Elsevier Inc. All rights reserved.

[53]

Salse

J

, Bolot

S

, Throude

M

, Jouffe

V

, Piegu

B

, Quraishi

UM

, Calcagno

T

, Cooke

R

, Delseny

M

, Feuillet

C

(2008). Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution. Plant Cell 20, 11-24.

DOI:10.1105/tpc.107.056309 URL [本文引用: 1]

[54]

Schlueter

JA

, Dixon

P

, Granger

C

, Grant

D

, Clark

L

, Doyle

JJ

, Shoemaker

RC

(2004). Mining EST databases to resolve evolutionary events in major crop species. Genome 47, 868-876.

PMID:15499401 [本文引用: 1]

Using plant EST collections, we obtained 1392 potential gene duplicates across 8 plant species: Zea mays, Oryza sativa, Sorghum bicolor, Hordeum vulgare, Solanum tuberosum, Lycopersicon esculentum, Medicago truncatula, and Glycine max. We estimated the synonymous and nonsynonymous distances between each gene pair and identified two to three mixtures of normal distributions corresponding to one to three rounds of genome duplication in each species. Within the Poaceae, we found a conserved duplication event among all four species that occurred approximately 50-60 million years ago (Mya); an event that probably occurred before the major radiation of the grasses. In the Solanaceae, we found evidence for a conserved duplication event approximately 50-52 Mya. A duplication in soybean occurred approximately 44 Mya and a duplication in Medicago about 58 Mya. Comparing synonymous and nonsynonymous distances allowed us to determine that most duplicate gene pairs are under purifying, negative selection. We calculated Pearson's correlation coefficients to provide us with a measure of how gene expression patterns have changed between duplicate pairs, and compared this across evolutionary distances. This analysis showed that some duplicates seemed to retain expression patterns between pairs, whereas others showed uncorrelated expression.

[55]

Scrucca

L

, Fop

M

, Murphy

TB

, Raftery

AE

(2016). Mclust 5: clustering, classification and density estimation using Gaussian finite mixture models. R J 8, 289-317.

PMID:27818791 [本文引用: 1]

Finite mixture models are being used increasingly to model a wide variety of random phenomena for clustering, classification and density estimation. is a powerful and popular package which allows modelling of data as a Gaussian finite mixture with different covariance structures and different numbers of mixture components, for a variety of purposes of analysis. Recently, version 5 of the package has been made available on CRAN. This updated version adds new covariance structures, dimension reduction capabilities for visualisation, model selection criteria, initialisation strategies for the EM algorithm, and bootstrap-based inference, making it a full-featured R package for data analysis via finite mixture modelling.

[56]

Simão

FA

, Waterhouse

RM

, Ioannidis

P

, Kriventseva

EV

, Zdobnov

EM

(2015). BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210-3212.

DOI:10.1093/bioinformatics/btv351 URL [本文引用: 1]

[57]

Sollars

ESA

, Harper

AL

, Kelly

LJ

, Sambles

CM

, Ramirez-Gonzalez

RH

, Swarbreck

D

, Kaithakottil

G

, Cooper

ED

, Uauy

C

, Havlickova

L

, Worswick

G

, Studholme

DJ

, Zohren

J

, Salmon

DL

, Clavijo

BJ

, Li

Y

, He

ZS

, Fellgett

A

, McKinney

LV

, Nielsen

LR

, Douglas

GC

, Kjær

ED

, Downie

JA

, Boshier

D

, Lee

S

, Clark

J

, Grant

M

, Bancroft

I

, Caccamo

M

, Buggs

RJA

(2017). Genome sequence and genetic diversity of European ash trees. Nature 541, 212-216.

DOI:10.1038/nature20786 URL [本文引用: 1]

[58]

Stamatakis

A

(2014). RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312-1313.

DOI:10.1093/bioinformatics/btu033 URL [本文引用: 1]

[59]

Talavera

G

, Castresana

J

(2007). Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Syst Biol 56, 564-577.

PMID:17654362 [本文引用: 1]

Alignment quality may have as much impact on phylogenetic reconstruction as the phylogenetic methods used. Not only the alignment algorithm, but also the method used to deal with the most problematic alignment regions, may have a critical effect on the final tree. Although some authors remove such problematic regions, either manually or using automatic methods, in order to improve phylogenetic performance, others prefer to keep such regions to avoid losing any information. Our aim in the present work was to examine whether phylogenetic reconstruction improves after alignment cleaning or not. Using simulated protein alignments with gaps, we tested the relative performance in diverse phylogenetic analyses of the whole alignments versus the alignments with problematic regions removed with our previously developed Gblocks program. We also tested the performance of more or less stringent conditions in the selection of blocks. Alignments constructed with different alignment methods (ClustalW, Mafft, and Probcons) were used to estimate phylogenetic trees by maximum likelihood, neighbor joining, and parsimony. We show that, in most alignment conditions, and for alignments that are not too short, removal of blocks leads to better trees. That is, despite losing some information, there is an increase in the actual phylogenetic signal. Overall, the best trees are obtained by maximum-likelihood reconstruction of alignments cleaned by Gblocks. In general, a relaxed selection of blocks is better for short alignment, whereas a stringent selection is more adequate for longer ones. Finally, we show that cleaned alignments produce better topologies although, paradoxically, with lower bootstrap. This indicates that divergent and problematic alignment regions may lead, when present, to apparently better supported although, in fact, more biased topologies.

[60]

Tsai

CC

, Liao

PC

, Ko

YZ

, Chen

CH

, Chiang

YC

(2020). Phylogeny and historical biogeography of Paphiopedilum pfitzer (Orchidaceae) based on nuclear and plastid DNA. Front Plant Sci 11, 126.

DOI:10.3389/fpls.2020.00126 PMID:32174935 [本文引用: 1]

The phylogeny and biogeography of the genus were evaluated by using phylogenetic trees derived from analysis of nuclear ribosomal internal transcribed spacer (ITS) sequences, the plastid L intron, the L-F spacer, and the B-L spacer. This genus was divided into three subgenera:,, and. Each of them is monophyletic with high bootstrap supports according to the highly resolved phylogenetic tree reconstructed by combined sequences. There are five sections within the subgenus, including,,,, and. The subgenus is phylogenetic basal, which suggesting that is comprising more ancestral characters than other subgenera. The evolutionary trend of genus was deduced based on the maximum likelihood (ML) tree and Bayesian Evolutionary Analysis Sampling Trees (BEAST). Reconstruct Ancestral State in Phylogenies (RASP) analyses based on the combined sequence data. The biogeographic analysis indicates that species were firstly derived in Southern China and Southeast Asia, subsequently dispersed into the Southeast Asian archipelagoes. The subgenera was likely derived after these historical dispersals and vicariance events. Our research reveals the relevance of the differentiation of in Southeast Asia and geological history. Moreover, the biogeographic analysis explains that the significant evolutionary hotspots of these orchids in the Sundaland and Wallacea might be attributed to repeated migration and isolation events between the south-eastern Asia mainland and the Sunda Super Islands.Copyright © 2020 Tsai, Liao, Ko, Chen and Chiang.

[61]

Unruh

SA

, McKain

MR

, Lee

YI

, Yukawa

T

, McCormick

MK

, Shefferson

RP

, Smithson

A

, Leebens-Mack

JH

, Pires

JC

(2018). Phylotranscriptomic analysis and genome evolution of the Cypripedioideae (Orchidaceae). Am J Bot 105, 631-640.

DOI:10.1002/ajb2.2018.105.issue-4 URL [本文引用: 1]

[62]

Upchurch

RG

(2008). Fatty acid unsaturation, mobilization, and regulation in the response of plants to stress. Biotechnol Lett 30, 967-977.

DOI:10.1007/s10529-008-9639-z PMID:18227974 [本文引用: 1]

Stress acclimating plants respond to abiotic and biotic stress by remodeling membrane fluidity and by releasing alpha-linolenic (18:3) from membrane lipids. The modification of membrane fluidity is mediated by changes in unsaturated fatty acid levels, a function provided in part by the regulated activity of fatty acid desaturases. Adjustment of membrane fluidity maintains an environment suitable for the function of critical integral proteins during stress. alpha-Linolenic acid, released from membrane lipid by regulated lipase activity, is the precursor molecule for phyto-oxylipin biosynthesis. The modulation of chloroplast oleic acid (18:1) levels is central to the normal expression of defense responses to pathogens in Arabidopsis. Oleic (18:1) and linolenic (18:2) acid levels, in part, regulate development, seed colonization, and mycotoxin production by Aspergillus spp.

[63]

Van

de Peer Y

, Ashman

TL

, Soltis

PS

, Soltis

DE

(2021). Polyploidy: an evolutionary and ecological force in stressful times. Plant Cell 33, 11-26.

DOI:10.1093/plcell/koaa015 URL [本文引用: 1]

[64]

Van

de Peer Y

, Mizrachi

E

, Marchal

K

(2017). The evolutionary significance of polyploidy. Nat Rev Genet 18, 411-424.

[本文引用: 3]

[65]

Vellekoop

J

, Esmeray-Senlet

S

, Miller

KG

, Browning

JV

, Sluijs

A

, van

de Schootbrugge B

, Damsté

JSS

, Brinkhuis

H

(2016). Evidence for Cretaceous-Paleogene boundary bolide ‘impact winter' conditions from New Jersey, USA. Geology 44, 619-622.

DOI:10.1130/G37961.1 URL [本文引用: 2]

[66]

Vishwakarma

K

, Upadhyay

N

, Kumar

N

, Yadav

G

, Singh

J

, Mishra

RK

, Kumar

V

, Verma

R

, Upadhyay

RG

, Pandey

M

, Sharma

S

(2017). Abscisic acid signaling and abiotic stress tolerance in plants: a review on current knowledge and future prospects. Front Plant Sci 8, 161.

DOI:10.3389/fpls.2017.00161 PMID:28265276 [本文引用: 1]

Abiotic stress is one of the severe stresses of environment that lowers the growth and yield of any crop even on irrigated land throughout the world. A major phytohormone abscisic acid (ABA) plays an essential part in acting toward varied range of stresses like heavy metal stress, drought, thermal or heat stress, high level of salinity, low temperature, and radiation stress. Its role is also elaborated in various developmental processes including seed germination, seed dormancy, and closure of stomata. ABA acts by modifying the expression level of gene and subsequent analysis of cis- and trans-acting regulatory elements of responsive promoters. It also interacts with the signaling molecules of processes involved in stress response and development of seeds. On the whole, the stress to a plant can be susceptible or tolerant by taking into account the coordinated activities of various stress-responsive genes. Numbers of transcription factor are involved in regulating the expression of ABA responsive genes by acting together with their respective cis- acting elements. Hence, for improvement in stress-tolerance capacity of plants, it is necessary to understand the mechanism behind it. On this ground, this article enlightens the importance and role of ABA signaling with regard to various stresses as well as regulation of ABA biosynthetic pathway along with the transcription factors for stress tolerance.

[67]

Wang

YS

, Nie

F

, Shahid

MQ

, Baloch

FS

(2020). Molecular footprints of selection effects and whole genome duplication (WGD) events in three blueberry species: detected by transcriptome dataset. BMC Plant Biol 20, 250.

DOI:10.1186/s12870-020-02461-w URL [本文引用: 1]

[68]

Wei

CL

, Yang

H

, Wang

SB

, Zhao

J

, Liu

C

, Gao

LP

, Xia

EH

, Lu

Y

, Tai

YL

, She

GB

, Sun

J

, Cao

HS

, Tong

W

, Gao

Q

, Li

YY

, Deng

WW

, Jiang

XL

, Wang

WZ

, Chen

Q

, Zhang

SH

, Li

HJ

, Wu

JL

, Wang

P

, Li

PH

, Shi

CY

, Zheng

FY

, Jian

JB

, Huang

B

, Shan

D

, Shi

MM

, Fang

CB

, Yue

Y

, Li

FD

, Li

DX

, Wei

S

, Han

B

, Jiang

CJ

, Yin

Y

, Xia

T

, Zhang

ZZ

, Bennetzen

JL

, Zhao

SC

, Wan

XC

(2018). Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality. Proc Natl Acad Sci USA 115, E4151-E4158.

[本文引用: 1]

[69]

Wendel

JF

(2000). Genome evolution in polyploids. Plant Mol Biol 42, 225-249.

PMID:10688139 [本文引用: 1]

Polyploidy is a prominent process in plants and has been significant in the evolutionary history of vertebrates and other eukaryotes. In plants, interdisciplinary approaches combining phylogenetic and molecular genetic perspectives have enhanced our awareness of the myriad genetic interactions made possible by polyploidy. Here, processes and mechanisms of gene and genome evolution in polyploids are reviewed. Genes duplicated by polyploidy may retain their original or similar function, undergo diversification in protein function or regulation, or one copy may become silenced through mutational or epigenetic means. Duplicated genes also may interact through inter-locus recombination, gene conversion, or concerted evolution. Recent experiments have illuminated important processes in polyploids that operate above the organizational level of duplicated genes. These include inter-genomic chromosomal exchanges, saltational, non-Mendelian genomic evolution in nascent polyploids, inter-genomic invasion, and cytonuclear stabilization. Notwithstanding many recent insights, much remains to be learned about many aspects of polyploid evolution, including: the role of transposable elements in structural and regulatory gene evolution; processes and significance of epigenetic silencing; underlying controls of chromosome pairing; mechanisms and functional significance of rapid genome changes; cytonuclear accommodation; and coordination of regulatory factors contributed by two, sometimes divergent progenitor genomes. Continued application of molecular genetic approaches to questions of polyploid genome evolution holds promise for producing lasting insight into processes by which novel genotypes are generated and ultimately into how polyploidy facilitates evolution and adaptation.

[70]

Wu

SD

, Han

BC

, Jiao

YN

(2020). Genetic contribution of paleopolyploidy to adaptive evolution in angiosperms. Mol Plant 13, 59-71.

DOI:10.1016/j.molp.2019.10.012 URL [本文引用: 4]

[71]

Yang

ZH

(2007). PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24, 1586-1591.

DOI:10.1093/molbev/msm088 URL [本文引用: 1]

[72]

Yu

GC

, Wang

LG

, Han

YY

, He

QY

(2012). ClusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: J Integr Biol 16, 284-287.

DOI:10.1089/omi.2011.0118 URL [本文引用: 1]

[73]

Yuan

Y

, Jin

XH

, Liu

J

, Zhao

X

, Zhou

JH

, Wang

X

, Wang

DY

, Lai

CS

, Xu

W

, Huang

JW

, Zha

LP

, Liu

DH

, Ma

X

, Wang

L

, Zhou

MY

, Jiang

Z

, Meng

HB

, Peng

HS

, Liang

YT

, Li

RQ

, Jiang

C

, Zhao

YY

, Nan

TG

, Jin

Y

, Zhan

ZL

, Yang

J

, Jiang

WK

, Huang

LQ

(2018). The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nat Commun 9, 1615.

DOI:10.1038/s41467-018-03423-5 PMID:29691383 [本文引用: 3]

We present the 1.06 Gb sequenced genome of Gastrodia elata, an obligate mycoheterotrophic plant, which contains 18,969 protein-coding genes. Many genes conserved in other plant species have been deleted from the G. elata genome, including most of those for photosynthesis. Additional evidence of the influence of genome plasticity in the adaptation of this mycoheterotrophic lifestyle is evident in the large number of gene families that are expanded in G. elata, including glycoside hydrolases and urease that likely facilitate the digestion of hyphae are expanded, as are genes associated with strigolactone signaling, and ATPases that may contribute to the atypical energy metabolism. We also find that the plastid genome of G. elata is markedly smaller than that of green plant species while its mitochondrial genome is one of the largest observed to date. Our report establishes a foundation for studying adaptation to a mycoheterotrophic lifestyle.

[74]

Zachos

J

, Pagani

H

, Sloan

L

, Thomas

E

, Billups

K

(2001). Trends, rhythms, and aberrations in global climate 65 Ma to present. Science 292, 686-693.

PMID:11326091 [本文引用: 1]

Since 65 million years ago (Ma), Earth's climate has undergone a significant and complex evolution, the finer details of which are now coming to light through investigations of deep-sea sediment cores. This evolution includes gradual trends of warming and cooling driven by tectonic processes on time scales of 10(5) to 10(7) years, rhythmic or periodic cycles driven by orbital processes with 10(4)- to 10(6)-year cyclicity, and rare rapid aberrant shifts and extreme climate transients with durations of 10(3) to 10(5) years. Here, recent progress in defining the evolution of global climate over the Cenozoic Era is reviewed. We focus primarily on the periodic and anomalous components of variability over the early portion of this era, as constrained by the latest generation of deep-sea isotope records. We also consider how this improved perspective has led to the recognition of previously unforeseen mechanisms for altering climate.

[75]

Zhang

CF

, Huang

CH

, Liu

M

, Hu

Y

, Panero

JL

, Luebert

F

, Gao

TG

, Ma

H

(2021a). Phylotranscriptomic insights into Asteraceae diversity, polyploidy, and morphological innovation. J Integr Plant Biol 63, 1273-1293.

DOI:10.1111/jipb.v63.7 URL [本文引用: 2]

[76]

Zhang

CF

, Zhang

TK

, Luebert

F

, Xiang

YZ

, Huang

CH

, Hu

Y

, Rees

M

, Frohlich

MW

, Qi

J

, Weigend

M

, Ma

H

(2020). Asterid phylogenomics/phylotranscriptomics uncover morphological evolutionary histories and support phylogenetic placement for numerous whole-genome duplications. Mol Biol Evol 37, 3188-3210.

DOI:10.1093/molbev/msaa160 URL [本文引用: 3]

[77]

Zhang

CL

, Chen

JH

, Huang

WX

, Song

XQ

, Niu

J

(2021b). Transcriptomics and metabolomics reveal purine and phenylpropanoid metabolism response to drought stress in Dendrobium sinense, an endemic orchid species in Hainan island. Front Genet 12, 692702.

DOI:10.3389/fgene.2021.692702 URL [本文引用: 1]

[78]

Zhang

GQ

, Liu

KW

, Li

Z

, Lohaus

R

, Hsiao

YY

, Niu

SC

, Wang

JY

, Lin

YC

, Xu

Q

, Chen

LJ

, Yoshida

K

, Fujiwara

S

, Wang

ZW

, Zhang

YQ

, Mitsuda

N

, Wang

MN

, Liu

GH

, Pecoraro

L

, Huang

HX

, Xiao

XJ

, Lin

M

, Wu

XY

, Wu

WL

, Chen

YY

, Chang

SB

, Sakamoto

S

, Ohme-Takagi

M

, Yagi

M

, Zeng

SJ

, Shen

CY

, Yeh

CM

, Luo

YB

, Tsai

WC

, Van

de Peer Y

, Liu

ZJ

(2017). The Apostasia genome and the evolution of orchids. Nature 549, 379-383.

DOI:10.1038/nature23897 URL [本文引用: 7]

[79]

Zhang

GQ

, Xu

Q

, Bian

C

, Tsai

WC

, Yeh

CM

, Liu

KW

, Yoshida

K

, Zhang

LS

, Chang

SB

, Chen

F

, Shi

Y

, Su

YY

, Zhang

YQ

, Chen

LJ

, Yin

YY

, Lin

M

, Huang

HX

, Deng

H

, Wang

ZW

, Zhu

SL

, Zhao

X

, Deng

C

, Niu

SC

, Huang

J

, Wang

MN

, Liu

GH

, Yang

HJ

, Xiao

XJ

, Hsiao

YY

, Wu

WL

, Chen

YY

, Mitsuda

N

, Ohme-Takagi

M

, Luo

YB

, Van

de Peer Y

, Liu

ZJ

(2016). The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci Rep 6, 19029.

DOI:10.1038/srep19029 URL [本文引用: 3]

[80]

Zhao

YY

, Zhang

R

, Jiang

KW

, Qi

J

, Hu

Y

, Guo

J

, Zhu

RB

, Zhang

TK

, Egan

AN

, Yi

TS

, Huang

CH

, Ma

H

(2021). Nuclear phylotranscriptomics and phylogenomics support numerous polyploidization events and hypotheses for the evolution of rhizobial nitrogen-fixing symbiosis in Fabaceae. Mol Plant 14, 748-773.

DOI:10.1016/j.molp.2021.02.006 URL [本文引用: 4]

[81]

Zheng

Y

, Jiao

C

, Sun

HH

, Rosli

HG

, Pombo

MA

, Zhang

PF

, Banf

M

, Dai

XB

, Martin

GB

, Giovannoni

JJ

, Zhao

PX

, Rhee

SY

, Fei

ZJ

(2016). iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases. Mol Plant 9, 1667-1670.

[本文引用: 1]

[82]

Zwaenepoel

A

, Van de Peer

Y

(2019). Wgd-simple command line tools for the analysis of ancient whole-genome duplications. Bioinformatics 35, 2153-2155.

DOI:10.1093/bioinformatics/bty915 PMID:30398564 [本文引用: 1]

Ancient whole-genome duplications (WGDs) have been uncovered in almost all major lineages of life on Earth and the search for traces or remnants of such events has become standard practice in most genome analyses. This is especially true for plants, where ancient WGDs are abundant. Common approaches to find evidence for ancient WGDs include the construction of KS distributions and the analysis of intragenomic colinearity. Despite the increased interest in WGDs and the acknowledgment of their evolutionary importance, user-friendly and comprehensive tools for their analysis are lacking. Here, we present an easy to use command-line tool for KS distribution construction named wgd. The wgd suite provides commonly used KS and colinearity analysis workflows together with tools for modeling and visualization, rendering these analyses accessible to genomics researchers in a convenient manner.wgd is free and open source software implemented in Python and is available at https://github.com/arzwa/wgd.Supplementary data are available at Bioinformatics online.© The Author(s) 2018. Published by Oxford University Press.

低温胁迫下植物的表型及生理响应机制研究进展

1

2019

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

转录组测序揭示翼盖蕨(Didymochlaena trancatula)的全基因组复制历史

1

2019

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

全基因组复制事件的绝对定年揭示莲座蕨属植物的迟滞演化

1

2021

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

兰科植物染色体研究现状及前景

1

2019

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

染色体数目减少及B染色体产生的进化基因组学模型

1

2020

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

光质和光敏色素在植物逆境响应中的作用研究进展

1

2014

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

1

2006

... 从NCBI网站SRA数据库检索下载杏黄兜兰(Paphiopedilum armeniacum S.C.Chen & F.Y.Liu) (2n=26)、同色兜兰(P. concolor (Lindl. ex Bateman) Pfitzer) (2n=26)、带叶兜兰(P. hirsutissimum (Lindl. ex Hook.) Stein) (2n=26)以及麻栗坡兜兰(P. malipoense S.C. Chen & Z.H.Tsi) (2n=26)转录组测序的原始数据(raw data) (Cox et al., 1998; 杨志娟, 2006; Li et al., 2014; Zhang et al., 2017; Fang et al., 2020), 用于后续的组装与分析.同时, 从NCBI网站Genome数据库下载深圳拟兰(Apostasia shenzhenica Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

Polyploidy and genome evolution in plants

1

2005

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution

1

2017

... 基于同义替换速度恒定的假定前提, 根据物种内旁系同源基因K_s分布的峰值和公式T=K_s/2r, 采用深圳拟兰的绝对定年时间, 推算4种兜兰全基因组复制事件的发生时间(Badouin et al., 2017; Zhang et al., 2017).先依据深圳拟兰的绝对定年信息(K_s=1, T=74 Mya) (Zhang et al., 2017)和公式T=K_s/2r, 推算出深圳拟兰的r=6.76×10^-9(同义替换/位点/年); 然后根据正态分布拟合得到的K_s峰值, 采用深圳拟兰的r值, 推算4种兜兰全基因组复制事件的发生时间. ...

Widespread paleopolyploidy in model plant species inferred from age distributions of duplicate genes

1

2004

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

Trimmomatic: a flexible trimmer for Illumina sequence data

1

2014

... 借助SRA Toolkit v2.10.8中的fastq-dump命令从原始数据中提取获得fastq文件, 参数为--gzip --split-e (https://github.com/ncbi/sra-tools) (Leinonen et al., 2011).利用Trimmomatic v0.39软件对fastq文件进行质控处理(参数设置: PE ILLUMINACLIP: TruSeq3- PE.fa:2: 30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW: 4:15 MINLEN:50 TOPHRED33) (Bolger et al., 2014), 过滤去除接头序列及低质量碱基等, 获取高质量数据(clean data)用于后续组装. ...

How tree roots respond to drought

1

2015

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

The genome sequence of the orchid Phalaenopsis equestris

3

2015

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis

1

2000

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

An updated classification of Orchidaceae

1

2015

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

1

2009

... 杓兰亚科具有囊状或倒盔状唇瓣、2个可育雄蕊和1个盾状退化雄蕊等特征, 是兰科多样性的重要代表类群之一, 包括杓兰属(Cypripedium)、南美杓兰属(Selenipedium)、美洲兜兰属(Phragmipedium)、镊萼兜兰属(Mexipedium)及兜兰属(Paphiopedilum) 5个属(Cox et al., 1997; Chen et al., 2009).其中, 兜兰属是杓兰亚科最大的属, 约100多种, 占杓兰亚科总物种数一半以上(Govaerts et al., 2021).兜兰属植物的基因组普遍较大并存在一定程度的变异(16.5-35.9 pg/C), 且染色体数目变异丰富(2n=26-42) (Leitch et al., 2009).因此, 我们推测兜兰属可能存在全基因组复制事件, 然而, 在过去样本量小、跨大尺度的研究中并未检测到兜兰属特异发生的全基因组复制事件.因此, 本研究基于NCBI共享数据, 即杏黄兜兰(Paphiopedilum armeniacum)、同色兜兰(P. concolor)、带叶兜兰(P. hirsutissimum)以及麻栗坡兜兰(P. malipoense)的转录组数据, 采用经典的同义替换率(K_s)、系统发生基因组学以及相对定年的方法对其进行全基因组复制事件检测, 进而开展以下研究: (1) 过去未检测到全基因组复制事件历史的兜兰属植物是否发生了全基因组复制事件; (2) 若发生了全基因组复制事件, 进一步分析其发生时间, 以及是否为兜兰属内发生的全基因组复制事件; (3) 全基因组复制事件的发生对于兜兰属植物适应性演化的意义. ...

The advantages and disadvantages of being polyploid

1

2005

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

Genome size and karyotype evolution in the slipper orchids (Cypripedioideae: Orchidaceae)

1

1998

... 从NCBI网站SRA数据库检索下载杏黄兜兰(Paphiopedilum armeniacum S.C.Chen & F.Y.Liu) (2n=26)、同色兜兰(P. concolor (Lindl. ex Bateman) Pfitzer) (2n=26)、带叶兜兰(P. hirsutissimum (Lindl. ex Hook.) Stein) (2n=26)以及麻栗坡兜兰(P. malipoense S.C. Chen & Z.H.Tsi) (2n=26)转录组测序的原始数据(raw data) (Cox et al., 1998; 杨志娟, 2006; Li et al., 2014; Zhang et al., 2017; Fang et al., 2020), 用于后续的组装与分析.同时, 从NCBI网站Genome数据库下载深圳拟兰(Apostasia shenzhenica Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

Phylogenetics of the slipper orchids (Cypripedioideae, Or-chidaceae): nuclear rDNA ITS sequences

1

1997

... 杓兰亚科具有囊状或倒盔状唇瓣、2个可育雄蕊和1个盾状退化雄蕊等特征, 是兰科多样性的重要代表类群之一, 包括杓兰属(Cypripedium)、南美杓兰属(Selenipedium)、美洲兜兰属(Phragmipedium)、镊萼兜兰属(Mexipedium)及兜兰属(Paphiopedilum) 5个属(Cox et al., 1997; Chen et al., 2009).其中, 兜兰属是杓兰亚科最大的属, 约100多种, 占杓兰亚科总物种数一半以上(Govaerts et al., 2021).兜兰属植物的基因组普遍较大并存在一定程度的变异(16.5-35.9 pg/C), 且染色体数目变异丰富(2n=26-42) (Leitch et al., 2009).因此, 我们推测兜兰属可能存在全基因组复制事件, 然而, 在过去样本量小、跨大尺度的研究中并未检测到兜兰属特异发生的全基因组复制事件.因此, 本研究基于NCBI共享数据, 即杏黄兜兰(Paphiopedilum armeniacum)、同色兜兰(P. concolor)、带叶兜兰(P. hirsutissimum)以及麻栗坡兜兰(P. malipoense)的转录组数据, 采用经典的同义替换率(K_s)、系统发生基因组学以及相对定年的方法对其进行全基因组复制事件检测, 进而开展以下研究: (1) 过去未检测到全基因组复制事件历史的兜兰属植物是否发生了全基因组复制事件; (2) 若发生了全基因组复制事件, 进一步分析其发生时间, 以及是否为兜兰属内发生的全基因组复制事件; (3) 全基因组复制事件的发生对于兜兰属植物适应性演化的意义. ...

Widespread genome duplications throughout the history of flowering plants

1

2006

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

Characterization of the species Epidendrum cinnabarium salzm. (Epidendroideae: Orchidaceae) occurring in dunas do abaeté-salvador, ba-brasil

1

2006

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

ProtTest 3: fast selection of best-fit models of protein evolution

1

2011

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

Reactive oxygen species (ROS) and response of antioxidants as ROS-scavengers during environmental stress in plants

1

2014

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

Genome duplication and the origin of angiosperms

3

2005

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... ), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

MUSCLE: multiple sequence alignment with high accuracy and high throughput

2

2004

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

OrthoFinder: phylogenetic orthology inference for comparative genomics

1

2019

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

An efficient algorithm for large-scale detection of protein families

1

2002

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

Transcriptome analysis provides insights into the non-methylated lignin synthesis in Paphiopedilum armeniacum seed

2

2020

... 从NCBI网站SRA数据库检索下载杏黄兜兰(Paphiopedilum armeniacum S.C.Chen & F.Y.Liu) (2n=26)、同色兜兰(P. concolor (Lindl. ex Bateman) Pfitzer) (2n=26)、带叶兜兰(P. hirsutissimum (Lindl. ex Hook.) Stein) (2n=26)以及麻栗坡兜兰(P. malipoense S.C. Chen & Z.H.Tsi) (2n=26)转录组测序的原始数据(raw data) (Cox et al., 1998; 杨志娟, 2006; Li et al., 2014; Zhang et al., 2017; Fang et al., 2020), 用于后续的组装与分析.同时, 从NCBI网站Genome数据库下载深圳拟兰(Apostasia shenzhenica Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

... The statistics of raw data and de novo assembly

Table 1

	Paphiopedilum concolor	P. hirsutissimum	P. malipoense	P. armeniacum
Accession number	SRR1405683	SRR1405685	SRR5722160	SRR9842184
Tissues	Leaf	Leaf	Stem	Seed
Bases (Gb)	3.6	3	14.1	8
Number of transcripts	156581	76006	239105	164515
Average length of transcript (bp)	907.1	1162.3	884.5	993.2
N50 of transcript (bp)	1486	1971	1627	1856
Number of unigenes	116919	62565	201606	139203
Average length of unigene (bp)	863.3	1071.1	815.6	906.4
N50 of unigene (bp)	1438	1829	1480	1704
Source of raw data	Li et al., 2014	Li et al., 2014	Zhang et al., 2017	Fang et al., 2020

为评估组装的完整性, 基于包含1 614个单拷贝基因的embryophyta_odb10数据库进行BUSCO评估, 完整覆盖基因的比例(complete BUSCOs)分别为麻栗坡兜兰(94.0%)>杏黄兜兰(92.5%)>同色兜兰(88.5%)>带叶兜兰(86.9%) (图1).BUSCO评估结果显示, 组装完整性较高, 可用于后续分析. ...

A codon-based model of nucleotide substitution for protein-coding DNA sequences

1

1994

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

World checklist of Orchidaceae

1

2021

... 杓兰亚科具有囊状或倒盔状唇瓣、2个可育雄蕊和1个盾状退化雄蕊等特征, 是兰科多样性的重要代表类群之一, 包括杓兰属(Cypripedium)、南美杓兰属(Selenipedium)、美洲兜兰属(Phragmipedium)、镊萼兜兰属(Mexipedium)及兜兰属(Paphiopedilum) 5个属(Cox et al., 1997; Chen et al., 2009).其中, 兜兰属是杓兰亚科最大的属, 约100多种, 占杓兰亚科总物种数一半以上(Govaerts et al., 2021).兜兰属植物的基因组普遍较大并存在一定程度的变异(16.5-35.9 pg/C), 且染色体数目变异丰富(2n=26-42) (Leitch et al., 2009).因此, 我们推测兜兰属可能存在全基因组复制事件, 然而, 在过去样本量小、跨大尺度的研究中并未检测到兜兰属特异发生的全基因组复制事件.因此, 本研究基于NCBI共享数据, 即杏黄兜兰(Paphiopedilum armeniacum)、同色兜兰(P. concolor)、带叶兜兰(P. hirsutissimum)以及麻栗坡兜兰(P. malipoense)的转录组数据, 采用经典的同义替换率(K_s)、系统发生基因组学以及相对定年的方法对其进行全基因组复制事件检测, 进而开展以下研究: (1) 过去未检测到全基因组复制事件历史的兜兰属植物是否发生了全基因组复制事件; (2) 若发生了全基因组复制事件, 进一步分析其发生时间, 以及是否为兜兰属内发生的全基因组复制事件; (3) 全基因组复制事件的发生对于兜兰属植物适应性演化的意义. ...

Reassessing the temporal evolution of orchids with new fossils and a Bayesian relaxed clock, with implications for the diversification of the rare South American genus Hoffmann seggella (Orchidaceae: Epidendroideae)

1

2010

... 综合类群分化的时间信息、物种间K_s检测结果以及tree2gd检测结果, 进一步分析WGD3在兰科中的系统发生位置.兰科5个亚科的亲缘关系为(拟兰亚科(香荚兰亚科(杓兰亚科(兰亚科, 树兰亚科)))), 其中杓兰亚科与姐妹类群的分化时间约为64.97 Mya (48.54-84.93 Mya) (Kim et al., 2020), 冠群时间为33 Mya (19-50 Mya) (Gustafsson et al., 2010), 而WGD3的发生时间为38.19-45.85 Mya, 初步推测WGD3为杓兰亚科特异发生的全基因组复制事件.杓兰亚科包含5个属, 其亲缘关系为(杓兰属(南美杓兰属(兜兰属(美洲兜兰属, 镊萼兜兰属)))), 兜兰属与姐妹类群的分化时间为29.9 Mya (14.6-39.1 Mya) (http://www.timetree.org/), 冠群时间为7.09 Mya (5.88-8.41 Mya) (Tsai et al., 2020), 且物种间K_s分析结果(图2)和tree2gd检测结果(图3)均提示WGD3发生在兜兰物种间分化之前, 推测WGD3可能发生在兜兰属与美洲兜兰属、镊萼兜兰属分化之前.综上, 初步推测WGD3发生在杓兰亚科与姐妹类群分化之后, 兜兰属与美洲兜兰属、镊萼兜兰属分化之前. ...

De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis

1

2013

... 采用Trinity v2.11.0对高质量数据进行de novo组装(Haas et al., 2013), 参数设置为--seqType fq --min_ kmer_cov 2 --normalize_reads --bflyCalculateCPU.随后, 利用cd-hit v4.8.1将相似性≥95%的转录本(transcript)聚为一组(参数: -c 0.95), 每一组聚类中输出最长序列, 得到非冗余的单基因簇(unigene) (Li and Godzik, 2006).基于embryophyta_odb10数据库, 利用BUSCO v4.0.6软件对组装获得的转录本进行完整性评估(Simão et al., 2015). ...

A phased Vanilla planifolia genome enables genetic improvement of flavour and production

3

2020

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Recurrent genome duplication events likely contributed to both the ancient and recent rise of ferns

3

2020

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... ).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

Fast genome- wide functional annotation through orthology assignment by eggNOG-mapper

1

2017

... 首先, 分别对每种兜兰的所有蛋白序列进行功能注释.使用eggNOG-mapper v2.0.6软件, 基于eggNOG v5.0.1数据库对预测获得的蛋白序列进行功能注释(Huerta-Cepas et al., 2017, 2019), 注释结果用于分析保留复制基因的功能富集.然后, 根据高斯混合模型拟合显著存在的峰值, 分别提取4种兜兰各峰值95%置信区间的基因作为全基因组复制事件中保留的复制基因, 对其进行GO功能富集分析.借助R包AnnotationForge, 基于4种兜兰的功能注释结果, 为每个物种分别构建数据库(https://bioconductor.org/packages/AnnotationForge/); 利用clusterProfiler分别对每种兜兰各全基因组复制事件中保留的复制基因进行GO功能富集分析(P<0.05) (Yu et al., 2012).GO功能富集结果采用R包ggplot2 (https://github.com/tidyverse/ggplot2)和pheatmap (https://github.com/raivokolde/pheatmap)进行可视化. ...

eggNOG 5.0: a hierarchical, functionally and phylogenetically anno- tated orthology resource based on 5090 organisms and 2502 viruses

1

2019

... 首先, 分别对每种兜兰的所有蛋白序列进行功能注释.使用eggNOG-mapper v2.0.6软件, 基于eggNOG v5.0.1数据库对预测获得的蛋白序列进行功能注释(Huerta-Cepas et al., 2017, 2019), 注释结果用于分析保留复制基因的功能富集.然后, 根据高斯混合模型拟合显著存在的峰值, 分别提取4种兜兰各峰值95%置信区间的基因作为全基因组复制事件中保留的复制基因, 对其进行GO功能富集分析.借助R包AnnotationForge, 基于4种兜兰的功能注释结果, 为每个物种分别构建数据库(https://bioconductor.org/packages/AnnotationForge/); 利用clusterProfiler分别对每种兜兰各全基因组复制事件中保留的复制基因进行GO功能富集分析(P<0.05) (Yu et al., 2012).GO功能富集结果采用R包ggplot2 (https://github.com/tidyverse/ggplot2)和pheatmap (https://github.com/raivokolde/pheatmap)进行可视化. ...

A high-quality carrot genome assembly provides new insights into carotenoid accumulation and asterid genome evolution

1

2016

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Plastome evolution and phylogeny of Orchidaceae, with 24 new sequences

1

2020

... 综合类群分化的时间信息、物种间K_s检测结果以及tree2gd检测结果, 进一步分析WGD3在兰科中的系统发生位置.兰科5个亚科的亲缘关系为(拟兰亚科(香荚兰亚科(杓兰亚科(兰亚科, 树兰亚科)))), 其中杓兰亚科与姐妹类群的分化时间约为64.97 Mya (48.54-84.93 Mya) (Kim et al., 2020), 冠群时间为33 Mya (19-50 Mya) (Gustafsson et al., 2010), 而WGD3的发生时间为38.19-45.85 Mya, 初步推测WGD3为杓兰亚科特异发生的全基因组复制事件.杓兰亚科包含5个属, 其亲缘关系为(杓兰属(南美杓兰属(兜兰属(美洲兜兰属, 镊萼兜兰属)))), 兜兰属与姐妹类群的分化时间为29.9 Mya (14.6-39.1 Mya) (http://www.timetree.org/), 冠群时间为7.09 Mya (5.88-8.41 Mya) (Tsai et al., 2020), 且物种间K_s分析结果(图2)和tree2gd检测结果(图3)均提示WGD3发生在兜兰物种间分化之前, 推测WGD3可能发生在兜兰属与美洲兜兰属、镊萼兜兰属分化之前.综上, 初步推测WGD3发生在杓兰亚科与姐妹类群分化之后, 兜兰属与美洲兜兰属、镊萼兜兰属分化之前. ...

Temperate rainforests near the South Pole during peak Cretaceous warmth

1

2020

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

The sequence read archive

1

2011

... 借助SRA Toolkit v2.10.8中的fastq-dump命令从原始数据中提取获得fastq文件, 参数为--gzip --split-e (https://github.com/ncbi/sra-tools) (Leinonen et al., 2011).利用Trimmomatic v0.39软件对fastq文件进行质控处理(参数设置: PE ILLUMINACLIP: TruSeq3- PE.fa:2: 30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW: 4:15 MINLEN:50 TOPHRED33) (Bolger et al., 2014), 过滤去除接头序列及低质量碱基等, 获取高质量数据(clean data)用于后续组装. ...

Genome size diversity in orchids: consequences and evolution

1

2009

... 杓兰亚科具有囊状或倒盔状唇瓣、2个可育雄蕊和1个盾状退化雄蕊等特征, 是兰科多样性的重要代表类群之一, 包括杓兰属(Cypripedium)、南美杓兰属(Selenipedium)、美洲兜兰属(Phragmipedium)、镊萼兜兰属(Mexipedium)及兜兰属(Paphiopedilum) 5个属(Cox et al., 1997; Chen et al., 2009).其中, 兜兰属是杓兰亚科最大的属, 约100多种, 占杓兰亚科总物种数一半以上(Govaerts et al., 2021).兜兰属植物的基因组普遍较大并存在一定程度的变异(16.5-35.9 pg/C), 且染色体数目变异丰富(2n=26-42) (Leitch et al., 2009).因此, 我们推测兜兰属可能存在全基因组复制事件, 然而, 在过去样本量小、跨大尺度的研究中并未检测到兜兰属特异发生的全基因组复制事件.因此, 本研究基于NCBI共享数据, 即杏黄兜兰(Paphiopedilum armeniacum)、同色兜兰(P. concolor)、带叶兜兰(P. hirsutissimum)以及麻栗坡兜兰(P. malipoense)的转录组数据, 采用经典的同义替换率(K_s)、系统发生基因组学以及相对定年的方法对其进行全基因组复制事件检测, 进而开展以下研究: (1) 过去未检测到全基因组复制事件历史的兜兰属植物是否发生了全基因组复制事件; (2) 若发生了全基因组复制事件, 进一步分析其发生时间, 以及是否为兜兰属内发生的全基因组复制事件; (3) 全基因组复制事件的发生对于兜兰属植物适应性演化的意义. ...

Transcriptome analysis of tessellated and green leaves in Paphiopedilum orchids using Illumina paired-end sequencing and discovery simple sequence repeat markers

3

2014

... 从NCBI网站SRA数据库检索下载杏黄兜兰(Paphiopedilum armeniacum S.C.Chen & F.Y.Liu) (2n=26)、同色兜兰(P. concolor (Lindl. ex Bateman) Pfitzer) (2n=26)、带叶兜兰(P. hirsutissimum (Lindl. ex Hook.) Stein) (2n=26)以及麻栗坡兜兰(P. malipoense S.C. Chen & Z.H.Tsi) (2n=26)转录组测序的原始数据(raw data) (Cox et al., 1998; 杨志娟, 2006; Li et al., 2014; Zhang et al., 2017; Fang et al., 2020), 用于后续的组装与分析.同时, 从NCBI网站Genome数据库下载深圳拟兰(Apostasia shenzhenica Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

... The statistics of raw data and de novo assembly

Table 1

	Paphiopedilum concolor	P. hirsutissimum	P. malipoense	P. armeniacum
Accession number	SRR1405683	SRR1405685	SRR5722160	SRR9842184
Tissues	Leaf	Leaf	Stem	Seed
Bases (Gb)	3.6	3	14.1	8
Number of transcripts	156581	76006	239105	164515
Average length of transcript (bp)	907.1	1162.3	884.5	993.2
N50 of transcript (bp)	1486	1971	1627	1856
Number of unigenes	116919	62565	201606	139203
Average length of unigene (bp)	863.3	1071.1	815.6	906.4
N50 of unigene (bp)	1438	1829	1480	1704
Source of raw data	Li et al., 2014	Li et al., 2014	Zhang et al., 2017	Fang et al., 2020

为评估组装的完整性, 基于包含1 614个单拷贝基因的embryophyta_odb10数据库进行BUSCO评估, 完整覆盖基因的比例(complete BUSCOs)分别为麻栗坡兜兰(94.0%)>杏黄兜兰(92.5%)>同色兜兰(88.5%)>带叶兜兰(86.9%) (图1).BUSCO评估结果显示, 组装完整性较高, 可用于后续分析. ...

... Li et al., 2014 Zhang et al., 2017 Fang et al., 2020

为评估组装的完整性, 基于包含1 614个单拷贝基因的embryophyta_odb10数据库进行BUSCO评估, 完整覆盖基因的比例(complete BUSCOs)分别为麻栗坡兜兰(94.0%)>杏黄兜兰(92.5%)>同色兜兰(88.5%)>带叶兜兰(86.9%) (图1).BUSCO评估结果显示, 组装完整性较高, 可用于后续分析. ...

A molecular phylogeny of Chinese orchids

2016

Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences

1

2006

... 采用Trinity v2.11.0对高质量数据进行de novo组装(Haas et al., 2013), 参数设置为--seqType fq --min_ kmer_cov 2 --normalize_reads --bflyCalculateCPU.随后, 利用cd-hit v4.8.1将相似性≥95%的转录本(transcript)聚为一组(参数: -c 0.95), 每一组聚类中输出最长序列, 得到非冗余的单基因簇(unigene) (Li and Godzik, 2006).基于embryophyta_odb10数据库, 利用BUSCO v4.0.6软件对组装获得的转录本进行完整性评估(Simão et al., 2015). ...

Post-polyploid diploidization and diversification through dysploid changes

4

2018

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... ; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

... ).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

The paleocene-eocene thermal maximum: a perturbation of carbon cycle, climate, and biosphere with implications for the future

1

2011

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

The pineapple genome and the evolution of CAM photosynthesis

1

2015

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Reconstructing the genome of the most recent common ancestor of flowering plants

1

2017

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

Species-rich and polyploid-poor: insights into the evolutionary role of whole-genome duplication from the Cape flora biodiversity hotspot

1

2016

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

One thousand plant transcriptomes and the phylogenomics of green plants

4

2019

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Ancient polyploidization predating divergence of the cereals, and its consequences for comparative genomics

1

2004

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

Widespread whole genome duplications contribute to genome complexity and species diversity in angiosperms

2

2018

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

Identification and characterization of shared duplications between rice and wheat provide new insight into grass genome evolution

1

2008

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

Mining EST databases to resolve evolutionary events in major crop species

1

2004

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

Mclust 5: clustering, classification and density estimation using Gaussian finite mixture models

1

2016

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs

1

2015

... 采用Trinity v2.11.0对高质量数据进行de novo组装(Haas et al., 2013), 参数设置为--seqType fq --min_ kmer_cov 2 --normalize_reads --bflyCalculateCPU.随后, 利用cd-hit v4.8.1将相似性≥95%的转录本(transcript)聚为一组(参数: -c 0.95), 每一组聚类中输出最长序列, 得到非冗余的单基因簇(unigene) (Li and Godzik, 2006).基于embryophyta_odb10数据库, 利用BUSCO v4.0.6软件对组装获得的转录本进行完整性评估(Simão et al., 2015). ...

Genome sequence and genetic diversity of European ash trees

1

2017

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies

1

2014

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments

1

2007

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

Phylogeny and historical biogeography of Paphiopedilum pfitzer (Orchidaceae) based on nuclear and plastid DNA

1

2020

... 综合类群分化的时间信息、物种间K_s检测结果以及tree2gd检测结果, 进一步分析WGD3在兰科中的系统发生位置.兰科5个亚科的亲缘关系为(拟兰亚科(香荚兰亚科(杓兰亚科(兰亚科, 树兰亚科)))), 其中杓兰亚科与姐妹类群的分化时间约为64.97 Mya (48.54-84.93 Mya) (Kim et al., 2020), 冠群时间为33 Mya (19-50 Mya) (Gustafsson et al., 2010), 而WGD3的发生时间为38.19-45.85 Mya, 初步推测WGD3为杓兰亚科特异发生的全基因组复制事件.杓兰亚科包含5个属, 其亲缘关系为(杓兰属(南美杓兰属(兜兰属(美洲兜兰属, 镊萼兜兰属)))), 兜兰属与姐妹类群的分化时间为29.9 Mya (14.6-39.1 Mya) (http://www.timetree.org/), 冠群时间为7.09 Mya (5.88-8.41 Mya) (Tsai et al., 2020), 且物种间K_s分析结果(图2)和tree2gd检测结果(图3)均提示WGD3发生在兜兰物种间分化之前, 推测WGD3可能发生在兜兰属与美洲兜兰属、镊萼兜兰属分化之前.综上, 初步推测WGD3发生在杓兰亚科与姐妹类群分化之后, 兜兰属与美洲兜兰属、镊萼兜兰属分化之前. ...

Phylotranscriptomic analysis and genome evolution of the Cypripedioideae (Orchidaceae)

1

2018

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

Fatty acid unsaturation, mobilization, and regulation in the response of plants to stress

1

2008

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

Polyploidy: an evolutionary and ecological force in stressful times

1

2021

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

The evolutionary significance of polyploidy

3

2017

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... ; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

Evidence for Cretaceous-Paleogene boundary bolide ‘impact winter' conditions from New Jersey, USA

2

2016

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

Abscisic acid signaling and abiotic stress tolerance in plants: a review on current knowledge and future prospects

1

2017

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

Molecular footprints of selection effects and whole genome duplication (WGD) events in three blueberry species: detected by transcriptome dataset

1

2020

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Draft genome sequence of Camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality

1

2018

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Genome evolution in polyploids

1

2000

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

Genetic contribution of paleopolyploidy to adaptive evolution in angiosperms

4

2020

... 全基因组复制使得染色体和基因组内全部基因均发生加倍, 为新性状演化和物种多样化提供了遗传材料(De Bodt et al., 2005; Wu et al., 2020).而全基因组复制后的基因丢失、沉默、亚功能化和新功能化等基因水平的变异, 以及染色体重组等染色体水平变异促进了表型和物种的多样化(Wendel, 2000; Adams and Wendel, 2005; Mandáková and Lysak, 2018).此外, 全基因组复制及后续变异导致一些类群染色体数目变异(Mandáková and Lysak, 2018).以单子叶植物禾本科为例, 禾本科祖先的染色体基数为7条, 而在经历了全基因组复制事件后(Paterson et al., 2004; Salse et al., 2008), 水稻(Oryza sativa)、高粱(Sorghum bicolor)和谷子(Setaria italica)的染色体基数并未达到加倍后的14条, 而是表现为染色体数目不同程度地减少, 分别为12、10和9条(Murat et al., 2017; 王振怡和王希胤, 2020).因此, 染色体数目的变化是全基因组复制发生及后续演化进程的重要特征之一. ...

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

... 上述分析结果与前人有关被子植物主要分支的研究结论一致(Wu et al., 2020).Wu等(2020)对包括被子植物主要分支在内的25个物种(双子叶植物10种, 单子叶植物12种, 基部被子植物、石松类植物和苔藓各1种)在3个历史时期(约120 Mya、约66 Mya及<20 Mya)发生全基因组复制后的保留复制基因进行了功能富集分析, 发现不同时期多倍化后的保留复制基因在功能上与当时的环境压力一致.(1) 在约120 Mya发生全基因组复制事件后的复制基因主要在响应缺水和盐胁迫的功能上显著富集, 当时地球正处于干旱环境; (2) 在K-Pg灭绝时期(约66 Mya)发生了全球变冷、黑暗、酸雨和野火等, 该时期富集到了与冷、热、渗透、盐和水等胁迫相关的功能, 以及脱落酸激活的信号通路等与胁迫响应相关的其它生物学过程; (3) 在约20 Mya发生全基因组复制事件后的复制基因主要在响应盐胁迫、缺水和机械伤害的功能上显著富集, 与当时CO₂浓度低和相对低温有关. ...

... ).Wu等(2020)对包括被子植物主要分支在内的25个物种(双子叶植物10种, 单子叶植物12种, 基部被子植物、石松类植物和苔藓各1种)在3个历史时期(约120 Mya、约66 Mya及<20 Mya)发生全基因组复制后的保留复制基因进行了功能富集分析, 发现不同时期多倍化后的保留复制基因在功能上与当时的环境压力一致.(1) 在约120 Mya发生全基因组复制事件后的复制基因主要在响应缺水和盐胁迫的功能上显著富集, 当时地球正处于干旱环境; (2) 在K-Pg灭绝时期(约66 Mya)发生了全球变冷、黑暗、酸雨和野火等, 该时期富集到了与冷、热、渗透、盐和水等胁迫相关的功能, 以及脱落酸激活的信号通路等与胁迫响应相关的其它生物学过程; (3) 在约20 Mya发生全基因组复制事件后的复制基因主要在响应盐胁迫、缺水和机械伤害的功能上显著富集, 与当时CO₂浓度低和相对低温有关. ...

PAML 4: phylogenetic analysis by maximum likelihood

1

2007

... 根据文献报道的方法计算物种内旁系同源基因对的K_s值(Sollars et al., 2017), 并对其进行正态分布拟合, 以检测全基因组复制事件.首先, 分别对各物种的蛋白序列进行all against all序列相似性比对(BLASTP), 阈值设置为e^-5.然后, 应用脚本KSPlotter.py计算每个物种的K_s值(https://github.com/EndymionCooper/KSPlotting).主要步骤为: 使用mclblastline pipeline构建基因家族(Enright et al., 2002), 借助MUSCLE对每个基因家族进行比对(Edgar, 2004), 最后利用CODEML软件(PAML包)计算每个物种的K_s值(Goldman and Yang, 1994; Yang, 2007).为避免随机误差和同义替换饱和效应的影响(Blanc and Wolfe, 2004; Schlueter et al., 2004; Cui et al., 2006), 本研究仅保留0.1-5之间的K_s值用于后续分析.最后, 借助R包mclust中的高斯混合模型对保留的K_s值进行正态分布拟合(Scrucca et al., 2016), 以排除假阳性峰. ...

ClusterProfiler: an R package for comparing biological themes among gene clusters

1

2012

... 首先, 分别对每种兜兰的所有蛋白序列进行功能注释.使用eggNOG-mapper v2.0.6软件, 基于eggNOG v5.0.1数据库对预测获得的蛋白序列进行功能注释(Huerta-Cepas et al., 2017, 2019), 注释结果用于分析保留复制基因的功能富集.然后, 根据高斯混合模型拟合显著存在的峰值, 分别提取4种兜兰各峰值95%置信区间的基因作为全基因组复制事件中保留的复制基因, 对其进行GO功能富集分析.借助R包AnnotationForge, 基于4种兜兰的功能注释结果, 为每个物种分别构建数据库(https://bioconductor.org/packages/AnnotationForge/); 利用clusterProfiler分别对每种兜兰各全基因组复制事件中保留的复制基因进行GO功能富集分析(P<0.05) (Yu et al., 2012).GO功能富集结果采用R包ggplot2 (https://github.com/tidyverse/ggplot2)和pheatmap (https://github.com/raivokolde/pheatmap)进行可视化. ...

The Gastrodia elata genome provides insights into plant adaptation to heterotrophy

3

2018

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Trends, rhythms, and aberrations in global climate 65 Ma to present

1

2001

... 多倍化或全基因组复制, 特别是在稳定环境下, 常被认为是进化的终点(Comai, 2005; Oberlander et al., 2016).然而, 在植物的演化过程中, 全基因组复制并非随机发生, 而是与全球气候变化、地质变化或者大规模灭绝等密切相关, 发生全基因组复制的个体在胁迫或极端环境条件下具有较二倍体祖先更强的适应性(Van de Peer et al., 2017, 2021; Ren et al., 2018; Wu et al., 2020).与上述研究结果相似, 本研究检测到的3次全基因组复制事件发生时期出现了全球气候变化或大规模灭绝事件, 推测全基因组复制事件提高了兜兰属植物祖先应对极端环境变化的适应性.例如, WGD1 (110.17-119.77 Mya)发生在白垩纪(Cretaceous)阿普特阶(Aptian)至阿尔布阶(Albian), 随后出现了超级温室期(83.6-93.9 Mya) (Klages et al., 2020); WGD2 (60.95-74.19 Mya)发生在白垩纪与古近纪(Paleogene)交界, 出现了白垩纪-古近纪灭绝事件(K-Pg灭绝事件) (Vellekoop et al., 2016); WGD3 (38.19-45.85 Mya)发生在古近纪始新世(Eocene), 发生了古新世-始新世极热事件(56 Mya)和始新世-渐新世(Oligocene)全球变冷(Zachos et al., 2001; McInerney and Wing, 2011). ...

Phylotranscriptomic insights into Asteraceae diversity, polyploidy, and morphological innovation

2

2021a

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

Asterid phylogenomics/phylotranscriptomics uncover morphological evolutionary histories and support phylogenetic placement for numerous whole-genome duplications

3

2020

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

... ) (Zhang et al., 2020). ...

... 应用tree2gd v1.0.39软件, 基于系统发生基因组学方法再次进行全基因组复制事件检测, 判定标准参照Zhang等(2020)所述方法.满足以下任一条件则认为发生了全基因组复制事件: (1) 复制基因(gene duplication, GD) >500个, 其中(AB)(AB)类型的复制基因>250个; (2) 复制基因>1 500个, 其中(AB)(AB)类型的复制基因>100个, 且(AB)(AB)类型的复制基因与(AB)A类型或(AB)B类型的复制基因之和>1 000个.tree2gd分析结果(图3)表明, 4种兜兰的祖先(即结点2)保留了556个复制基因, 其中(AB)(AB)类型的复制基因为274个, 满足全基因组复制事件的判定条件, 推测在兜兰属与深圳拟兰分化之后、兜兰属分化之前(即结点3与结点2之间)发生了1次全基因组复制事件,与采用K_s检测到的WGD3相一致.由于tree2gd软件主要基于系统发生基因组学方法进行检测, 因此受样本限制(仅4种兜兰和深圳拟兰)无法检测到兜兰属以外的全基因组复制事件. ...

Transcriptomics and metabolomics reveal purine and phenylpropanoid metabolism response to drought stress in Dendrobium sinense, an endemic orchid species in Hainan island

1

2021b

... 全基因组复制事件保留了部分复制基因, 对保留的复制基因进行功能分析可为阐明全基因组复制事件对植物适应性演化的促进作用提供遗传证据.本研究分别对4种兜兰3次全基因组复制后的保留复制基因进行了GO功能富集分析, 发现3次全基因组复制事件富集到的功能存在差异(图4, 图5).WGD1富集了脂类代谢、软木脂的生物合成、苯丙烷类的合成与代谢, 以及氧化还原酶活性和活性氧代谢过程的调控等功能(图5), 这可能与兜兰属植物应对超级温室期的干旱环境以及抵御干旱引起的活性氧失衡有关(Upchurch, 2008; Das and Roychoudhury, 2014; Brunner et al., 2015; Zhang et al., 2021b).在K-Pg灭绝时期, 大气中充满了灰尘、硫酸盐气溶胶及碳黑颗粒, 黑暗和低温成为主要的胁迫因子(Vellekoop et al., 2016).推测WGD2富集的脱落酸激活的信号通路以及昼夜节律等功能提高了兜兰属植物祖先对当时剧变环境的适应性(图4, 图5) (杨有新等, 2014; Vishwakarma et al., 2017).WGD3之后, 兜兰属植物祖先经历了全球温度骤降, 推测富集的磷脂代谢、酶联受体蛋白信号通路、色素沉着, 以及保卫细胞分化与发育、根表皮细胞分化与毛状体分化等功能, 可能与应对低温引起的植物萎蔫、叶绿素含量减少以及细胞膜发生相变有关(王芳等, 2019).综上, 推测保留的复制基因在功能上与当时特定的胁迫因子相关. ...

The Apostasia genome and the evolution of orchids

7

2017

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 从NCBI网站SRA数据库检索下载杏黄兜兰(Paphiopedilum armeniacum S.C.Chen & F.Y.Liu) (2n=26)、同色兜兰(P. concolor (Lindl. ex Bateman) Pfitzer) (2n=26)、带叶兜兰(P. hirsutissimum (Lindl. ex Hook.) Stein) (2n=26)以及麻栗坡兜兰(P. malipoense S.C. Chen & Z.H.Tsi) (2n=26)转录组测序的原始数据(raw data) (Cox et al., 1998; 杨志娟, 2006; Li et al., 2014; Zhang et al., 2017; Fang et al., 2020), 用于后续的组装与分析.同时, 从NCBI网站Genome数据库下载深圳拟兰(Apostasia shenzhenica Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

... Z.J.Liu & L.J. Chen)基因组数据(GCA_002786265.1) (Zhang et al., 2017)用于物种间直系同源基因的K_s分析.将拟兰作为基于系统发生基因组学检测全基因组复制事件的外类群. ...

... 基于同义替换速度恒定的假定前提, 根据物种内旁系同源基因K_s分布的峰值和公式T=K_s/2r, 采用深圳拟兰的绝对定年时间, 推算4种兜兰全基因组复制事件的发生时间(Badouin et al., 2017; Zhang et al., 2017).先依据深圳拟兰的绝对定年信息(K_s=1, T=74 Mya) (Zhang et al., 2017)和公式T=K_s/2r, 推算出深圳拟兰的r=6.76×10^-9(同义替换/位点/年); 然后根据正态分布拟合得到的K_s峰值, 采用深圳拟兰的r值, 推算4种兜兰全基因组复制事件的发生时间. ...

... =1, T=74 Mya) (Zhang et al., 2017)和公式T=K_s/2r, 推算出深圳拟兰的r=6.76×10^-9(同义替换/位点/年); 然后根据正态分布拟合得到的K_s峰值, 采用深圳拟兰的r值, 推算4种兜兰全基因组复制事件的发生时间. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution

3

2016

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 分析上述情况的原因, 我们推测可能与兰科植物种类及类群众多、前期研究样本量小但种类跨度大的研究策略有关.例如, 千种植物转录组项目囊括了兰科7个样本, 但却跨了香荚兰亚科(Vanilloideae)、兰亚科(Orchidoideae)和树兰亚科(Epidendroideae) 3个亚科7个属(One Thousand Plant Transcriptomes Initiative, 2019); 分析全基因组复制事件的5套全基因组数据同样覆盖了拟兰亚科(Apostasioideae)、香荚兰亚科、树兰亚科3个亚科5个属(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020); 关于杓兰亚科基因组进化的研究包括13个兰科植物转录组和基因组数据, 覆盖了兰科所有亚科(拟兰亚科、香荚兰亚科、杓兰亚科(Cypripedioideae)、兰亚科和树兰亚科) 13个属(Unruh et al., 2018).对于兰科这样包含26 000多种的特大类群, 解析其全基因组复制历史需要借助更精细的尺度. ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

Nuclear phylotranscriptomics and phylogenomics support numerous polyploidization events and hypotheses for the evolution of rhizobial nitrogen-fixing symbiosis in Fabaceae

4

2021

... 多倍化(polyploid)或全基因组复制(whole-genome duplication, WGD)是物种多样性发生的重要驱动力(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018), 在植物演化历史中普遍存在, 尤其是维管束植物中多样性最高的类群被子植物和第二大类群蕨类反复发生过多轮全基因组复制(One Thousand Plant Transcriptomes Initiative, 2019; 汪浩等, 2019; Huang et al., 2020; 王婷等, 2021).基于现有证据, 在蕨类植物、被子植物第一大科菊科(Asteraceae)、第三大科豆科(Fabaceae)中分别检测到19、41、28次全基因组复制事件(Huang et al., 2020; Zhang et al., 2021a; Zhao et al., 2021), 推测多倍化与蕨类植物和被子植物物种多样性较高类群的物种形成和多样化有关(De Bodt et al., 2005; Van de Peer et al., 2017; Mandáková and Lysak, 2018; Ren et al., 2018). ...

... 兰科(Orchidaceae)含700余属、约26 000种, 为被子植物第二大科, 单子叶植物第一大科, 是陆生植物中极具多样性的类群之一(Chase et al., 2015; Li et al., 2016), 同时表现出染色体数目变化较大(染色体基数从x=6到x=120)的特点(Da Conceição et al., 2006; 王筠竹等, 2019), 表明兰科植物的演化过程可能存在多次全基因组复制事件.然而, 目前在兰科植物中已见报道的全基因组复制事件非常有限.基于兰科植物基因组证据(Cai et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; Hasing et al., 2020)以及千种植物转录组项目等转录组分析(One Thousand Plant Transcriptomes Initiative, 2019), 目前仅检测到1次兰科植物特异发生的全基因组复制事件, 与蕨类(Huang et al., 2020)、菊科(Zhang et al., 2021a)和豆科(Zhao et al., 2021)等物种多样性丰富的类群多倍化研究结果不符. ...

... 为验证K_s法检测结果的准确性, 应用tree2gd软件, 基于系统发生基因组学的方法再次检测全基因组复制事件(Zhang et al., 2020; Zhao et al., 2021).(1) 以4种兜兰和深圳拟兰的蛋白序列为输入文件, 利用OrthoFinder v2.5.2筛选单拷贝直系同源基因(Emms and Kelly, 2019).(2) 利用单拷贝直系同源基因构建物种树.首先, 采用MUSCLE v3.8.31对筛选得到的302个单拷贝直系同源基因进行多序列比对(Edgar, 2004); 随后, 基于比对结果使用Gblocks v0.91b筛选保守区域(Castresana, 2000; Talavera and Castresana, 2007), 并将筛选获得的保守区域串联形成多基因矩阵; 最后, 以ProtTest v3.4.2确定的PROTGAMMAJTTF为最优替代模型(Darriba et al., 2011), 利用RAxML v8.2.12软件, 采用最大似然法、基于保守序列矩阵、以深圳拟兰为外类群、在自举检验1 000次的设置下构建系统发生树(Stamatakis, 2014).(3) 以第(2)步构建的系统发生树为物种树, 利用tree2gd v1.0.39软件, 基于默认参数检测全基因组复制事件(https://github.com/Dee-chen/Tree2gd) (Zhang et al., 2020). ...

... 目前, 在兰科植物中仅检测到2次全基因组复制事件, 一次为大多数单子叶植物共享(110-135 Mya), 另一次为现存兰科植物共享(72-78 Mya)(Cai et al., 2015; Ming et al., 2015; Zhang et al., 2016, 2017; Yuan et al., 2018; One Thousand Plant Transcriptomes Initiative, 2019; Hasing et al., 2020).兜兰属是兰科多样性的重要代表类群, 本研究基于4种兜兰的转录组数据, 检测到3次全基因组复制事件, 分别发生在110.17-119.77 Mya (WGD1)、60.95-74.19 Mya (WGD2)和38.19-45.85 Mya (WGD3).其中, WGD1和WGD2发生时间与前期研究得出的2次全基因组复制事件相近, 且物种间K_s分析表明, 二者均发生在兜兰属与深圳拟兰分化事件之前(图2), 因此推测WGD1为大多数单子叶植物共享、WGD2为现存兰科植物共享的全基因组复制事件.而本研究中检测到的全基因组复制事件WGD3 (38.19-45.85 Mya), 在蓝莓(blueberry)、茶树(Camellia sinensis var. sinensis)和胡萝卜(Daucus carota)中同一时期也检测到了全基因组复制事件(Iorizzo et al., 2016; Wei et al., 2018; Wang et al., 2020), 豆科中更是在该段时间检测到大量全基因组复制事件(17次, 23-55 Mya) (Zhao et al., 2021), 但在兰科植物中尚未见报道. ...

iTAK: a program for genome-wide prediction and classification of plant transcription factors, transcriptional regulators, and protein kinases

1

2016

... 在默认设置条件下, 利用TransDecoder v5.5.0对unigene序列进行蛋白编码区预测(https://github.com/TransDecoder/TransDecoder/releases/tag/TransDecoder-v5.5.0), 获得蛋白编码序列(protein co ding sequence, CDS)和相应的蛋白序列.利用iTAK软件, 基于软件内置数据库进行植物转录因子(transcription factors, TFs)预测(Zheng et al., 2016). ...

Wgd-simple command line tools for the analysis of ancient whole-genome duplications

1

2019

... 为分析全基因组复制事件与类群分化间的时间关系, 利用wgd软件, 采用wf2流程分别计算4种兜兰与深圳拟兰、3种兜兰与杏黄兜兰(位于兜兰基部类群)间直系同源基因的K_s值(Zwaenepoel and Van De Peer, 2019).若物种内旁系同源基因的K_s峰值(代表全基因组复制事件)小于物种间直系同源基因的K_s峰值(代表类群分化事件), 则认为全基因组复制事件发生在类群分化事件后; 反之, 则认为全基因组复制事件发生在类群分化事件前(One Thousand Plant Tran scriptomes Initiative, 2019). ...

	Paphiopedilum concolor	P. hirsutissimum	P. malipoense	P. armeniacum
Number of protein coding sequences	56439	33207	79854	58575
Average length of protein coding sequence (bp)	936.1	994.9	829.1	914.7
N50 of protein coding sequence (bp)	1209	1308	1089	1215
Number of CDS identified as transcription factor	1950	1181	2586	2014
Number of transcription factor families	66	67	67	68

Species	No. of components	No. of duplicates	BIC	Variance	Mean (K_s)	Proportion
Paphiopedilum concolor	9	207	-4673.731	0.0000	0.1077	0.0527
	9	385	-4673.731	0.0002	0.1314	0.1048
	9	416	-4673.731	0.0007	0.1740	0.1194
	9	311	-4673.731	0.0026	0.2517	0.0964
	9	522	-4673.731	0.0175	0.5161	0.1544
	9	642	-4673.731	0.0241	0.8236	0.1741
	9	567	-4673.731	0.1557	1.5043	0.1594
	9	300	-4673.731	0.5765	2.4529	0.1148
	9	90	-4673.731	0.1277	4.3036	0.0240
P. hirsutissimum	7	196	-4604.688	0.0001	0.1146	0.0632
	7	277	-4604.688	0.0005	0.1504	0.0991
	7	290	-4604.688	0.0026	0.2292	0.1109
	7	558	-4604.688	0.0282	0.5407	0.2162
	7	508	-4604.688	0.0260	0.8544	0.1693
	7	585	-4604.688	0.2594	1.5894	0.2351
	7	236	-4604.688	0.8550	3.0527	0.1062
Species	No. of components	No. of duplicates	BIC	Variance	Mean (K_s)	Proportion
P. malipoense	9	377	-14027.68	0.0000	0.1081	0.0399
	9	751	-14027.68	0.0003	0.1362	0.0890
	9	820	-14027.68	0.0016	0.2006	0.1005
	9	579	-14027.68	0.0067	0.3290	0.0768
	9	1611	-14027.68	0.0287	0.6196	0.1983
	9	1464	-14027.68	0.0447	1.0026	0.1778
	9	1634	-14027.68	0.1002	1.6186	0.1952
	9	683	-14027.68	0.5901	2.6205	0.1105
	9	105	-14027.68	0.0568	4.5773	0.0121
P. armeniacum	9	206	-8261.607	0.0000	0.1063	0.0359
	9	472	-8261.607	0.0002	0.1296	0.0870
	9	478	-8261.607	0.0008	0.1744	0.0950
	9	400	-8261.607	0.0039	0.2729	0.0849
	9	1089	-8261.607	0.0213	0.5664	0.2105
	9	778	-8261.607	0.0287	0.8825	0.1457
	9	999	-8261.607	0.1443	1.4888	0.1980
	9	455	-8261.607	0.5375	2.4393	0.1195
	9	120	-8261.607	0.1481	4.3256	0.0234

Species	Name of WGD	Mean (K_s)	Age of WGD calculated by K_s mean value (Mya)	Age of WGD with 95% confidence interval (Mya)
Paphiopedilum concolor	WGD3	0.5161	38.19	37.35-39.03
	WGD2	0.8236	60.95	60.06-61.83
	WGD1	1.5043	111.32	108.91-113.72
P. hirsutissimum	WGD3	0.5407	40.01	38.98-41.04
	WGD2	0.8544	63.22	62.19-64.26
	WGD1	1.5894	117.61	114.56-120.67
P. malipoense	WGD3	0.6196	45.85	45.24-46.46
	WGD2	1.0026	74.19	73.39-74.99
	WGD1	1.6186	119.77	118.64-120.91
P. armeniacum	WGD3	0.5664	41.92	41.28-42.56
	WGD2	0.8825	65.31	64.43-66.19
	WGD1	1.4888	110.17	108.42-111.91

基于转录组数据揭示4种兜兰的全基因组复制历史

Revealing the New Whole-genome Duplication Event of Four Paphiopedilum Species Based on Transcriptome Data

1 材料与方法

1.1 测序数据下载

1.2 测序数据提取和质控

1.3 转录组组装和质量评估

1.4 蛋白编码区及转录因子预测

1.5 全基因组复制事件检测

1.6 全基因组复制事件相对定年

1.7 转录组功能注释和复制基因功能富集分析

2 结果与讨论

2.1 原始数据下载、组装和质量评估

图1

2.2 蛋白编码区和转录因子预测

2.3 全基因组复制事件检测

图2

图3

2.4 全基因组复制事件的相对定年

2.5 转录组功能注释和复制基因的功能富集分析

图4

图5

2.6 讨论

参考文献

原文顺序

文献年度倒序

文中引用次数倒序

被引期刊影响因子

基于转录组数据揭示4种兜兰的全基因组复制历史

Revealing the New Whole-genome Duplication Event of Four Paphiopedilum Species Based on Transcriptome Data

1 材料与方法

1.1 测序数据下载

1.2 测序数据提取和质控

1.3 转录组组装和质量评估

1.4 蛋白编码区及转录因子预测

1.5 全基因组复制事件检测

1.6 全基因组复制事件相对定年

1.7 转录组功能注释和复制基因功能富集分析

2 结果与讨论

2.1 原始数据下载、组装和质量评估

图1

2.2 蛋白编码区和转录因子预测

2.3 全基因组复制事件检测

图2

图3

2.4 全基因组复制事件的相对定年

2.5 转录组功能注释和复制基因的功能富集分析

图4

图5

2.6 讨论

参考文献 View Option 原文顺序 文献年度倒序 文中引用次数倒序 被引期刊影响因子

参考文献

原文顺序

文献年度倒序

文中引用次数倒序

被引期刊影响因子