系统发生树构建ppt课件.ppt
《系统发生树构建ppt课件.ppt》由会员分享,可在线阅读,更多相关《系统发生树构建ppt课件.ppt(87页珍藏版)》请在三一办公上搜索。
1、基础生物信息学及应用,李裕强2009.09,第部分生物分子信息的分析,第八章 分子进化分析 系统发生树构建,本章内容:分子进化分析介绍系统发生树构建方法系统发生树构建实例,第一节 分子进化分析介绍,基本概念:系统发生(phylogeny)是指生物形成或进化的历史系统发生学(phylogenetics)研究物种之间的进化关系 系统发生树(phylogenetic tree)表示形式,描述物种之间进化关系,分子进化研究的目的从物种的一些分子特性出发,从而了解物种之间的生物系统发生的关系。蛋白和核酸序列通过序列同源性的比较进而了解基因的进化以及生物系统发生的内在规律,分子进化分析介绍,分子进化分析介
2、绍,分子进化研究的基础基本理论:在各种不同的发育谱系及足够大的进化时间尺度中,许多序列的进化速率几乎是恒定不变的。(分子钟理论, Molecular clock 1965 ),分子进化分析介绍,主要假定条件:To use molecular data to reconstruct evolutionary history requires making a number of reasonable assumptions:The first is that the molecular sequences used in phylogenetic construction are homolog
3、ous, meaning that they share a common origin and subsequently diverged through time.Phylogenetic divergence is assumed to be bifurcating, meaning that a parent branch splits into two daughter branches at any given point. Another assumption in phylogenetics is that each position in a sequence evolved
4、 independently. The variability among sequences is sufficiently informative for constructing unambiguous phylogenetic trees.,分子进化分析介绍,实际情况:虽然很多时候仍然存在争议,但是分子进化确实能阐述一些生物系统发生的内在规律,分子进化分析介绍,直系同源与旁系同源Orthologs(直系同源): Homologous sequences in different species that arose from a common ancestral gene during
5、 speciation; may or may not be responsible for a similar function. Paralogs(旁系同源): Homologous sequences within a single species that arose by gene duplication. 。以上两个概念代表了两个不同的进化事件。用于分子进化分析中的序列必须是直系同源的,才能真实反映进化过程。,分子进化分析介绍,分子进化分析介绍,系统发生树(phylogenetic tree):又名进化树(evolutionary tree)已发展成为多学科交叉形成的一个边缘领域。
6、包括生命科学中的进化论、遗传学、分类学、分子生物学、生物化学、生物物理学和生态学,又包括数学中的概率统计、图论、计算机科学和群论。闻名国际生物学界的美国冷泉港定量生物学会议于1987年特辟出进化树专栏进行学术讨论,标志着该领域已成为现代生物学的前沿之一,迄今仍很活跃。,分子进化分析介绍,分子进化分析介绍,系统发生树结构The lines in the tree are called branches(分支). At the tips of the branches are present-day species or sequences known as taxa (分类,the singul
7、ar form is taxon) or operational taxonomic units(运筹分类单位). The connecting point where two adjacent branches join is called a node(节点), which represents an inferred ancestor of extant taxa. The bifurcating point at the very bottom of the tree is the root node(根节), which represents the common ancestor
8、of all members of the tree.A group of taxa descended from a single common ancestor is defined as a clade or monophyletic group (单源群).The branching pattern in a tree is called tree topology(拓扑结构).,分子进化分析介绍,有根树与无根树树根代表一组分类的共同祖先,分子进化分析介绍,如何确定树根根据外围群:One is to use an outgroup(外围群), which is a sequence t
9、hat is homologous to the sequences under consideration, but separated from those sequences at an early evolutionary time.根据中点:In the absence of a good outgroup, a tree can be rooted using the midpoint rooting approach, in which the midpoint of the two most divergent groups judged by overall branch l
10、engths is assigned as the root.,Rooted by outgroup,分子进化分析介绍,分子进化分析介绍,树形系统发生图(Phylograms):有分支和支长信息分支图( Cladograms)只有分支信息,无支长信息,第二节 系统发生树构建方法,Molecular phylogenetic tree construction can be divided into five steps: (1) choosing molecular markers; (2) performing multiple sequence alignment;(3) choosing
11、 a model of evolution; (4) determining a tree building method; (5) assessing tree reliability.,系统发生树构建方法,(1) Choosing molecular markers For studying very closely related organisms, nucleotide sequences, which evolve more rapidly than proteins, can be used.For studying the evolution of more widely di
12、vergent groups of organisms, one may choose either slowly evolving nucleotide sequences, such as ribosomal RNA or protein sequences.,(2) Performing multiple sequence alignment:Probably the most critical step in the procedure. Only the correct alignment produces correct phylogenetic inference.Multipl
13、e state-of-the-art alignment programs (such as T-Coffee) should be used.Manual editing is often critical in ensuring alignment quality.It is also often necessary to decide whether to use the full alignment or to extract parts of it. Truly ambiguously aligned regions have to be removed from considera
14、tion prior to phylogenetic analysis.,系统发生树构建方法,(2) Performing multiple sequence alignment:Using automatic approach to improving alignment quality. Rascal (ftp:/ftp-igbmc.u-strasbg.fr/pub/RASCAL)and NorMD (ftp:/ftp-igbmc.u-strasbg.fr/pub/NORMD) can help to improve alignment by correcting alignment er
15、rors and removing potentially unrelated or highly divergent sequences. the program Gblocks (http:/woody.embl-heidelberg.de/phylo/ ) can help to detect and eliminate the poorly aligned positions and divergent regions so to make the alignment more suitable for phylogenetic analysis.,系统发生树构建方法,(3) Choo
16、sing a model of evolution:何为进化模型The statistical models used to correct homoplasy(非同源相似,平行演化 ) are called substitution models or evolutionary models.,系统发生树构建方法,(3) Choosing a model of evolution:为何要考虑进化模型:The observed number of substitutions may not represent the true evolutionary events that actually
17、 occurred. For instance,Observed: A replaced by C, actually: ATGCBack mutation (回复突变)could have occurred: GCGParallel mutations(平行突变): both sequences mutate into T,.Such multiple substitutions and convergence at individual positions obscure the estimation of the true evolutionary distances between s
18、equences. This effect is known as homoplasy, which, if not corrected, can lead to the generation of incorrect trees.To correct homoplasy, statistical models are needed to infer the true evolutionary distances between sequences.,系统发生树构建方法,(3) Choosing a model of evolution:For protein sequences, the e
19、volutionary distances from an alignment can be corrected using a PAM or JTT amino acid substitution matrix whose construction already takes into account the multiple substitutions. For constructing DNA phylogenies, there are a number of nucleotide substitution models available.JukesCantor ModelKimur
20、a Model,系统发生树构建方法,(3) Choosing a model of evolution:JukesCantor Model距离函数: dAB = (3/4) ln1 (4/3)pAB如,两20核苷酸序列A、B,有6个不同,则距离为 dAB = 3/4 ln1 (4/3 0.3) = 0.38The JukesCantor model can only handle reasonably closely related sequences.,系统发生树构建方法,(3) Choosing a model of evolution:Kimura Model增加计算转换(transit
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 系统 发生 构建 ppt 课件

链接地址:https://www.31ppt.com/p-1898337.html