• 沒有找到結果。

Integrating Genome

N/A
N/A
Protected

Academic year: 2022

Share "Integrating Genome"

Copied!
74
0
0

加載中.... (立即查看全文)

全文

(1)

Integrating Genome

劉恕維  洪士堯  李宗燁   林修竹  田耕豪  廖奎達 

D. R. Zerbino, B. Paten, and D. Haussler  Science 13 April 2012: 179-182. 

(2)

 Introduction

 Obtaining genome sequence

 Modeling the evolution of genotype

 From genotype to phenotype

 Looking ahead to application

Outline

(3)

Introduction

劉恕維

(4)

1970 Walter Fiers

History – The Pioneer

(5)

History – The technique

(6)

History – Hardware

3000 4000 5000 6000 7000 8000 9000 10000

Moor's law 欄 1

16

10000

(7)

Genotype & Phenotype

http://big5.ifeng.com/gate/big5 /baby.ifeng.com/yuer/special/de tail_2011_01/27/4481382_0.shtml

Autism

(8)

Genotype & Phenotype

A A

A O

B B B O

Phenotype

A B

O O

(9)

 Genome evolution

 Model molecular phenotype as consequen ce of genotype

 Predict organismal phenotype

Challenge research

(10)

Obtaining Genomic Sequences

洪士堯

(11)

 Process of reconstructing an entire ge nome from relatively short random DNA fragments, called reads.

 Detect read overlaps and thereby progr essively reconstitute most of the geno me sequence.

Genome assembly

(12)

Genome assembly

(13)

PCR : Polymerase Chain Reactio n.

Procedure :

Denaturation step

Annealing step

Extension/elongation step

PCR

(14)

DNA replication in the pres ence of both dNTPs and ddNT Ps will terminate the growi ng DNA strand at each base.

In the presence of 5% ddTTP s and 95% dTTPs Taq polymer ase will incorporate a term inating ddTTP at each ‘T’

position in the growing DNA

Chain-termination method

(15)

Gel Electrophoresis se parates DNA by fragmen t size. The larger the DNA piece the slower i t will progress throug h the gel matrix towar d the positive cathode .

Chain-termination method (co

nt)

(16)

 genomes commonly contain large redunda nt regions (repeats).

 regions where the statistical distribu tion of bases is significantly biased (lowcomplexity DNA)

Problems

(17)

new genomes from that spe cies or closely related s pecies are generally not assembled de novo.

Using the reference genom e as a template.

After the first complete

(18)

Modeling the evolution of genot ype

李宗燁

(19)

 Alignment and Assembly

 Phylogenetic analysis

 Evolutionary relationships between DNA

Modeling the Evolution of Genot

ype

(20)

 Genomes are compared by alignment

 Large scale

Indicate changes in segment order and copy number

 Small scale

Indicate specific base substitutions

Alignment and Assembly

(21)

Alignment

Alignment and Assembly

(22)

 Assembly

Primary challenge

Distinguish spurious sequence similaritie s from those due to common ancestry

Alignment and Assembly

(23)

Regions of genomes

Subject to purifying selection

Similarity of sequence is conserved

Orthologous protein-coding regions

Reliably aligned across great evolutionary distance s

Between vertebrates and invertebrates

Alignment and Assembly

(24)

 Regions of genomes

Therefore, common to distinguish alignments of subregions

Local alignment

Used between conserved functional regions of mor e distantly related genomes

Full genome alighnments

Alignment and Assembly

(25)

 Applied to

more than two species

or to multiple gene copies within a species

 NP-hard.

 Considerable effort has been devoted

Phylogenetic analysis

(26)

 Complicated by homologous recombinatio n

Creates DNA molecules whose parts have diff erent evolutionary histories

Phylogenetic analysis

(27)
(28)

 Balanced structural rearrangements

Change the order and the orientation of the bases in the genome

substitutions

 Segmental duplications/gains/losses

Alter the number of copies of homologous bases

Short indels

Evolutionary relationships between DNA

(29)

 Construction of a mathematically and a lgorithmically tractable unified theor y

remains a major challenge for the field.

Evolutionary relationships between DNA

(30)

FROM GENOTYPE TO

PHENOTYPE

(31)

Gregor Mendel

Austrian Monk who experimented with pea plants

He noticed that not all peas are the same:

Green vs. yellow

Tall vs. short

Round vs. wrinkled

He discovered that crossing peas depended on the genes of the plant rather than only the outward

appearance of the plant

(32)

Phenotype vs. Genotype

 Phenotype: the physic al appearance of a pl ant or animal because of its genetic makeup (genotype)

 Genotype: genetic con stitution (makeup) of

www.ansi.okstate.edu/breeds/s wine/

(33)

The Punnett Square

 A way for determining the genotype and phenotype of offspring

 Capital letters are assigned to domina nt genes and lower-case letters are as signed to recessive genes

(34)

Using the Punnet Square

T T

Purebred (homozygous) dominant – the genes only have the dominant trait in its code.

Example – Dominant Tall -- TT

Purebred (homozygous) recessive – the genes only have the recessive trait in its code.

Example – Recessive short – tt

Hybrid (heterozygous) – the genes are

(35)

Massive increase in Sequencing Sp

eed

(36)
(37)
(38)
(39)

New Methods of Exploring

 Cross Species

History and dive rsity of life

Climate, competi tor, disease

(40)

More Studies will be Derived From E xperimental Data

(41)

Single Specie

Human Genome Studies

Number of

Genes Single Multiple

Frequency of genetic defects

Rare (<

1%) Common (>

1%)

(42)

Association studies are critical to the study of complex diseases

Association

Tag, or genotype, SNPs on the basis of Linkage Disequilibrium patterns.

Select tags to provide as much information about surrounding region based on association with

untagged SNPs.

(43)

Genome-Wide Association (GWA) addresses

some of these issues.

(44)

GWA has multiple advantages

Discovery

Studies not limited to current biological knowledge

Quantitative

Better characterize complex, quantitative traits

(45)

GWA has multiple advantages

Discovery

Studies not limited to current biological knowledge Coronary Heart Disease (CHD)

Type 2 Diabetes

Recent GWA studies discovered:

Associated regions containing no annotated genes

(46)

GWA has multiple advantages

Cardiac Arrhythm

ias

Quantitative

Better characterize complex, quantitative traits

Identification of polymorphism accounting for variance of quantitative trait

(47)

Going forward

Association Studies

Cannot provide unambiguous identification of causal genes

But

can highlight pathways and mechanisms of particular interest.

(48)

Leading to systems-level understand ing of genetics and disease

(49)

And Better Medicine!

(50)

Databases

• ENCODE

• Epigenetics roadmap

• modENCODE

• EMSEMBL

• UCSC Gene Browser

(51)

Epigenetics, RNA, Protein

(52)

Epigenetics, RNA, Protein

 Can’t be directly measured

 Inferred by mathematical model

Markov Models

Factor Graphs

Bayesian Networks

Markov Random Fields

(53)

Classification and Regression mod el

 Classification of Epigenetic, Transcripit ional, Proteomic state to predict phenoty pe to genotype

(54)

MDS

(55)

Clustering analysis

(56)

Cox Regression

(57)

Looking ahead to applicatio n

田耕豪 廖奎達

(58)

 Medicine a. Cancer b. Vaccine c. Stem cell

 Agriculture

Applications

(59)

 Cancer

Genomic modifications are the source o f nearly all cancers.

Applications

(60)

Applications

(61)

 Acute Myeloid Leukemia ( 急性骨髓性白血病 )

 骨髓性造血芽細胞異常

增殖的血液惡性腫瘤

Applications

(62)

Applications

(63)

 Coding mutations identified in eight p rimary tumor–relapse pairs

Applications

(64)

Applications

(65)

Applications

 High-throughput genomics data

 Vaccine design

 Treatment of disease

 Infectious disease

 Autoimmune diseases

(66)

Applications

Vaccine

(67)

Applications

 Vaccine

 Every year in February…

(68)

Application

 Stem-cell

 Genomic variants

 Epigenetic state

 Expression pattern

 Induced pluripotent stem (iPS) cel

ls and lineage-specific directly r

(69)

Application

(70)

Application

 Next step…

 Integrating advances of different res earch fields

 Combining above into mathematical mod els

 Build comprehensive and computable mo dels

 So we can…

(71)

Thank You for

Your Attention!!!!!!

參考文獻

相關文件

[This function is named after the electrical engineer Oliver Heaviside (1850–1925) and can be used to describe an electric current that is switched on at time t = 0.] Its graph

• However, these studies did not capture the full scope of human expansion, which may be due to the models not allowing for a recent acceleration in growth

Miroslav Fiedler, Praha, Algebraic connectivity of graphs, Czechoslovak Mathematical Journal 23 (98) 1973,

In gender wisdom, when facing female disciples, most of the male Zen masters emphasized “regardless of the appearance of man and woman.” Qi-Yuan never emphasized this, because

In these lessons, students will evaluate the impacts of genetic engineering on our daily life, and analyze the moral issues raised in its development, especially those related

Experiment a little with the Hello program. It will say that it has no clue what you mean by ouch. The exact wording of the error message is dependent on the compiler, but it might

A multi-objective genetic algorithm is proposed to solve 3D differentiated WSN deployment problems with the objectives of the coverage of sensors, satisfaction of detection

Moreover, this chapter also presents the basic of the Taguchi method, artificial neural network, genetic algorithm, particle swarm optimization, soft computing and