• 沒有找到結果。

IGDB.NSCLC: integrated genomic database of non-small cell lung cancer

N/A
N/A
Protected

Academic year: 2021

Share "IGDB.NSCLC: integrated genomic database of non-small cell lung cancer"

Copied!
6
0
0

加載中.... (立即查看全文)

全文

(1)

IGDB.NSCLC: integrated genomic database of

non-small cell lung cancer

Sen Kao

1

, Cheng-Kai Shiau

2

, De-Leung Gu

3

, Chun-Ming Ho

4,5

, Wen-Hui Su

6

,

Chian-Feng Chen

7

, Chi-Hung Lin

3,7

and Yuh-Shan Jou

1,2,

*

1

Graduate Institute of Life Sciences, National Defense Medical Center, 2Institute of Biomedical Sciences,

Academia Sinica,3Institute of Microbiology and Immunology, School of Life Science, National Yang-Ming

University, Taipei, 4Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu,

5

Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia

Sinica, Taipei, 6Department of Biomedical Sciences, Graduate Institute of Biomedical Sciences, Chang Gung

Molecular Medicine Research Center, Chang Gung University, Taoyuan and7VYM Genome Research Center,

National Yang-Ming University, Taipei, Taiwan

Received August 15, 2011; Revised October 3, 2011; Accepted November 15, 2011

ABSTRACT

Lung cancer is the most common cause of cancer-related mortality with more than 1.4 million deaths per year worldwide. To search for significant somatic alterations in lung cancer, we analyzed, integrated and manually curated various data sets and literatures to present an integrated genomic

database of non-small cell lung cancer

(IGDB.NSCLC, http://igdb.nsclc.ibms.sinica.edu.tw). We collected data sets derived from hundreds of

human NSCLC (lung adenocarcinomas and/or

squamous cell carcinomas) to illustrate genomic

alterations [chromosomal regions with copy

number alterations (CNAs), gain/loss and loss of heterozygosity], aberrant expressed genes and microRNAs, somatic mutations and experimental evidence and clinical information of alterations retrieved from literatures. IGDB.NSCLC provides user friendly interfaces and searching functions to

display multiple layers of evidence especially

emphasizing on concordant alterations of CNAs with co-localized altered gene expression, aberrant microRNAs expression, somatic mutations or genes with associated clinicopathological features. These significant concordant alterations in NSCLC are graphically or tabularly presented to facilitate and prioritize as the putative cancer targets for patho-logical and mechanistic studies of lung tumorigen-esis and for developing new strategies in clinical interventions.

INTRODUCTION

Cancer is the leading killer of human beings with more than 7.4 million deaths worldwide per year. Among them, lung cancer is the most common cause of cancer-related mortality in both men and women with over 1.4 million

deaths annually (1). Lung cancer can be divided into two

categories: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). Approximately 85% of total lung cancers are NSCLC that can be further classified into two major subtypes: squamous cell carcinoma (SCC) and adenocarcinoma (AD). Recent remarkable advances in identification of genetic alterations including p53, KRAS, EGFR, HER2, c-MET, LKB1, PIK3CA, BRAF and

EML4-ALKnot only provided mechanistic understanding

of tumor growth and metastatic advantages but also

served as targets for diagnosis and therapy (2–12).

Moreover, a recent report suggested that high risk lung cancer individuals aged 55–74 years with heavy smoking history is associated with a 20% reduction in lung cancer mortality when computed tomography (CT) lung cancer screening is conducted in comparison with chest X-ray screening (http://www.cancer.gov/images/dsmb-nlst.pdf). Even with these recent advances, early screening of lung cancer is still inconsistent and the overall 5-year survival

rate of lung cancer remains around 15% (13–15).

Therefore, discovery of altered genes for early detection and as therapeutic targets is urgently needed to prolong the life of lung cancer patients.

Similar to many other cancers, lung cancer is a complex and heterogeneous genetic disease resulting from the ac-cumulation of genetic and epigenetic alterations during multiple steps of tumor progression and metastasis

(16,17). Depending on tumor subtype, ethnicity, gender

*To whom correspondence should be addressed. Tel: +886 02 26523521; Fax: +886 02 27827654; Email: [email protected] ß The Author(s) 2011. Published by Oxford University Press.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

(2)

and exposure of carcinogens (e.g. smoking, radon gas, asbestos and cooking oil fumes), accumulated alterations exploit different tumorigenic mechanisms resulting in aberrant activation of oncogenic signaling pathways and

uncontrolled tumor growth and metastasis (18–20). To

identify common cancer-associated alterations, applica-tion of high-throughput genomic technologies including DNA sequencing and high density microarrays allowed investigators to generate comprehensive genome-wide data sets of somatic mutations, copy number alterations

(CNAs), transcriptomics and altered microRNA

(miRNAs) expression of lung cancer genomes (21–33).

In combination with clinical features, investigators were able to independently identified profiles from these genomic alterations for categorization of lung cancer subtypes, identification of cancer genes for tumorigenic studies, diagnostic and prognostic prediction of patient outcomes and development of various strategies to improve patient care.

To facilitate identification of common alterations that might embrace oncogenic or tumor suppressive genes in NSCLC, we curated numerous genomic data sets and established the integrated database IGDB.NSCLC. We graphically displayed multiple dimensional data of lung cancer somatic alterations including somatic mutations, CNAs, aberrant expression of genes and miRNAs, chromosomal gain and loss regions, frequent loss of het-erozygosity (LOH) regions and experimental supports of alterations retrieved from lung cancer literatures. Altered gene expression in association with clinicopathological features and patient outcomes is also included for develop-ing biomarkers in clinical managements. Although lung cancer is the most common cause of cancer-related mor-tality with rapid expansion of cancer genomic studies, limited efforts were made to integrate these genomic resources for a coherent and user-friendly presentation. A database HLungDB was reported as a helpful resource of human lung cancer research with integrated and networking analysis of lung cancer-related genes, proteins and miRNAs extracted manually from literatures

(34). However, IGDB.NSCLC provides not only lung

cancer genes and miRNAs from published reports but also analyzed various lung cancer genomic data sets for

simultaneous illustrations of somatic alterations at

genome, RNA, protein, function and application levels. The simultaneous illustrations provide multiple layers of evidence for investigators to detect and prioritize the common altered genes and miRNAs accessible to the sci-entific and medical communities. Moreover, we specific-ally emphasized on concordant alterations in altered NSCLC cancer genomes such as (i) up-regulated genes and miRNAs encoded in regions of genomic amplifica-tion; (ii) down-regulated genes and miRNAs encoded in regions of genomic deletion; (iii) mutation genes with concordant genomic alterations; and (iv) genes with sig-nificant association with clinical information encoded in regions of genomic alterations. These altered genes defined as significant genes and miRNAs in NSCLC were graph-ically and tabularly displayed in IGDB.NSCLC and should be prioritized to develop as useful targets for improvement of patient management.

DATABASE CONSTRUCTION

For integration of various genomic resources of NSCLC, we collected genomic data generated from primary tissue

samples of lung AD and SCC separately (Supplementary

Table S1). For altered gene expression performed in microarrays, we collected 15 data sets from 1099 AD, 295 SCC and 189 normal tissue samples from five different microarray platforms. An expression profiling data set (GSE14814) of 90 NSCLC samples with and without cis-platin/vinorelbine treatment was included for revealing

potential prognostic markers in NSCLC (35). We also

included a reported pair data set of 193 lung AD samples with genomic alteration performed in 44 K

Agilent CGH array and expression alteration in

Affymetrix U133A and U133 2.0 arrays for concordant

alterations from the same samples (36).

For processing Affymetrix gene expression data sets, the raw data (CEL files) were normalized by MAS 5.0 (an affy

package from Bioconductor/R at http://www

.bioconductor.org/packages/release/bioc/html/affy.html)

(37). Quality control (QC) results were performed by the

simpleaffy and affyQCReport packages (http://www .bioconductor.org/packages/release/bioc/html/

affyQCReport.html and http://www.bioconductor.org/

packages/release/bioc/html/simpleaffy.html) with three

stringent QC criteria for inclusion of a microarray data: (i) the scaling factor of a given sample should be within two standard deviations of mean of the same array platform, (ii) the present calls of a given sample should

be >25% and (iii) the 30/50 GAPDH ratios of a given

sample should be <3 (38). After these QC process, 46

samples were eliminated (HG_U95A: 22 samples,

HG-U133A: 2 samples and HG-U133_Plus_2: 22

samples). The data sets from the same platform were normalized using the normalize.quantiles in R. Finally, all probeset intensity value was transformed to log 2 value. For data performed in two color microarrays (Agilent and Stanford data sets), expression profiles were downloaded form GEO website directly. The log 10 ratio of the Agilent data set was transformed to log 2 ratio. The differential expression genes for each platform were ranked on moderated t-statistics and selected with

P <0.01 under the Bayesian adjusted t-statistics from the

linear models for microarray data (limma) package (39).

For detection of CNAs, we downloaded three data sets of 191 AD, 117 SCC and 271 normal tissue samples per-formed in Affymetrix GeneChip single nucleotide poly-morphism (SNP) arrays and analyzed by dChip software

(40). In brief, CEL format data are normalized using

invariant set normalization algorithms and then

normalized-within-chip intensity data are generated

based on the reference data set of 50 normal individuals genotyped in the same platform. Based on these signal values, the raw copy number for an SNP in a sample is computed as: [log 2(intensity of SNP/mean of intensity of reference  2)]. A window size of three SNPs is then applied for median smoothing method and to infer raw

copy number (ICN) (41). We defined the amplified regions

with ICN more than three and deleted regions with ICN

less than one (42). For altered expression of miRNAs,

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

(3)

two data sets of 193 AD and 137 SCC tissue samples were collected and the miRNAs expression was analyzed by calculating the mean (±standard deviation) of the log 2 ratios of tumor/non-tumor adjacent tissues. For analysis of arrayed comparative genome hybridization (aCGH) data, we obtained two data sets of 70 SCC tissue samples and analyzed them using the cghMCR package (http://www.bioconductor.org/packages/2.4/bioc/html/ cghMCR.html). The gain and loss regions were calculated based on the SGOL (Segment Gain Or Loss) scores by calculating the summations of all the positive values over a threshold of copy number (CN) >3 and all the negative values below a threshold of CN < 1.2, respect-ively. In addition, we also integrated at least 1112 somatic mutation genes of NSCLC from COSMIC and literatures, 214 lung cancer genes with experimental evidence, 131 genes with association with clinico-pathological features of NSCLC and other genomic alter-ations such as LOH and minimum region of alteralter-ations from literatures and our unpublished data.

RESULTS

Multiple levels of somatic alterations in NSCLC

To facilitate identification of cancer-associated somatic altered genes, we aimed to provide lines of evidence for investigators to prioritize these genes for mechanistic

studies and clinical applications in NSCLC. The

integrated framework of IGDB.NSCLC is constructed based on the physical map of human genome sequence from Ensembl with various integrated somatic alterations

alongside (Figure 1). Users could apply various searching

terms (gene, marker, cytogentic location and other key words) for displaying the altered data in lung AD, SCC and/or combined NSCLC. Three major illustrations including chromosome view, gene view and miRNA view are provided for simultaneous displays of concordant alterations in the same interface. In the chromosome view,

the interface demonstrated the comprehensive and

integrated alterations in relation to NSCLC genes and regions including evidence of experimental support, somatic mutation, altered gene expression, CNAs, LOH and other chromosomal alterations. We provided ERBB2, EGFR and MET as tutorial examples (please see our quick examples of chromosome view). In gene view, we presented the details of aberrant information of the selected gene including (i) mutation frequency and details of mutation changes; (ii) frequency of altered gene expression supported by experimental evidence such as immunohistochemistry (IHC), RT–PCR or western blot analysis and in association with clinical information; (iii) the fold changes and the statistic significance of altered expression of each probe set in a selected gene from a given microarray platform; (iv) the distribution plots of the altered expressing gene in a set of multiple

NSCLC tissues shown in log 2 ratios of tumor/

non-tumor adjacent tissues; and (v) the publication status of the aberrant gene in various cancers provided in the cancer gene index and the literature (see our quick examples of gene view). In miRNA view, we also provided

details of the aberrant expression of a miRNA as similar to that of gene view in addition to external links for

predicting miRNA target genes in MICROCOSM (43),

TARGETSCAN (44), PICTAR-VERT (45) and

miRTarBase (46). We provided three miRNAs including

hsa-let-7e, hsa-mir-17 and hsa-mir-31 as tutorial examples (see our quick examples of miRNA view).

Concordant somatic alterations in NSCLC

To further reveal significant cancer-associated genes and miRNAs involved in tumor progression of NSCLC, we applied CNA (amplification: Inferred copy number ICN >3 and deletion: ICN <1) as the framework and searched for concordant existence of (i) genes with altered expres-sion, (ii) miRNA with altered expresexpres-sion, (iii) genes with common somatic mutations and (iv) genes associated with clinicopathological features from literatures. For genes with altered expression existing in regions of CNA, we found 27 up-regulated genes located in amplified regions and seven down-regulated genes residing in deleted regions in AD; and 23 up-regulated genes in amplified regions and 13 down-regulated genes in deleted regions

in SCC (Figure 2 and Supplementary Table S2). Some

of these genes were previously identified as altered expres-sion genes in NSCLC for validation but majority of these genes remain unknown in the tumorigenesis of NSCLC. For miRNAs with altered expression residing in CNA regions, we identified 21 up-regulated miRNAs located in the amplified regions and 19 down-regulated miRNAs in the deleted regions in AD, and 19 up-regulated miRNAs in the amplified regions and 20 down-regulated

miRNAs in the deleted regions in SCC (Supplementary

Figure S1 and Supplementary Table S3). Similarly, we found that some of these aberrant miRNAs were validated by previous reports but majority of them are novel signifi-cant miRNAs in the tumor formation of NSCLC.

We also examined the concordance of genes with common somatic mutations (at least mutations in five cases) that resides in the CNA regions of NSCLC. A total of 1112 genes with somatic mutations including 386 in AD and 55 in SCC were downloaded from COSMIC

database (47). Majority of these somatic mutations

occurred only once in NSCLC tissues except 43 genes in AD and seven genes in SCC conferred at least five inde-pendent somatic mutations. Interestingly, majority of these common mutated genes including 33/43 (76.7%) in AD and 6/7 (85.7%) in SCC are co-localized with CNA

regions in NSCLC (Table 1). Our results further

demonstrated that genes with frequent somatic mutations are commonly affiliated with chromosomal alterations involved in the tumorigenesis of NSCLC.

To further examine the link between clinicopathological features with somatic alterations, we collected 309 papers with 125 genes in AD and 105 genes in SCC validated by experimental evidence such as IHC, RT–PCR and western analysis for confirmation of altered expression in NSCLC. Among them, 62 (62/125, 49.6%) genes in AD and 54 (54/105, 51.4%) genes in SCC extracted from 159 papers were shown to associate with 22 clinicopathological features and to be located in the CNA regions

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

(4)

Figure 2. The graphic integration of CNAs with altered expression genes in lung AD and SCC. The red lines represent the amplification regions for CNA and up-regulation genes. The green lines stand for the deletion regions for CNA and down-regulated genes. The lines are relatively corresponding to the physical position along the chromosomes.

Figure 1. The framework of IGDB.NSCLC.

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

(5)

(Supplementary Table S4). Our results suggested that

NSCLC genes associated with clinicopathological

features commonly (50%) reside in the CNA regions, reflecting the genetic effects participated in pathological changes of tumor formation in NSCLC.

DISCUSSION

As far as we know, IGDB.NSCLC is the largest integra-tion of lung cancer genomic resources providing multiple levels of evidence to search for the concordantly altered targets and to prioritize putative NSCLC genes for future studies. In addition to the database with various searching options and user-friendly interfaces, we provided concord-ant somatic alterations based on the genome-wide CNAs data with co-localization of altered gene expression, aberrant miRNA expression, somatic mutation and genes in association with clinicopathological features. The high concordance of CNA data with these somatic alterations and clinical features in IGDB.NSCLC sug-gested the quality of data integration with experimental validations, the heterogeneity of genetic pathways to NSCLC and the important roles of genomic alterations involved in tumor formation of NSCLC. The future de-velopment of integrating other NSCLC resources into IGDB.NSCLC will focus on increasing the data of clinicopathological features for dissecting the altered genes involved in tumor progression and on somatic alter-ation data from next-generalter-ation sequencing results. In conclusion, IGDB.NSCLC is an invaluable resource for selecting putative cancer genes in NSCLC to better under-stand the heterogeneous tumorigenic mechanisms and for developing useful strategies in clinical applications to prolong the life of lung cancer patients.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online: Supplementary Tables 1–4, Supplementary Figure 1.

ACKNOWLEDGEMENTS

The following authors contributed toward this study: database construction: S.K. and C.K.S.; data analysis and interpretation: S.K., D.L.G., C.M.H. and C.F.C.; drafting of the manuscript and study supervision: W.H.S., C.F.C., C.H.L. and Y.S.J.

FUNDING

National Research Program for Genomic Medicine

(NSC98-3112-B-001-004, NSC98-3112-B-001-031);

National Research Program for Biopharmaceuticals, National Science Council, Taiwan (NSC100-2325-B-001-012). Funding for open access charge: Institute of Biomedical Sciences, Academia Sinica, Taiwan.

Conflict of interest statement. None declared.

REFERENCES

1. WHO. World Health Organization. ‘‘Cancer’’. http://www.who .int/mediacentre/factsheets/fs297/en/. (3 March 2010, date last accessed).

2. Brose,M.S., Volpe,P., Feldman,M., Kumar,M., Rishi,I.,

Gerrero,R., Einhorn,E., Herlyn,M., Minna,J., Nicholson,A. et al. (2002) BRAF and RAS mutations in human lung cancer and melanoma. Cancer Res., 62, 6997–7000.

3. Lynch,T.J., Bell,D.W., Sordella,R., Gurubhagavatula,S., Okimoto,R.A., Brannigan,B.W., Harris,P.L., Haserlat,S.M., Supko,J.G., Haluska,F.G. et al. (2004) Activating mutations in the epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N. Engl. J. Med., 350, 2129–2139.

4. Ma,P.C., Jagadeeswaran,R., Jagadeesh,S., Tretiakova,M.S., Nallasura,V., Fox,E.A., Hansen,M., Schaefer,E., Naoki,K., Lader,A. et al. (2005) Functional expression and mutations of c-Met and its therapeutic inhibition with SU11274 and small interfering RNA in non-small cell lung cancer. Cancer Res., 65, 1479–1488.

5. Naoki,K., Chen,T.H., Richards,W.G., Sugarbaker,D.J. and Meyerson,M. (2002) Missense mutations of the BRAF gene in human lung adenocarcinoma. Cancer Res., 62, 7001–7003. 6. Paez,J.G., Janne,P.A., Lee,J.C., Tracy,S., Greulich,H., Gabriel,S.,

Herman,P., Kaye,F.J., Lindeman,N., Boggon,T.J. et al. (2004) EGFR mutations in lung cancer: correlation with clinical response to gefitinib therapy. Science, 304, 1497–1500. 7. Samuels,Y. and Velculescu,V.E. (2004) Oncogenic mutations of

PIK3CA in human cancers. Cell Cycle, 3, 1221–1224.

8. Samuels,Y., Wang,Z., Bardelli,A., Silliman,N., Ptak,J., Szabo,S., Yan,H., Gazdar,A., Powell,S.M., Riggins,G.J. et al. (2004) High frequency of mutations of the PIK3CA gene in human cancers. Science, 304, 554.

9. Sanchez-Cespedes,M., Parrella,P., Esteller,M., Nomoto,S., Trink,B., Engles,J.M., Westra,W.H., Herman,J.G. and Sidransky,D. (2002) Inactivation of LKB1/STK11 is a common event in adenocarcinomas of the lung. Cancer Res., 62, 3659–3662.

10. Santos,E., Martin-Zanca,D., Reddy,E.P., Pierotti,M.A., Della Porta,G. and Barbacid,M. (1984) Malignant activation of a K-ras oncogene in lung carcinoma but not in normal tissue of the same patient. Science, 223, 661–664.

11. Stephens,P., Hunter,C., Bignell,G., Edkins,S., Davies,H., Teague,J., Stevens,C., O’Meara,S., Smith,R., Parker,A. et al. Table 1. Common mutated genes in copy number alteration regions of NSCLC

Non-small cell lung carcinoma

Alteration Genes with at least five somatic mutations

AD Mutated genes in amplified regions (30 genes) APC, BRAF, CDC42BPA, CDKN2A, CTNNB1, EGFR, EPHA3, EPHA5,

EPHA7, EPHB6, FGFR4, FLT1, INSRR, JAK2, KDR, KIAA1804, KRAS, LMTK2, MET, NRAS, NTRK1, PAK7, PDGFRA, PIK3C3, PIK3CA, PIK3CG, PRKDC, RB1, ROBO2, TERT

Mutated genes in deleted regions NOTCH1, PTEN, TP53

SCC Mutated genes in amplified regions BRAF, CDKN2A, EGFR, KRAS, PIK3CA

Mutated genes in deleted regions TP53

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

(6)

(2004) Lung cancer: intragenic ERBB2 kinase mutations in tumours. Nature, 431, 525–526.

12. Takahashi,T., Nau,M.M., Chiba,I., Birrer,M.J., Rosenberg,R.K., Vinocour,M., Levitt,M., Pass,H., Gazdar,A.F. and Minna,J.D. (1989) p53: a frequent target for genetic abnormalities in lung cancer. Science, 246, 491–494.

13. Bach,P.B. (2011) Inconsistencies in findings from the early lung cancer action project studies of lung cancer screening. J. Natl. Cancer Inst., 103, 1002–1006.

14. Van’t Westeinde,S.C. and van Klaveren,R.J. (2011) Screening and early detection of lung cancer. Cancer J., 17, 3–10.

15. Jemal,A., Siegel,R., Ward,E., Hao,Y., Xu,J. and Thun,M.J. (2009) Cancer statistics, 2009. CA Cancer J. Clin., 59, 225–249. 16. Agullo-Ortuno,M.T., Lopez-Rios,F. and Paz-Ares,L. (2010)

Lung cancer genomic signatures. J. Thorac. Oncol., 5, 1673–1691. 17. Risch,A. and Plass,C. (2008) Lung cancer epigenetics and

genetics. Int. J. Cancer, 123, 1–7.

18. Chaffer,C.L. and Weinberg,R.A. (2011) A perspective on cancer cell metastasis. Science, 331, 1559–1564.

19. Hanahan,D. and Weinberg,R.A. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646–674.

20. Sun,S., Schiller,J.H. and Gazdar,A.F. (2007) Lung cancer in never smokers—a different disease. Nat. Rev. Cancer, 7, 778–790. 21. Puissegur,M.P., Mazure,N.M., Bertero,T., Pradelli,L., Grosso,S.,

Robbe-Sermesant,K., Maurin,T., Lebrigand,K., Cardinaud,B., Hofman,V. et al. (2011) miR-210 is overexpressed in late stages of lung cancer and mediates mitochondrial alterations associated with modulation of HIF-1 activity. Cell Death Differ., 18, 465–478.

22. Landi,M.T., Zhao,Y., Rotunno,M., Koshiol,J., Liu,H., Bergen,A.W., Rubagotti,M., Goldstein,A.M., Linnoila,I., Marincola,F.M. et al. (2010) MicroRNA expression differentiates histology and predicts survival of lung cancer. Clin. Cancer Res., 16, 430–441.

23. Bass,A.J., Watanabe,H., Mermel,C.H., Yu,S., Perner,S., Verhaak,R.G., Kim,S.Y., Wardwell,L., Tamayo,P., Gat-Viks,I. et al. (2009) SOX2 is an amplified lineage-survival oncogene in lung and esophageal squamous cell carcinomas. Nat. Genet., 41, 1238–1242.

24. Boelens,M.C., Kok,K., van der Vlies,P., van der Vries,G., Sietsma,H., Timens,W., Postma,D.S., Groen,H.J. and van den Berg,A. (2009) Genomic aberrations in squamous cell lung carcinoma related to lymph node or distant metastasis. Lung Cancer, 66, 372–378.

25. Broet,P., Camilleri-Broet,S., Zhang,S., Alifano,M., Bangarusamy,D., Battistella,M., Wu,Y., Tuefferd,M.,

Regnard,J.F., Lim,E. et al. (2009) Prediction of clinical outcome in multiple lung cancer cohorts by integrative genomics: implications for chemotherapy selection. Cancer Res., 69, 1055–1062.

26. Ding,L., Getz,G., Wheeler,D.A., Mardis,E.R., McLellan,M.D., Cibulskis,K., Sougnez,C., Greulich,H., Muzny,D.M.,

Morgan,M.B. et al. (2008) Somatic mutations affect key pathways in lung adenocarcinoma. Nature, 455, 1069–1075.

27. Shedden,K., Taylor,J.M., Enkemann,S.A., Tsao,M.S.,

Yeatman,T.J., Gerald,W.L., Eschrich,S., Jurisica,I., Giordano,T.J., Misek,D.E. et al. (2008) Gene expression-based survival

prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat. Med., 14, 822–827.

28. Weir,B.A., Woo,M.S., Getz,G., Perner,S., Ding,L., Beroukhim,R., Lin,W.M., Province,M.A., Kraja,A., Johnson,L.A. et al. (2007) Characterizing the cancer genome in lung adenocarcinoma. Nature, 450, 893–898.

29. Raponi,M., Zhang,Y., Yu,J., Chen,G., Lee,G., Taylor,J.M., Macdonald,J., Thomas,D., Moskaluk,C., Wang,Y. et al. (2006) Gene expression signatures for predicting prognosis of squamous cell and adenocarcinomas of the lung. Cancer Res., 66,

7466–7472.

30. Takeuchi,T., Tomida,S., Yatabe,Y., Kosaka,T., Osada,H., Yanagisawa,K., Mitsudomi,T. and Takahashi,T. (2006)

Expression profile-defined classification of lung adenocarcinoma shows close relationship with underlying major genetic changes and clinicopathologic behaviors. J. Clin. Oncol., 24, 1679–1688. 31. Tseng,R.C., Chang,J.W., Hsien,F.J., Chang,Y.H., Hsiao,C.F.,

Chen,J.T., Chen,C.Y., Jou,Y.S. and Wang,Y.C. (2005)

Genomewide loss of heterozygosity and its clinical associations in non small cell lung cancer. Int. J. Cancer, 117, 241–247. 32. Garber,M.E., Troyanskaya,O.G., Schluens,K., Petersen,S.,

Thaesler,Z., Pacyna-Gengelbach,M., van de Rijn,M., Rosen,G.D., Perou,C.M., Whyte,R.I. et al. (2001) Diversity of gene expression in adenocarcinoma of the lung. Proc. Natl. Acad. Sci. USA, 98, 13784–13789.

33. Bhattacharjee,A., Richards,W.G., Staunton,J., Li,C., Monti,S., Vasa,P., Ladd,C., Beheshti,J., Bueno,R., Gillette,M. et al. (2001) Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses. Proc. Natl. Acad. Sci. USA, 98, 13790–13795.

34. Wang,L., Xiong,Y., Sun,Y., Fang,Z., Li,L., Ji,H. and Shi,T. (2010) HLungDB: an integrated database of human lung cancer research. Nucleic Acids Res., 38, D665–D669.

35. Zhu,C.Q., Ding,K., Strumpf,D., Weir,B.A., Meyerson,M., Pennell,N., Thomas,R.K., Naoki,K., Ladd-Acosta,C., Liu,N. et al. (2010) Prognostic and predictive gene signature for adjuvant chemotherapy in resected non-small-cell lung cancer. J. Clin. Oncol., 28, 4417–4424.

36. Chitale,D., Gong,Y., Taylor,B.S., Broderick,S., Brennan,C., Somwar,R., Golas,B., Wang,L., Motoi,N., Szoke,J. et al. (2009) An integrated genomic analysis of lung cancer reveals loss of DUSP4 in EGFR-mutant tumors. Oncogene, 28, 2773–2783. 37. Gautier,L., Cope,L., Bolstad,B.M. and Irizarry,R.A. (2004)

affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics, 20, 307–315.

38. Tumor Analysis Best Practices Working Group (2004) Expression profiling—best practices for data generation and interpretation in clinical trials. Nat. Rev. Genet, 5, 229–237.

39. Wettenhall,J.M. and Smyth,G.K. (2004) limmaGUI: a graphical user interface for linear modeling of microarray data.

Bioinformatics, 20, 3705–3706.

40. Li,C. and Wong,W.H. (2001) Model-based analysis of

oligonucleotide arrays: expression index computation and outlier detection. Proc. Natl. Acad. Sci. USA, 98, 31–36.

41. Shiau,C.K., Gu,D.L., Chen,C.F., Lin,C.H. and Jou,Y.S. (2011) IGRhCellID: integrated genomic resources of human cell lines for identification. Nucleic Acids Res., 39, D520–D524.

42. Chen,C.F., Hsu,E.C., Lin,K.T., Tu,P.H., Chang,H.W., Lin,C.H., Chen,Y.J., Gu,D.L., Wu,J.Y., Chen,Y.T. et al. (2010)

Overlapping high-resolution copy number alterations in cancer genomes identified putative cancer genes in hepatocellular carcinoma. Hepatology, 52, 1690–1701.

43. Griffiths-Jones,S., Saini,H.K., van Dongen,S. and Enright,A.J. (2008) miRBase: tools for microRNA genomics. Nucleic Acids Res., 36, D154–D158.

44. Lewis,B.P., Burge,C.B. and Bartel,D.P. (2005) Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell, 120, 15–20.

45. Krek,A., Grun,D., Poy,M.N., Wolf,R., Rosenberg,L., Epstein,E.J., MacMenamin,P., da Piedade,I., Gunsalus,K.C., Stoffel,M. et al. (2005) Combinatorial microRNA target predictions. Nat. Genet., 37, 495–500.

46. Sethupathy,P., Corda,B. and Hatzigeorgiou,A.G. (2006) TarBase: A comprehensive database of experimentally supported animal microRNA targets. RNA, 12, 192–197.

47. Forbes,S.A., Tang,G., Bindal,N., Bamford,S., Dawson,E., Cole,C., Kok,C.Y., Jia,M., Ewing,R., Menzies,A. et al. (2010) COSMIC (the Catalogue of Somatic Mutations in Cancer): a resource to investigate acquired mutations in human cancer. Nucleic Acids Res., 38, D652–D657.

at National Chiao Tung University Library on April 28, 2014

http://nar.oxfordjournals.org/

數據

Figure 2. The graphic integration of CNAs with altered expression genes in lung AD and SCC

參考文獻

相關文件

Good Data Structure Needs Proper Accessing Algorithms: get, insert. rule of thumb for speed: often-get

• To enhance teachers’ knowledge and understanding about the learning and teaching of grammar in context through the use of various e-learning resources in the primary

patients with stage I/II disease but not in those with stage III disease.43 A high serum level of VEGF is associated with poor survival among patients with small cell lung

Background: Gold standard for the diagnosis of oral dysplasia (OD) oral squamous cell carcinoma (OSCC) and malignant lesions is the histological examination.. Several

Our case highlights an enigmatic presentation of oral submucous fibrosis and its coexistence with oral cancer presenting with unusual neurological disturbance of the inferior

Less than 1% of all breast cancers occur in male patients, and to date, only 8 cases of metastatic breast adeno- carcinoma to the oral and maxillofacial region in a male patient

This study showed that pleomorphic adenoma and recurrent pleomorphic adenoma exhibit few somatic copy number alterations and that they show similar patterns of genomic profiles by

Distant Metastases Staging (M0) and Detection of Synchronous