• 沒有找到結果。

Large-Scale Investigation of Human TF-miRNA Relations Based on Coexpression Profiles

N/A
N/A
Protected

Academic year: 2021

Share "Large-Scale Investigation of Human TF-miRNA Relations Based on Coexpression Profiles"

Copied!
9
0
0

加載中.... (立即查看全文)

全文

(1)

Research Article

Large-Scale Investigation of Human TF-miRNA Relations

Based on Coexpression Profiles

Chia-Hung Chien,

1

Yi-Fan Chiang-Hsieh,

1

Ann-Ping Tsou,

2

Shun-Long Weng,

3,4,5

Wen-Chi Chang,

1

and Hsien-Da Huang

6,7,8,9

1Institute of Tropical Plant Sciences, National Cheng Kung University, Tainan 701, Taiwan

2Department of Biotechnology and Laboratory Science in Medicine, National Yang-Ming University, Taipei 112, Taiwan 3Mackay Medicine, Nursing and Management College, Taipei 112, Taiwan

4Department of Medicine, Mackay Medical College, New Taipei City 252, Taiwan

5Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsinchu 300, Taiwan 6Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan 7Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 300, Taiwan

8Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung 807, Taiwan 9Institute of Biomedical Engineering, National Chiao Tung University, Hsinchu 300, Taiwan

Correspondence should be addressed to Wen-Chi Chang; [email protected] and Hsien-Da Huang; [email protected]

Received 11 March 2014; Revised 2 May 2014; Accepted 18 May 2014; Published 9 June 2014 Academic Editor: Tzong-Yi Lee

Copyright © 2014 Chia-Hung Chien et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Noncoding, endogenous microRNAs (miRNAs) are fairly well known for regulating gene expression rather than protein coding. Dysregulation of miRNA gene, either upregulated or downregulated, may lead to severe diseases or oncogenesis, especially when the miRNA disorder involves significant bioreactions or pathways. Thus, how miRNA genes are transcriptionally regulated has been highlighted as well as target recognition in recent years. In this study, a large-scale investigation of novel cis- and trans-elements was undertaken to further determine TF-miRNA regulatory relations, which are necessary to unravel the transcriptional regulation of miRNA genes. Based on miRNA and annotated gene expression profiles, the term “coTFBS” was introduced to detect common transcription factors and the corresponding binding sites within the promoter regions of each miRNA and its coexpressed annotated genes. The computational pipeline was successfully established to filter redundancy due to short sequence motifs for TFBS pattern search. Eventually, we identified more convinced TF-miRNA regulatory relations for 225 human miRNAs. This valuable information is helpful in understanding miRNA functions and provides knowledge to evaluate the therapeutic potential in clinical research. Once most expression profiles of miRNAs in the latest database are completed, TF candidates of more miRNAs can be explored by this filtering approach in the future.

1. Introduction

Among functional noncoding RNAs (ncRNAs), microR-NAs (miRmicroR-NAs) are tiny molecules (∼21–23 nt) with giant roles. miRNAs participate in gene regulation by targeting messenger RNAs (mRNAs) and influencing their stability and the initiation of translation. It is implied that if the expression of miRNA is aberrant, miRNA-mediated gene circuitries will be disordered, resulting in homeostatic

imbal-ance, pathogenesis, and oncogenesis [1, 2]. In recent years,

elucidating transcriptional regulatory mechanisms of miRNA genes has been highlighted when studying miRNA function. Core promoters of miRNA genes were promptly identified for depicting full-length primary transcripts [3–5]. High-throughput sequencing datasets derived from epigenetic sig-nature and TSS-relevant experiments unfold transcriptional start sites (TSSs) of miRNAs and offer a practical strategy to determine miRNA promoters [6–9].

To further understand upstream regulatory elements controlling miRNA expression, transcription factors (TFs) Volume 2014, Article ID 623078, 8 pages

(2)

and their binding sites of miRNA promoters were deciphered by either literature survey or computational prediction [10– 13]. By integrating the information of TF-miRNA regulatory relations and miRNA target interactions, regulatory networks that revolved around miRNAs provide a biological insight into how miRNAs dominate functional processes in bio-chemical reactions or metabolic pathways [14–17]. However, most of the foregoing studies regarded the upstream regions

of pre-miRNAs as promoters (e.g., 10 kb upstream or−900∼

+100 of the 5󸀠-start of the pre-miRNA). More convinced

miRNA promoters should be used for searching putative TF-binding sites. In addition, the experimentally verified TF-miRNA relations are insufficient for current miRNAs. According to the statistics in the latest version 1.2 of

Trans-miR, only 735 entries, which include ∼201 transcriptional

factors,∼209 miRNAs, and 16 organisms from 268

publi-cations, were curated. A large-scale investigation of novel cis- and trans-elements is necessary to fulfill the unmet need for locating transcription factor binding sites (TFBSs) within miRNA promoter regions.

Since sequence-specific TFs possess DNA-binding domains (DBDs) to recognize specific motifs in miRNA promoter sequences, potential binding sites can be detected by sequence-based computational approaches, for example, position weight matrix (PWM). A position weight matrix (also called position specific scoring matrix, PSSM) infers a pattern of DNA segment and is widely applied in searching TFBSs [18–21]. Two well-known databases collecting matrix information are TRANSFAC [22] and JASPAR [23]. To avoid excess false positives due to short sequence motifs used for TFBS pattern search, cis-regulatory analysis of coregulated gene sets was executed to determine overrepresented TFBSs [24, 25]. Although previously published web server Pscan [26] and oPOSSUM [27] can search transcription factor binding sites in coexpressed gene promoters to remit the impact of false positive issue, users have to define their coexpressed gene groups with valid gene IDs. In addition, miRNA promoter sequences are not included in these two web servers.

Therefore, the purpose of this study is to create a computational pipeline to undertake large-scale investigation of novel cis- and trans-elements for human miRNA genes based on coexpression strategy. First, we constructed 255 coexpressed gene groups of human miRNAs. Moreover, instead of grapping the upstream regions of pre-miRNAs as promoter sequences, we exploited more concrete human miRNA promoters by following the processes as previously described [7]. Through the detection of transcription factor binding sites within promoter regions of human miRNAs and their coexpressed genes by using matrix information from TRANSFAC, the common TFBSs were identified. We then filtered the redundancy by not only the occurrence of common TFBSs but also the expression correlation between TF-encoded genes and corresponding human miRNA gene groups. Finally, more reliable TF-miRNA regulatory relations of 225 human miRNAs were provided. Furthermore, the liver-specific hsa-miR-122 was also selected as the case study to demonstrate the usage of this filtering approach and its practicability.

2. Materials and Methods

2.1. Human miRNA Promoter Sequences. The latest genomic coordinates of human miRNAs were retrieved from miRBase release 19 [28]. Human miRNA TSSs were identified by repro-ducing the computational procedures described in miRStart resource [7]. The formula which determines tag-enriched loci was modified by weighted tag density (tag density multiply by tag number) in the SVM step to precisely reveal the effect of tag intensity. Then, 1 kb upstream sequences of human miRNA TSSs were acquired from UCSC Genome Browser [29] (hg19/GRCh37 assembly) in FASTA format. The updated intergenic miRNA TSSs in human genome are listed in supplementary Table S1 (available online at http://dx.doi.org/10.1155/2014/623078) in additional file for reference.

2.2. Expression Profiles of Human miRNAs and Annotated Genes. In order to determine human genes that are coex-pressed with specific miRNA genes, GDS596 record, the Affymetrix gene expression profiles from 79 physiologically human normal tissues, was downloaded from Gene

Expres-sion Omnibus (GEO) [30, 31]. Among annotated human

genes in GDS596, genes were filtered out if their HGNC symbols are invalid or have been withdrawn. On the other hand, the expression data of 345 miRNAs in 40 normal human tissues generated by a new type of real time reverse transcription- (RT-) PCR-based miRNA assays were also collected [32]. Mature miRNAs with eliminated miRBase IDs (release 19) were discarded. In both expression datasets, only 17 tissues (adrenal, brain, heart, kidney, liver, lung, lymph node, ovary, pancreas, placenta, prostate, skeletal muscle, testicle, trachea, thymus, thyroid, and uterus) are in common and were considered to be integrated expression profiles of annotated genes and miRNA genes with 17 conditions. To standardize expression levels across extensive range of both

datasets, the raw intensity was𝑍-score transformed [33].

2.3. Coexpressed Gene Groups of Human miRNAs. After the transformation process between two expression datasets, Pearson’s correlation coefficient (PCC) was calculated to estimate which annotated genes are coexpressed with specific miRNAs. The formula of Pearson’s correlation coefficient

(usually using the letter𝑟) is as follows:

𝑟𝑥𝑦= ∑ 𝑛 𝑖=1(𝑋𝑖− 𝑋) (𝑌𝑖− 𝑌) √∑𝑛 𝑖=1(𝑋𝑖− 𝑋)2√∑𝑛𝑖=1(𝑌𝑖− 𝑌)2 , (1)

where𝑋𝑖and 𝑌𝑖 denote the expression level of𝑖 condition

(tissue) of two genes 𝑥 and 𝑦 that are calculated for 𝑟,

whereas 𝑋 and 𝑌 represent the average of corresponding

expression levels in total 𝑛 conditions. Since the value of

Pearson’s correlation coefficient ranges from +1 (positively

correlated) to−1 (negatively correlated), we defined that two

genes of interest are coexpressed if their Pearson’s correlation coefficient is more than 0.8. Subsequently, coexpressed genes of human miRNAs were determined, and their promoter

(3)

Promoter sequence miRNA TSS Expression profiles Expression profile of TF-encoded gene Coding gene 1 C o exp re ss ed g enes Coding gene 2 Coding gene 3 Coding gene 4 Coding gene 5 Coding gene 6 Coding gene 7 Coding gene 8 Coding gene 9 Coding gene 10 Identifying coexpressed genes of miRNA of interest

Filtering redundancy

TFBS?

miRNA Calculate coTFBS occurrence

coTFBS

Evaluate expression correlation between miRNA and coTFBS

More convinced TF-miRNA relations

Obtain the promoter sequences (upstream1000 bps from TSSs) of miRNA

coexpressed gene group and use Match to detect TFBSs

Match

1 kb

Figure 1: The summary of detecting TF-miR regulatory relations in human genome.

sequences (1 kb upstream from TSS) were obtained from BioMart of Ensembl release 69 [34].

2.4. CoTFBSs in Human miRNA Promoters. In this work, we defined coTFBS as a common TFBS located in promoter regions of coexpressed genes. The TRANSFAC database [22], comprising data on transcription factors, their target genes, and regulatory binding sites, has been widely used when studying eukaryotic transcriptional regulation. After acquiring 1 kb promoter sequences of miRNA genes and their coexpressed genes in FASTA format, the Match program [35] was executed for sequence motif search of transcription factor binding sites according to the matrix information pro-vided by TRANSFAC (version 2011.4). A customized profile which specified human matrices in TRANSFAC library was used for Match to minimize both false positive and false negative rates with core similarity and matrix similarity cut-off values for each matrix. As defined in the publication of Match, the core of each matrix refers to the first five most conserved consecutive positions of a matrix. A putative TFBS with core similarity less than 1 was filtered out. The occurrence of each coTFBS and its expression correlation with miRNA in coexpressed gene groups was calculated finally. Only those whose encoded genes are coexpressed with corresponding miRNA were considered.

3. Results

3.1. Human miRNAs with Coexpressed Gene Group for

Detect-ing TF-miRNA Relations. Figure 1summarizes the workflow

to investigate human TF-miRNA regulatory relations based on expression profiles of miRNA and annotated genes. Pear-son’s correlation coefficient (PCC) was applied to measure the similarity of expression patterns across 17 human normal tissues, which represents the coexpressed level between a specific miRNA gene and an annotated gene of interest.

Due to the reason that Pearson’s𝑟 cannot be calculated if

the expression levels in all 17 conditions are identical, 29 mature miRNAs with such expression profiles were excluded. After discarding the mature miRNAs whose miRBase IDs are eliminated, 289 human miRNAs were selected to estimate PCC values with annotated human genes in GDS596 record. Among them, however, only 255 human miRNAs have

more than one coexpressed genes (PCC > 0.8). Moreover,

25 out of 255 human miRNAs have no identified TSSs by following the previous strategy [7] and have to be filtered. In total, 230 human miRNAs were qualified to discover putative cis- and trans-elements in their promoter regions. The number of members of each miRNA coexpressed gene group ranges from several to thousand (see Table S2 in additional file for details).

3.2. Putative TF-miRNA Relations Were Explored according to the Occurrence of CoTFBS. Theoretically, a group of genes that are coexpressed may be regulated by common transcrip-tion factors. Based on this concept, a specific transcriptranscrip-tion factor binding site located in the promoter regions of most genes in each coexpressed group implies that its correspond-ing TF is the most possible one controllcorrespond-ing the expression of these genes. Here, we introduce the term “coTFBS” which

(4)

Promoter region miRNA gene TSS Coding gene 2 TSS Coding gene 3 TSS Coding gene 4 Coding gene 1 TSS coTFBS TSS

Figure 2: The concept of coTFBS. Protein-coding genes 1, 2, 3, and 4 are coexpressed with the miRNA gene. Colored rectangles represent transcription factor binding motifs in each promoter region. The common TFBS (pink) located within the promoters of miRNA gene and other four genes represents the “coTFBS” of this gene group.

represents the common TFBS of a coexpressed gene group to filter redundant TF candidate of miRNA genes. For example, the “pink rectangle” binding motifs were detected in the proposed miRNA and other four gene promoters in Figure 2. The occurrence of this coTFBS is five. By scanning TFBSs in promoter regions of each miRNA gene and its coexpressed gene group using Match based on TRANSFAC library (version 2011.4) and following the filtering process, the putative TFs that regulate a specific miRNA were determined. In this work, we successfully identified putative TF-miRNA relations of 225 human miRNAs. The full list of putative TFs for each human miRNA promoter can be accessed in Table S3.

3.3. Case Study: hsa-miR-122. hsa-miR-122 is one of the intergenic miRNAs whose TSS and promoter have been experimentally characterized [7]. Previous studies reported that this liver-specific miRNA is significantly downregulated in hepatocellular carcinoma and profoundly affects carcino-genesis [36]. Because of the explicit promoter and biological importance, hsa-miR-122 was selected as the case study to investigate which TFs may regulate its gene expression. 261 annotated genes coexpressed with miR-122 were identified according to their expression levels with Pearson’s correlation

coefficient (PCC) more than 0.8. Figure 3 compares the

expression patterns of 261 coexpressed genes (the pink line represents the average value of their expression) with hsa-miR-122 among 17 human normal tissues, indicating that remarkable peaks appear in liver for all the coexpressed genes. The expression image (see Figure S1 in additional file) also reveals the similar trends between hsa-miR-122 and its coexpressed genes.

Then, the promoter sequences (1 kb upstream from TSS) of hsa-miR-122 gene and 261 coexpressed genes were col-lected to identify coTFBSs using Match. The occurrence of putative transcription factors of miR-122 is listed in Table 1. Among TF candidates regulating hsa-miR-122, the TF binding motif of HNF-4alpha can be found in 191 coexpressed gene promoters. In 2011, Li et al. reported that HNF-4alpha

is a key regulator positively controlling the expression of miR-122 in liver [37], proving our computational finding. They performed not only the luciferase reporter gene assay to detect the trans-activation effect of HNF-4alpha in miR-122 promoter but also the ChIP and EMSA assays to determine HNF-4alpha binding of miR-122 promoter in vitro and in vivo. Moreover, other liver-enriched transcription factors including 1alpha, 3alpha, 3beta, and HNF-6 showed a strong positive correlation with miR-122. The knockout of HNF1A, FOXA1, and FOXA2 by RNAi assay reduces the expression of miR-122, suggesting that these transcription factors may bind to miR-122 promoter and transcriptionally regulate miR-122 [38]. Importantly, HNF-1alpha, HNF-3alpha, and HNF-3beta were identified in our list of TF candidates, and a HNF-6 binding site was determined (−2720 from miR-122 TSS) if the 3 kb promoter sequence of miR-122 was used.

In addition, the TF binding site of NR1B2 can be found in 226 coexpressed gene promoters. A previous research article published in 1987 indicated that the inappropriate expression of HAP gene (the official HGNC symbol is RARB) may relate to the hepatocellular carcinogenesis, and hap protein may directly participate in the hepatocellular transformation [39]. It is implied that transcription factor NR1B2 may bind to miR-122 promoter and regulate its expression.

4. Discussion

For large-scale investigation of human TF-miRNA relations in this study, the expression profiles of human miRNAs and over ten thousands genes from normal human tissues facil-itate the acquirement of miRNA coexpressed gene groups. However, the latest usable RT-qPCR miRNA expression data for human normal tissues were published in 2007 [32]. The currently available data are almost derived from cancer cell lines or tumor tissues. Totally 230 coexpressed gene groups limit the coTFBS analysis of most miRNAs in existence. Besides, based on the cut-off 0.8 PCC value applied to define coexpression between a miRNA and an annotated gene, 34 miRNAs have no coexpressed genes and 50 miRNAs have less than ten coexpressed genes. Although each miRNA has sufficient coexpressed genes to analyze its putative cis-and trans-elements if the lower PCC cut-off was used, it may cause the trade-off of specificity when determining miRNA coexpressed genes. For example, hsa-let-7a-1 has no coexpressed gene when using 0.8 PCC value but has 29 coexpressed genes if 0.6 PCC value was applied.

It is noteworthy that the normalization process between

two raw expression datasets was only𝑍-score transformed,

without using log10transformation before𝑍-score

standard-ization. According to the definition of formula, Pearson’s correlation coefficient calculates linear correlation between two variables and has an invariant property in statistics.

In fact, the log10 transformation tends to alter the original

expression patterns and sharply reduces the scale of raw intensity, resulting in unexpected affection of PCC values. Figure S2 illustrates an example of hsa-miR-122 expression

(5)

9 8 7 6 5 4 3 2 1 0 −1 −2 −3 −4 −5 −6 −7 −8 −9 Ad re n al Br ai n He ar t Ki d n ey Li ve r Lu n g Ly m p h n o d e Ov ar y Pa n cr ea s Placen ta Pro st ate Sk elet al m u sc le Te st is T rac he a Thy mu s Thy ro id Ut er u s hsa-miR-122 Exp ressio n le ve ls ( Z -s co re tra n sf o rmed) (a) 9 8 7 6 5 4 3 2 1 0 −1 −2 −3 −4 −5 −6 −7 −8 −9 Ad re n al Br ai n He ar t K idne y Li ve r Lu n g Ly m p h n o d e Ov ar y Pa n cr ea s Placen ta Pro st ate Sk elet al m u sc le Te st is T rac he a Thy mu s Thy ro id Ut er u s 261 coexpressed genes Exp ressio n le ve ls ( Z -s co re tra n sf o rmed) (b)

Figure 3: 261 human protein-coding genes are coexpressed with hsa-miR-122. The pink curve in 3(b) represents the average value of miR-122 coexpressed genes among 17 human normal tissues. The remarkable peaks appear in liver.

to the high expression level in liver, two obvious peaks of

brain and thymus appear in the log10 transformed

expres-sion profile of hsa-miR-122. The members of hsa-miR-122 coexpression group also reflect the variation. Unlike more

than two hundred coexpression genes by using 𝑍-score

transformed expression data, merely five coexpressed genes

left based on the cut-off 0.8 PCC value by𝑍-score and log10

transformed data.

Another limitation of TFBS/TF detection depends on present transcription factor binding matrices collected in TRANSFAC. As is well known, sequence motif search of transcription factor binding sites is executed by the Match program according to the matrix information in TRANSFAC library. A specific TFBS in miRNA promoter will not be detected if the corresponding matrix has not been obtained by in vitro selection studies. Here is an example. Unlike hsa-miR-122, miR-224 is upregulated in HCC through epigenetic mechanisms and controls several crucial cellular processes [40]. Wang et al. indicated that miR-224 expression is recip-rocally regulated by HDAC1, HDAC3, and EP300. Our result shows that P300 (encoded by EP300) is the TF candidate of miR-224, and its TFBS can be found in 227 coexpressed gene promoters. Because matrices of histone deacetylases 1 and 3 are not available in TRANSFAC, no such transcription factors were predicted.

In conclusion, rather than traditional PWM search char-acterizing putative cis-elements in promoter regions, the coTFBS strategy was developed to determine more confident TFBSs for human miRNAs. The investigation was restricted by the incomplete expression profiles of present human miRNAs. Once most expression profiles of miRNAs in the

latest database are available, TF candidates of more miRNAs can be explored in the future. Furthermore, although more and more ChIP-seq data were generated and were useful to identify transcription factor binding sites [41–43], the corresponding ChIP-seq data of specific TFs are still the minority. It is expected that large-scale ChIP analysis of general TFs contributes more confident TFBSs by observing the aggregated peaks within promoter regions of human miRNAs.

5. Conclusions

In organisms, not all of ribonucleic acids (RNAs) are trans-lated to proteins. miRNAs are such noncoding RNAs which play critical roles in gene regulation, even if it is generally believed that proteins convey vital information from genes and execute biological functions to maintain life processes. Although target prediction has been the mainstream when studying miRNA functions for a while, researchers start to explore TF-miRNA interactions and study the transcrip-tional regulation of miRNAs, which are necessary to depict how miRNAs participate in diverse biological processes. To determine putative TFs and TFBSs located in human miRNA promoters, we created a computational pipeline which not only allows large-scale investigation as long as the expression profiles of miRNAs are available, but also filters the redundancy when searching short sequence. This valuable information is helpful in understanding miRNA functions and provides knowledge to evaluate the therapeutic potential in clinical research.

(6)

Table 1: Putative transcription factors regulating miR-122 gene. The bold items indicate the experimental-supported TFs. PCC represents the Pearson’s correlation coefficient between each TF-encoded gene and miR-122.

Matrix ID Transcription factor Gene PCC Occurrence

V$ELF1 Q6 Elf-1 ELF1 0.192827 242

V$MAFB 01 MAFB MAFB 0.383476 239

V$TBX5 02 TBX5 TBX5 0.132419 227

V$NR1B2 Q6 NR1B2 RARB 0.117271 226

V$CDX2 Q5 02 CDX-2 CDX2 0.105817 215

V$GATA1 01 GATA-1 GATA1 0.0128229 215

V$SMAD3 Q6 01 Smad3 SMAD3 0.0915205 203

V$HNF4A Q6 01 HNF-4alpha HNF4A 0.316168 191

V$SOX5 01 SOX5 SOX5 0.149287 185

V$SPI1 03 SPI1 SPI1 0.268782 178

V$ERBETA Q5 ER-beta ESR2 0.303769 176

V$GATA1 02 GATA-1 GATA1 0.0128229 173

V$GATA1 05 GATA-1 GATA1 0.0128229 168

V$GATA1 06 GATA-1 GATA1 0.0128229 168

V$CDX2 Q5 01 Cdx-2 CDX2 0.105817 167

V$MAZ Q6 MAZ MAZ 0.414832 166

V$MYOD Q6 01 MyoD MYOD1 0.306424 164

V$PITX3 Q2 PITX3 PITX3 0.127413 160

V$ING4 01 ING4 ING4 0.234372 158

V$HNF3B Q6 HNF-3beta FOXA2 0.44036 154

V$CDX2 01 Cdx-2 CDX2 0.105817 152

V$CRX Q4 Crx CRX 0.243011 149

V$HNF3A 01 HNF3A FOXA1 0.140429 148

V$GATA1 04 GATA-1 GATA1 0.0128229 139

V$ERR1 Q3 ERR1 ESRRA 0.18197 135

V$CEBPE Q6 CEBPE CEBPE 0.0949125 133

V$HNF1 02 HNF-1alpha HNF1A 0.363153 122

V$CRX 02 Crx CRX 0.243011 119

V$NEUROD 02 NeuroD NEUROD1 0.0402891 114

V$MAZ Q6 01 MAZ MAZ 0.414832 110

V$OC2 Q3 OC-2 ONECUT2 0.124899 102

V$IRF7 Q3 IRF-7 IRF7 0.13082 100

V$PIT1 Q6 Pit-1 POU1F1 0.319886 99

V$DBP Q6 01 DBP DBP 0.043001 76

V$CEBPG Q6 01 C/EBPgamma CEBPG 0.027117 73

V$MYOD 01 MyoD MYOD1 0.306424 72

V$CREL 01 c-Rel REL 0.29444 44

V$CEBPG Q6 C/EBPgamma CEBPG 0.027117 43

V$ATF4 Q6 ATF-4 ATF4 0.0251237 40

V$HOXA7 01 HOXA7 HOXA7 0.397453 31

V$E2F1 Q4 E2F-1 E2F1 0.110614 29

(7)

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Authors’ Contribution

Chia-Hung Chien, Shun-Long Weng, Wen-Chi Chang, Ann-Ping Tsou, and Hsien-Da Huang conceived and designed the experiments. Chia-Hung Chien and Yi-Fan Chiang-Hsieh analyzed the data and performed the experiments. Chia-Hung Chien wrote the paper.

Acknowledgments

Thanks are due to Pei-Wen Jiang for the assistance of pro-gramming and system maintenance and to Hsi-Yuan Huang for experience sharing in expression profile analysis. This research was supported by a grant from National Science Council of the Republic of China for financially supporting this research under Contract nos. NSC 102-2313-B-006-004, NSC 101-2311-B-009-003-MY3, NSC 100-2627-B-009-002, NSC 102-2911-I-009-101, and NSC 99-2628-B-006-016-MY3.

References

[1] S. M. Khoshnaw, A. R. Green, D. G. Powe, and I. O. Ellis, “MicroRNA involvement in the pathogenesis and management of breast cancer,” Journal of Clinical Pathology, vol. 62, no. 5, pp. 422–428, 2009.

[2] R. Schickel, B. Boyerinas, S.-M. Park, and M. E. Peter, “MicroR-NAs: key players in the immune system, differentiation, tumori-genesis and cell death,” Oncogene, vol. 27, no. 45, pp. 5959–5974, 2008.

[3] H. K. Saini, S. Griffiths-Jones, and A. J. Enright, “Genomic analysis of human microRNA transcripts,” Proceedings of the

National Academy of Sciences of the United States of America,

vol. 104, no. 45, pp. 17719–17724, 2007.

[4] H. K. Saini, A. J. Enright, and S. Griffiths-Jones, “Annotation of mammalian primary microRNAs,” BMC Genomics, vol. 9, article 564, 2008.

[5] J. Qian, Z. Zhang, J. Liang et al., “The full-length transcripts and promoter analysis of intergenic microRNAs in Drosophila

melanogaster,” Genomics, vol. 97, no. 5, pp. 294–303, 2011.

[6] A. Barski, R. Jothi, S. Cuddapah et al., “Chromatin poises miRNA- and protein-coding genes for expression,” Genome

Research, vol. 19, no. 10, pp. 1742–1751, 2009.

[7] C.-H. Chien, Y.-M. Sun, W.-C. Chang et al., “Identifying transcriptional start sites of human microRNAs based on high-throughput sequencing data,” Nucleic Acids Research, vol. 39, no. 21, pp. 9345–9356, 2011.

[8] M. Bhattacharyya, L. Feuerbach, T. Bhadra, T. Lengauer, and S. Bandyopadhyay, “MicroRNA transcription start site prediction with multi-objective feature selection,” Statistical Applications in

Genetics and Molecular Biology, vol. 11, no. 1, article 6, 2012.

[9] Y. Saito, H. Suzuki, T. Taya et al., “Development of a novel microRNA promoter microarray for ChIP-on-chip assay to identify epigenetically regulated microRNAs,” Biochemical and

Biophysical Research Communications, vol. 426, no. 1, pp. 33–37,

2012.

[10] J. Wang, M. Lu, C. Qiu, and Q. Cui, “TransmiR: a transcription factor microRNA regulation database,” Nucleic Acids Research, vol. 38, supplement 1, pp. D119–D122, 2009.

[11] R. Shalgi, D. Lieber, M. Oren, and Y. Pilpel, “Global and local architecture of the mammalian microRNA-transcription factor regulatory network,” PLoS Computational Biology, vol. 3, no. 7, article e131, 2007.

[12] C.-Y. Chen, S.-T. Chen, C.-S. Fuh, H.-F. Juan, and H.-C. Huang, “Coregulation of transcription factors and microRNAs in human transcriptional regulatory network,” BMC

Bioinfor-matics, vol. 12, supplement 1, article S41, 2011.

[13] A. Re, D. Core, D. Taverna, and M. Caselle, “Genome-wide sur-vey of microRNA-transcription factor feed-forward regulatory circuits in human,” Molecular BioSystems, vol. 5, no. 8, pp. 854– 867, 2009.

[14] O. Friard, A. Re, D. Taverna, M. de Bortoli, and D. Cor´a, “CircuitsDB: a database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse,”

BMC Bioinformatics, vol. 11, article 435, 2010.

[15] A. le Bechec, E. Portales-Casamar, G. Vetter et al., “MIR@NT@N: a framework integrating transcription factors, microRNAs and their targets to identify sub-network motifs in a meta-regulation network model,” BMC Bioinformatics, vol. 12, article 67, 2011.

[16] H. Naeem, R. Kuffner, and R. Zimmer, “MIRTFnet: analysis of miRNA regulated transcription factors,” PLoS ONE, vol. 6, no. 8, article e22519, 2011.

[17] X. Li, W. Jiang, W. Li et al., “Dissection of human miRNA regulatory influence to subpathway,” Briefings in Bioinformatics, vol. 13, no. 2, pp. 175–186, 2012.

[18] M. Beckstette, R. Homann, R. Giegerich, and S. Kurtz, “Fast index based algorithms and software for matching position specific scoring matrices,” BMC Bioinformatics, vol. 7, article 389, 2006.

[19] M. C. Frith, J. L. Spouge, U. Hansen, and Z. Weng, “Statistical significance of clusters of motifs represented by position spe-cific scoring matrices in nucleotide sequences,” Nucleic Acids

Research, vol. 30, no. 14, pp. 3214–3224, 2002.

[20] S. Rahmann, T. Muller, and M. Vingron, “On the power of pro-files for transcription factor binding site detection,” Statistical

Applications in Genetics and Molecular Biology, vol. 2, article 7,

2003.

[21] Y. M. Oh, J. K. Kim, S. Choi, and J.-Y. Yoo, “Identification of co-occurring transcription factor binding sites from DNA sequence using clustered position weight matrices,” Nucleic

Acids Research, vol. 40, no. 5, article e38, 2012.

[22] V. Matys, E. Fricke, R. Geffers et al., “TRANSFAC: tran-scriptional regulation, from patterns to profiles,” Nucleic Acids

Research, vol. 31, no. 1, pp. 374–378, 2003.

[23] E. Portales-Casamar, S. Thongjuea, A. T. Kwon et al., “JASPAR 2010: the greatly expanded open-access database of transcrip-tion factor binding profiles,” Nucleic Acids Research, vol. 38, supplement 1, pp. D105–D110, 2010.

[24] S. Aerts, G. Thijs, B. Coessens, M. Staes, Y. Moreau, and B. de Moor, “Toucan: deciphering the cis-regulatory logic of coregulated genes,” Nucleic Acids Research, vol. 31, no. 6, pp. 1753–1764, 2003.

[25] T. T. Marstrand, J. Frellsen, I. Moltke et al., “Asap: a framework for over-representation statistics for transcription factor bind-ing sites,” PLoS ONE, vol. 3, no. 2, article e1623, 2008.

(8)

[26] F. Zambelli, G. Pesole, and G. Pavesi, “Pscan: finding over-represented transcription factor binding site motifs in sequences from co-regulated or co-expressed genes,” Nucleic

Acids Research, vol. 37, supplement 2, pp. W247–W252, 2009.

[27] S. J. H. Sui, J. R. Mortimer, D. J. Arenillas et al., “oPOSSUM: identification of over-represented transcription factor binding sites in co-expressed genes,” Nucleic Acids Research, vol. 33, no. 10, pp. 3154–3164, 2005.

[28] A. Kozomara and S. Griffiths-Jones, “miRBase: integrating microRNA annotation and deep-sequencing data,” Nucleic

Acids Research, vol. 39, supplement 1, pp. D152–D157, 2011.

[29] T. R. Dreszer, D. Karolchik, A. S. Zweig et al., “The UCSC Genome Browser database: extensions and updates 2011,”

Nucleic Acids Research, vol. 40, supplement 1, pp. D918–D923,

2012.

[30] A. I. Su, T. Wiltshire, S. Batalov et al., “A gene atlas of the mouse and human protein-encoding transcriptomes,” Proceedings of

the National Academy of Sciences of the United States of America,

vol. 101, no. 16, pp. 6062–6067, 2004.

[31] T. Barrett, D. B. Troup, S. E. Wilhite et al., “NCBI GEO: mining tens of millions of expression profiles—database and tools update,” Nucleic Acids Research, vol. 35, supplement 1, pp. D760– D765, 2007.

[32] Y. Liang, D. Ridzon, L. Wong, and C. Chen, “Characterization of microRNA expression profiles in normal human tissues,” BMC

Genomics, vol. 8, article 166, 2007.

[33] C. Cheadle, M. P. Vawter, W. J. Freed, and K. G. Becker, “Analysis of microarray data using Z score transformation,” The Journal of

Molecular Diagnostics, vol. 5, no. 2, pp. 73–81, 2003.

[34] P. Flicek, I. Ahmed, M. R. Amode et al., “Ensembl 2013,” Nucleic

Acids Research, vol. 41, no. 1, pp. D48–D55, 2013.

[35] A. E. Kel, E. Gossling, I. Reuter, E. Cheremushkin, O. V. Kel-Margoulis, and E. Wingender, “MATCH: a tool for searching transcription factor binding sites in DNA sequences,” Nucleic

Acids Research, vol. 31, no. 13, pp. 3576–3579, 2003.

[36] W.-C. Tsai, S.-D. Hsu, C.-S. Hsu et al., “MicroRNA-122 plays a critical role in liver homeostasis and hepatocarcinogenesis,” The

Journal of Clinical Investigation, vol. 122, no. 8, pp. 2884–2897,

2012.

[37] Z. Y. Li, Y. Xi, W. N. Zhu et al., “Positive regulation of hepatic miR-122 expression by HNF4𝛼,” Journal of Hepatology, vol. 55, no. 3, pp. 602–611, 2011.

[38] C. Coulouarn, V. M. Factor, J. B. Andersen, M. E. Durkin, and S. S. Thorgeirsson, “Loss of miR-122 expression in liver cancer correlates with suppression of the hepatic phenotype and gain of metastatic properties,” Oncogene, vol. 28, no. 40, pp. 3526–3536, 2009.

[39] H. de The, A. Marchio, P. Tiollais, and A. Dejean, “A novel steroid thyroid hormone receptor-related gene inappropriately expressed in human hepatocellular carcinoma,” Nature, vol. 330, no. 6149, pp. 667–670, 1987.

[40] Y. Wang, H. C. Toh, P. Chow et al., “MicroRNA-224 is up-regulated in hepatocellular carcinoma through epigenetic mechanisms,” The FASEB Journal, vol. 26, no. 7, pp. 3032–3041, 2012.

[41] J. Qin, M. J. Li, P. Wang, M. Q. Zhang, and J. Wang, “ChIP-Array: combinatory analysis of ChIP-seq/chip and microarray gene expression data to discover direct/indirect targets of a transcription factor,” Nucleic Acids Research, vol. 39, supplement 2, pp. W430–W436, 2011.

[42] M. Hu, J. Yu, J. M. Taylor, A. M. Chinnaiyan, and Z. S. Qin, “On the detection and refinement of transcription factor binding sites using ChIP-Seq data,” Nucleic Acids Research, vol. 38, no. 7, pp. 2154–2167, 2010.

[43] T. Handstad, M. B. Rye, F. Drablos, and P. Sætrom, “A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites,” PLoS

(9)

Submit your manuscripts at

http://www.hindawi.com

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Anatomy

Research International

Peptides

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Hindawi Publishing Corporation http://www.hindawi.com

International Journal of

Volume 2014

Zoology

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014 Molecular Biology International

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

The Scientific

World Journal

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Bioinformatics

Advances in

Marine Biology

Journal of

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014 Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Signal Transduction

Journal of

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

BioMed

Research International

Evolutionary Biology

International Journal of

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Biochemistry Research International

Archaea

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014 Genetics

Research International

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Advances in

Virology

Hindawi Publishing Corporation http://www.hindawi.com

Nucleic Acids

Journal of

Volume 2014

Stem Cells

International

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

Enzyme

Research

Hindawi Publishing Corporation

http://www.hindawi.com Volume 2014

International Journal of

數據

Figure 1: The summary of detecting TF-miR regulatory relations in human genome.
Figure 2: The concept of coTFBS. Protein-coding genes 1, 2, 3, and 4 are coexpressed with the miRNA gene
Figure 3: 261 human protein-coding genes are coexpressed with hsa-miR-122. The pink curve in 3(b) represents the average value of miR-122 coexpressed genes among 17 human normal tissues
Table 1: Putative transcription factors regulating miR-122 gene. The bold items indicate the experimental-supported TFs

參考文獻

相關文件

Microphone and 600 ohm line conduits shall be mechanically and electrically connected to receptacle boxes and electrically grounded to the audio system ground point.. Lines in

• Learn the mapping between input data and the corresponding points the low dimensional manifold using mixture of factor analyzers. • Learn a dynamical model based on the points on

It’s based on the PZB service quality gap model and SERVQUAL scale, and together with the reference to other relevant research and service, taking into account

SERVQUAL Scale and relevant scales to bus service quality, and based on service content and customer service related secondary data of H highway bus service company, to design the

Monopolies in synchronous distributed systems (Peleg 1998; Peleg

Based on the reformulation, a semi-smooth Levenberg–Marquardt method was developed, and the superlinear (quadratic) rate of convergence was established under the strict

Corollary 13.3. For, if C is simple and lies in D, the function f is analytic at each point interior to and on C; so we apply the Cauchy-Goursat theorem directly. On the other hand,

Corollary 13.3. For, if C is simple and lies in D, the function f is analytic at each point interior to and on C; so we apply the Cauchy-Goursat theorem directly. On the other hand,