WEKO3
アイテム
A Method for Isoform Prediction from RNA-Seq Data by Iterative Mapping
https://ipsj.ixsq.nii.ac.jp/records/82651
https://ipsj.ixsq.nii.ac.jp/records/8265131600d5b-559e-4e45-a245-5814c288ce6d
名前 / ファイル | ライセンス | アクション |
---|---|---|
![]() |
Copyright (c) 2012 by the Information Processing Society of Japan
|
|
オープンアクセス |
Item type | SIG Technical Reports(1) | |||||||
---|---|---|---|---|---|---|---|---|
公開日 | 2012-06-21 | |||||||
タイトル | ||||||||
タイトル | A Method for Isoform Prediction from RNA-Seq Data by Iterative Mapping | |||||||
タイトル | ||||||||
言語 | en | |||||||
タイトル | A Method for Isoform Prediction from RNA-Seq Data by Iterative Mapping | |||||||
言語 | ||||||||
言語 | eng | |||||||
資源タイプ | ||||||||
資源タイプ識別子 | http://purl.org/coar/resource_type/c_18gh | |||||||
資源タイプ | technical report | |||||||
著者所属 | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属 | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属 | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属 | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者所属(英) | ||||||||
en | ||||||||
Department of Bioinformatic Engineering, Graduate School of Information Science and Technology, Osaka University | ||||||||
著者名 |
Tomoshige, Ohno
× Tomoshige, Ohno
|
|||||||
著者名(英) |
Tomoshige, Ohno
× Tomoshige, Ohno
|
|||||||
論文抄録 | ||||||||
内容記述タイプ | Other | |||||||
内容記述 | Alternative splicing plays an important role in eukaryotic gene expression by producing diverse proteins from a single gene. Predicting how genes are transcribed is of great biological interest. To this end, massively parallel whole transcriptome sequencing, often referred to as RNA-Seq, is becoming widely used and is revolutionizing the cataloging isoforms using a vast number of short mRNA fragments called reads. Conventional RNA-Seq analysis methods typically align reads onto a reference genome (mapping) in order to capture the form of isoforms that each gene yields and how much of every isoform is expressed from an RNA-Seq dataset. However, a considerable number of reads cannot be mapped uniquely. Those so-called multireads that are mapped onto multiple locations due to short read length and analogous sequences inflate the uncertainty as to how genes are transcribed. This causes inaccurate gene expression estimations and leads to incorrect isoform prediction. To cope with this problem, we propose a method for isoform prediction by iterative mapping. The positions from which multireads originate can be estimated based on the information of expression levels, whereas quantification of isoform-level expression requires accurate mapping. These procedures are mutually dependent, and therefore remapping reads is essential. By iterating this cycle, our method estimates gene expression levels more precisely and hence improves predictions of alternative splicing. Our method simultaneously estimates isoform-level expressions by computing how many reads originate from each candidate isoform using an EM algorithm within a gene. To validate the effectiveness of the proposed method, we compared its performance with conventional methods using an RNA-Seq dataset derived from a human brain. The proposed method had a precision of 66.7% and outperformed conventional methods in terms of the isoform detection rate. | |||||||
論文抄録(英) | ||||||||
内容記述タイプ | Other | |||||||
内容記述 | Alternative splicing plays an important role in eukaryotic gene expression by producing diverse proteins from a single gene. Predicting how genes are transcribed is of great biological interest. To this end, massively parallel whole transcriptome sequencing, often referred to as RNA-Seq, is becoming widely used and is revolutionizing the cataloging isoforms using a vast number of short mRNA fragments called reads. Conventional RNA-Seq analysis methods typically align reads onto a reference genome (mapping) in order to capture the form of isoforms that each gene yields and how much of every isoform is expressed from an RNA-Seq dataset. However, a considerable number of reads cannot be mapped uniquely. Those so-called multireads that are mapped onto multiple locations due to short read length and analogous sequences inflate the uncertainty as to how genes are transcribed. This causes inaccurate gene expression estimations and leads to incorrect isoform prediction. To cope with this problem, we propose a method for isoform prediction by iterative mapping. The positions from which multireads originate can be estimated based on the information of expression levels, whereas quantification of isoform-level expression requires accurate mapping. These procedures are mutually dependent, and therefore remapping reads is essential. By iterating this cycle, our method estimates gene expression levels more precisely and hence improves predictions of alternative splicing. Our method simultaneously estimates isoform-level expressions by computing how many reads originate from each candidate isoform using an EM algorithm within a gene. To validate the effectiveness of the proposed method, we compared its performance with conventional methods using an RNA-Seq dataset derived from a human brain. The proposed method had a precision of 66.7% and outperformed conventional methods in terms of the isoform detection rate. | |||||||
書誌レコードID | ||||||||
収録物識別子タイプ | NCID | |||||||
収録物識別子 | AA12055912 | |||||||
書誌情報 |
研究報告バイオ情報学(BIO) 巻 2012-BIO-29, 号 13, p. 1-7, 発行日 2012-06-21 |
|||||||
Notice | ||||||||
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc. | ||||||||
出版者 | ||||||||
言語 | ja | |||||||
出版者 | 情報処理学会 |