ログイン 新規登録
言語:

WEKO3

  • トップ
  • ランキング
To
lat lon distance
To

Field does not validate



インデックスリンク

インデックスツリー

メールアドレスを入力してください。

WEKO

One fine body…

WEKO

One fine body…

アイテム

  1. 研究報告
  2. バイオ情報学(BIO)
  3. 2021
  4. 2021-BIO-68

Predicting PRDM9 binding sites by a convolutional neural network and verification using genetic recombination map

https://ipsj.ixsq.nii.ac.jp/records/214037
https://ipsj.ixsq.nii.ac.jp/records/214037
bc9f0637-4682-423e-9ae7-b08007985240
名前 / ファイル ライセンス アクション
IPSJ-BIO21068001.pdf IPSJ-BIO21068001.pdf (627.5 kB)
Copyright (c) 2021 by the Information Processing Society of Japan
オープンアクセス
Item type SIG Technical Reports(1)
公開日 2021-11-23
タイトル
タイトル Predicting PRDM9 binding sites by a convolutional neural network and verification using genetic recombination map
タイトル
言語 en
タイトル Predicting PRDM9 binding sites by a convolutional neural network and verification using genetic recombination map
言語
言語 eng
資源タイプ
資源タイプ識別子 http://purl.org/coar/resource_type/c_18gh
資源タイプ technical report
著者所属
Graduate School of Information Science and Technology, Hokkaido University
著者所属
Graduate School of Information Science and Technology, Hokkaido University/Faculty of Information Science and Technology, Hokkaido University
著者所属
Graduate School of Information Science and Technology, Hokkaido University/Faculty of Information Science and Technology, Hokkaido University
著者所属(英)
en
Graduate School of Information Science and Technology, Hokkaido University
著者所属(英)
en
Graduate School of Information Science and Technology, Hokkaido University / Faculty of Information Science and Technology, Hokkaido University
著者所属(英)
en
Graduate School of Information Science and Technology, Hokkaido University / Faculty of Information Science and Technology, Hokkaido University
著者名 Takahiro, Nakamura

× Takahiro, Nakamura

Takahiro, Nakamura

Search repository
Toshinori, Endo

× Toshinori, Endo

Toshinori, Endo

Search repository
Naoki, Osada

× Naoki, Osada

Naoki, Osada

Search repository
著者名(英) Takahiro, Nakamura

× Takahiro, Nakamura

en Takahiro, Nakamura

Search repository
Toshinori, Endo

× Toshinori, Endo

en Toshinori, Endo

Search repository
Naoki, Osada

× Naoki, Osada

en Naoki, Osada

Search repository
論文抄録
内容記述タイプ Other
内容記述 PR domain-containing 9 (PRDM9) is a zinc-finger protein that binds to specific DNA motifs and induces the crossing-over between chromosomes, resulting in a high recombination rate around binding sites. In this study, we developed a strategy to evaluate the prediction accuracy of PRDM9 binding site by examining the correlation with local recombination rate to avoid the effect of overfitting to one type of data. We compared the methods using position-specific weight matrix (PWM), which has been commonly used in previous studies, and convolutional network (CNN), which has recently performed well. Approximately 170,000 genomic DNA fragments of humans (301 bp each) containing the Chromatin Immuno-Precipitation with high-throughput sequencing (ChIP-seq) peak of PRDM9 of B-allele in the HEK293T cell line were used for constructing PWM and positive data to train CNN. We found that CNN outperformed PWM in terms of area under the curve, and the prediction scores of CNN correlated more strongly with the local recombination rate than PWM. We also investigated the potential PRDM9 binding sites missed by the ChIP-seq experiments but labeled as positive in CNN and discuss the reason for the difference in performances.
論文抄録(英)
内容記述タイプ Other
内容記述 PR domain-containing 9 (PRDM9) is a zinc-finger protein that binds to specific DNA motifs and induces the crossing-over between chromosomes, resulting in a high recombination rate around binding sites. In this study, we developed a strategy to evaluate the prediction accuracy of PRDM9 binding site by examining the correlation with local recombination rate to avoid the effect of overfitting to one type of data. We compared the methods using position-specific weight matrix (PWM), which has been commonly used in previous studies, and convolutional network (CNN), which has recently performed well. Approximately 170,000 genomic DNA fragments of humans (301 bp each) containing the Chromatin Immuno-Precipitation with high-throughput sequencing (ChIP-seq) peak of PRDM9 of B-allele in the HEK293T cell line were used for constructing PWM and positive data to train CNN. We found that CNN outperformed PWM in terms of area under the curve, and the prediction scores of CNN correlated more strongly with the local recombination rate than PWM. We also investigated the potential PRDM9 binding sites missed by the ChIP-seq experiments but labeled as positive in CNN and discuss the reason for the difference in performances.
書誌レコードID
収録物識別子タイプ NCID
収録物識別子 AA12055912
書誌情報 研究報告バイオ情報学(BIO)

巻 2021-BIO-68, 号 1, p. 1-7, 発行日 2021-11-23
ISSN
収録物識別子タイプ ISSN
収録物識別子 2188-8590
Notice
SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.
出版者
言語 ja
出版者 情報処理学会
戻る
0
views
See details
Views

Versions

Ver.1 2025-01-19 16:55:39.472499
Show All versions

Share

Mendeley Twitter Facebook Print Addthis

Cite as

エクスポート

OAI-PMH
  • OAI-PMH JPCOAR
  • OAI-PMH DublinCore
  • OAI-PMH DDI
Other Formats
  • JSON
  • BIBTEX

Confirm


Powered by WEKO3


Powered by WEKO3