機械学習による化合物の逆合成解析可能性予測手法の開発

小澤, 真実; 安尾, 信明; 関嶋, 政和

WEKO3

lat lon distance

[[sub_check.contents]]

[[sub_radio.contents]]

Field does not validate

[[sub_attr.contents]]　

インデックスツリー

アイテム

機械学習による化合物の逆合成解析可能性予測手法の開発

https://ipsj.ixsq.nii.ac.jp/records/216928

名前 / ファイル	ライセンス	アクション
IPSJ-BIO22069018.pdf (3.0 MB)	Copyright (c) 2022 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2022-03-03

タイトル

機械学習による化合物の逆合成解析可能性予測手法の開発

タイトル

言語

タイトル

Development of a method for predicting the accessibility of retrosynthesis of compounds by machine learning

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

東京工業大学情報理工学院

著者所属

東京工業大学物質・情報卓越教育院

著者所属

東京工業大学情報理工学院／東京工業大学物質・情報卓越教育院

著者所属(英)

School of Computing, Tokyo Institute of Technology

著者所属(英)

Tokyo Tech Academy for Convergence of Materials and Informatics, Tokyo Institute of Technology

著者所属(英)

School of Computing, Tokyo Institute of Technology / Tokyo Tech Academy for Convergence of Materials and Informatics, Tokyo Institute of Technology

著者名

小澤, 真実
安尾, 信明
関嶋, 政和

論文抄録

内容記述タイプ

Other

内容記述

創薬の分野における研究・開発コスト削減手段の一つとして，計算機科学を利用する手法が注目されている．化合物探索におけるその一例として，分子生成モデルによる膨大な量の化合物の提案，そしてその膨大な量の化合物をウエットな実験に進む前に合成可能かどうかでフィルタリングする，というものが考えられる．フィルタリングには逆合成解析によるものとスコアリングによるものがあり，既存のスコアリング手法の一つとしてRAscoreが挙げられる．これは，逆合成解析可能性を機械学習によって予測するスコアである．本研究では，RAscoreモデルの実用性について議論し，パラメータチューニングによってより正確な予測が行えるモデルの開発を提案する．元々のモデルの学習に用いられたChEMBLのデータのほかに，分子生成モデルを用いて作った合成困難なデータを用いて重みパラメータを学習したモデルを2種類作成し，既存のRAscoreモデルを用いたスコアをベースラインとして比較することでこの手法を評価した．その結果，新たに作成したモデルの一つはベースラインモデルベースラインと比較して負例についての判別性能の向上が見られ，AUC，Binary Accuracy, lossについて同等以上の精度をもつモデルとなった．

論文抄録(英)

内容記述タイプ

Other

内容記述

In the field of drug discovery, the use of computer science is gaining increasing attention as a means of reducing R&D costs. One example of this in compound discovery is the method where a molecular generation model proposes a large number of compounds and filters the large number of compounds based on their synthetic potential before proceeding to wet experiments. Filtering can be done either by inverse compositional analysis or by scoring. One of the existing scoring methods is RAscore, a score that predicts the likelihood of inverse compositional analysis by machine learning. In this study, we discuss the practicality of the RAscore model and propose to develop a model that can make more accurate predictions by tuning its parameters. In addition to the ChEMBL data used to train the original model, two models with weight parameters trained on difficult-to-synthesise data generated using molecular generation models were created, and the scores were compared using the existing RAscore model as a baseline to evaluate the method. The results showed that one of the new models had better discrimination performance for negative cases than the baseline model, and had similar or better accuracy for AUC, Binary Accuracy and loss.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AA12055912

書誌情報

研究報告バイオ情報学（BIO）

巻 2022-BIO-69, 号 18, p. 1-6, 発行日 2022-03-03

ISSN

収録物識別子タイプ

ISSN

収録物識別子

2188-8590

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-19 15:41:02.549713

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

機械学習による化合物の逆合成解析可能性予測手法の開発

× 小澤, 真実

× 安尾, 信明

× 関嶋, 政和

Versions

Share

Cite as

エクスポート