[口头报告]Predicting functional effect of missense variants using graph neural networks and deep sequence language modelling

Predicting functional effect of missense variants using graph neural networks and deep sequence language modelling
编号:126 访问权限:仅限参会人 更新:2022-07-07 11:14:39 浏览:573次 口头报告

报告开始:2022年07月23日 17:40 (Asia/Shanghai)

报告时间:15min

所在会议:[S2] 分会场2 » [S2-1] 基因组学与表观基因组学

暂无文件

摘要
Accurate prediction of damaging missense variants is critically important for interpreting genome sequence. While many methods have been developed, their performance has been limited. Recent progress in machine learning and availability of large-scale population genomic sequencing data provide new opportunities to significantly improve computational predictions. I this talk, I will present our two methods gMVP and tMVP for predicting the damaging missense variants. gMVP is a new method based on graph attention neural networks. Its main component is a graph with nodes capturing predictive features of amino acids and edges weighted by coevolution strength, which enables effective pooling of information from local protein context and functionally correlated distal positions. Evaluated by deep mutational scan data, gMVP outperforms published methods in identifying damaging variants in TP53, PTEN, BRCA1, and MSH2. Additionally, it achieves the best separation of de novo missense variants in neurodevelopmental disorder cases from the ones in controls. Finally, the model supports transfer learning to optimize gain- and loss-of-function predictions in sodium and calcium channels. In summary, we demonstrate that gMVP can improve interpretation of missense variants in clinical testing and genetic studies. I will also present our latest method tMVP for predicting damaging missense variants which is based on a deep generative language model trained on 250 million protein sequences. Additionally, I will briefly introduce AbFold, our recently developed software for antibody structure prediction
 
关键字
精准医疗;人工智能
报告人
张海仓
中科院计算技术研究所

张海仓,中国科学院计算技术研究所,副研究员。博士毕业于中科院计算技术研究所。曾任字节跳动高级算法工程师、哥伦比亚大学博士后研究科学家。研究方向包括生物信息学(AI 蛋白质设计,蛋白质结构预测,计算基因 组学)和深度学习算法等。以一作或通讯作者身份在 Nat. Commun., Bioinformatics 等高水平国际学术期刊发表 SCI 论文 20 余篇。
 

发表评论
验证码 看不清楚,更换一张
全部评论