北京大学学报(医学版) ›› 2017, Vol. 49 ›› Issue (3): 551-556. doi: 10.3969/j.issn.1671-167X.2017.03.030

• 论著 • 上一篇    下一篇

构建用于预测中药化学成分心脏毒性的定量构效关系模型

李雅秋1,王旗1,2,3△   

  1. (1. 北京大学公共卫生学院毒理学系, 北京100191; 2. 国家中医药管理局中药配伍减毒重点研究室, 北京100191; 3. 食品安全毒理学研究与评价北京市重点实验室,北京100191)
  • 出版日期:2017-06-18 发布日期:2017-06-18
  • 通讯作者: 王旗 E-mail:wangqi@bjmu.edu.cn
  • 基金资助:
    中医药行业科研专项基金(201507004-1)资助

Quantitative structure-activity relationship model for prediction of cardiotoxicity of chemical components in traditional Chinese medicines

LI Ya-qiu1, WANG Qi1,2,3△   

  1. (1. Department of Toxicology, Peking University School of Public Health, Beijing 100191, China; 2. Key Laboratory of State Administration of TCM for Compatibility Toxicology, Beijing 100191, China; 3. Beijing Key Laboratory of Toxicological Research and Risk Assessment for Food Safety, Beijing 100191, China)
  • Online:2017-06-18 Published:2017-06-18
  • Contact: WANG Qi E-mail:wangqi@bjmu.edu.cn
  • Supported by:
    Supported by the National Special Project for Traditional Chinese Medicine Industry (2015070041)

摘要: 目的:当前用于预测心脏毒性的定量构效关系(quantitative structureactivity relationship, QSAR)模型仅限于hERG通道抑制作用这一机制,应用范围较狭窄。本研究旨在构建包含各类心脏不良反应的QSAR模型,以应用于中药化学成分潜在心脏毒性的预测。方法: 从Toxicity Reference Database(ToxRefDB)和Side Effect Resource(SIDER)数据库中共收集1 109个具有心脏毒性的化合物和789个不具有心脏毒性的化合物作为构建QSAR模型的训练集,应用ADMET Predictor软件计算、筛选分子描述符,通过两种算法(支持向量机和人工神经网络)依次纳入不同数量分子描述符分别构建QSAR模型,通过10折交叉验证方法进行内部验证选择最优模型,然后通过查阅文献及数据库共收集19种具有心脏毒性和10种不具有心脏毒性的中药化学成分作为外部验证集,评价所建QSAR模型对于中药化学成分心脏毒性预测的适用性。结果: 经筛选后共有220种分子描述符参与建模,用支持向量机算法所建的最优模型为包含87种分子描述符的模型,其内部验证结果显示模型灵敏度为71%,特异度为70%,约登指数(Youden’s index)和马修斯相关系数(Matthews correlation coefficient)均为0.41。用人工神经网络算法所建的最优模型为包含13个神经元及87种分子描述符的模型,其内部验证结果显示模型灵敏度为78%,特异度为77%,约登指数和马修斯相关系数均为0.54。通过29种中药化学成分验证显示,支持向量机模型外部验证结果灵敏度为95%,特异度为40%,整体预测的准确率达到76%;人工神经网络模型外部验证结果灵敏度为95%,特异度为60%,整体预测的准确率达到83%。结论:应用人工神经网络算法构建的模型预测能力要优于支持向量机算法构建的模型,通过已知毒性的中药化学成分验证表明,此QSAR模型有良好的灵敏度和预测准确率,可以用于中药化学成分心脏毒性的预测。

关键词:  医学, 中国传统, 心脏毒素类, 量化构效关系

Abstract: Objective: Some quantitative structure-activity relationship (QSAR) models have been developed to predict cardiac toxicity of drugs, which have limited predictive power due to based on hERG channel inhibition. The objective of this study was try to develop a QSAR model based on all kinds of cardiac adverse effects, and to predict the potential cardiotoxicity of chemical components in traditional Chinese medicines (TCM). Methods: In this study, the compounds data of all kinds of cardiac adverse reactions were selected as the training set. The QSAR models were constructed based on 1 109 compounds with cardiotoxicity and 789 compounds without cardiotoxicity, which were available from the Toxicity Reference Database (ToxRefDB) and Side Effect Resource (SIDER) database. The ADMET Predictor software was applied to calculate and to screen the molecular descriptors, and to construct the QSAR models using support vector machine (SVM) and artificial neural networks (ANN) algorithm, respectively. The models were optimized using compound-based 10-fold cross validation. Then, the predictive performance for the potential cardiotoxicity of chemical components in TCM were assessed using external validation by 19 components in TCM with cardiotoxicity and 10 components in TCM without cardiotoxicity. Results: A total of 220 molecular descriptors were selected for modeling, and the best model using SVM algorithm contained 87 molecular descriptors. The internal validation results showed that the predictive sensitivity, specificity, the Youden’s index (YI) and the Matthews correlation coefficient (MCC) were 71%, 70%, 0.41, and 0.41, respectively. The best model constructed using ANN algorithm contained 13 neurons and 87 molecular descriptors. The internal validation results showed that the predictive sensitivity, specificity, the YI and the MCC were 78%, 77%, 0.54, and 0.54, respectively. Both models were validated using external validation by the same set of 29 chemical components in TCM with or without cardiotoxicity, which were not included in the training set. The predictive performances of SVM or ANN model were as follows, respectively: sensitivity 95%, 95%; specificity 40%, 60%; and accuracy 76%, 83%. Conclusion: The predictive performance of the QSAR model using ANN algorithm was better than that of the model using SVM algorithm. The external validation study of 29 chemical components in TCM illustrated that the QSAR model was applicable for screening and predicting the potential cardiotoxicity of chemical components in TCM.

Key words: Medicine, Chinese traditional, Cardiotoxins, Quantitative structure-activity relationship

中图分类号: 

  • R284.1
No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] 张三. 中文标题测试[J]. 北京大学学报(医学版), 2010, 42(1): 1 -10 .
[2] 赵磊, 王天龙 . 右心室舒张末期容量监测用于肝移植术中容量管理的临床研究[J]. 北京大学学报(医学版), 2009, 41(2): 188 -191 .
[3] 万有, , 韩济生, John E. Pintar. 孤啡肽基因敲除小鼠电针镇痛作用增强[J]. 北京大学学报(医学版), 2009, 41(3): 376 -379 .
[4] 张燕, 韩志慧, 钟延丰, 王盛兰, 李玲玲, 郑丹枫. 骨骼肌活组织检查病理诊断技术的改进及应用[J]. 北京大学学报(医学版), 2009, 41(4): 459 -462 .
[5] 赵奇, 薛世华, 刘志勇, 吴凌云. 同向施压测定自酸蚀与全酸蚀粘接系统粘接强度[J]. 北京大学学报(医学版), 2010, 42(1): 82 -84 .
[6] 林红, 王玉凤, 吴野平. 学校生活技能教育对小学三年级学生行为问题影响的对照研究[J]. 北京大学学报(医学版), 2007, 39(3): 319 -322 .
[7] 丰雷, 程嘉, 王玉凤. 注意缺陷多动障碍儿童的运动协调功能[J]. 北京大学学报(医学版), 2007, 39(3): 333 -336 .
[8] 李岳玲, 钱秋瑾, 王玉凤. 儿童注意缺陷多动障碍成人期预后及其预测因素[J]. 北京大学学报(医学版), 2007, 39(3): 337 -340 .
[9] . 书讯[J]. 北京大学学报(医学版), 2007, 39(3): 225 -328 .
[10] 牟向东, 王广发, 刁小莉, 阙呈立. 肺黏膜相关淋巴组织型边缘区B细胞淋巴瘤一例[J]. 北京大学学报(医学版), 2007, 39(4): 346 -350 .