Quantitative structure-property relationship (QSPR) prediction of liquid viscosities of pure organic compounds employing random forest regression

TitleQuantitative structure-property relationship (QSPR) prediction of liquid viscosities of pure organic compounds employing random forest regression
Publication TypeJournal Article
Year of Publication2009
AuthorsRajappan, R, Shingade, PD, Natarajan, R, Jayaraman, VK
JournalIndustrial & Engineering Chemistry Research
Volume48
Issue21
Pagination9708-9712
Date PublishedNOV
ISSN0888-5885
Abstract

A quantitative structure-property relationship (QSPR) approach was used to develop a predictive model for viscosities of pure organic liquids using a set of 403 compounds that belong to diverse classes of organic chemicals. A pool of 116 descriptors that encode topostructural, topochemical, electrotopological, geometrical, and quantum chemical properties of the organic compounds was used to develop QSPR models, based on the robust Random Forest (RF) regression algorithm. The performance of the algorithm, in terms of correlation coefficients and mean square errors, was determined to be good. The capability of the algorithm to build models and select the most-informative features simultaneously is very useful for several quantitative structure-activity/property relationship tasks. The eight most-dominant features selected by the RF regression algorithm primarily contained predictors that encode characteristics of atoms and groups that form hydrogen bonds, as well as factors involving molecular shape and size.

DOI10.1021/ie8018406
Type of Journal (Indian or Foreign)Foreign
Impact Factor (IF)2.071
Divison category: 
Chemical Engineering & Process Development