Prediction of the vapor pressure of some halogenated methyl-phenyl ether (anisole) compounds using linear and nonlinear QSPR methods

Nasser Goudarzi, Mohammad Goodarzi

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

In this work, several chemometric methods were applied for the modeling and prediction of the vapor pressure (-Log PL°) of halogenated methyl-phenyl ether (anisole) compounds. A genetic algorithm method designed for the selection of variables in the multiple linear regression (MLR) model and also the PCA-ranking technique were chosen as feature selection methods for building a least square support vector machine (LS-SVM) model to predict-Log PL°. The multiple linear regression method was used to build a linear relationship between molecular descriptors and the-Log PL° of these compounds. The LS-SVM was then utilized to construct the nonlinear quantitative structure-activity relationship models. The results obtained using LS-SVM were compared with MLR; this revealed that the LS-SVM model was much better than the GA-MLR model. The root-mean-square errors of the training set and the test set for the PC-ranking-LS-SVM model were 0. 2912 and 0.2427, and the correlation coefficients were 0.9259 and 0.9112, respectively. This paper provides a new and effective method for predicting-Log PL° of organic compounds, and also reveals that PC-ranking-LS-SVM can be used as a powerful chemometrics tool for quantitative structure-property relationship (QSPR) studies.

Original languageEnglish (US)
Pages (from-to)1615-1620
Number of pages6
JournalMolecular Physics
Volume107
Issue number15
DOIs
StatePublished - Oct 1 2009

Fingerprint

Vapor Pressure
anisole
Quantitative Structure-Activity Relationship
Least-Squares Analysis
Vapor pressure
vapor pressure
Support vector machines
Linear Models
ethers
Linear regression
ranking
regression analysis
predictions
Passive Cutaneous Anaphylaxis
root-mean-square errors
organic compounds
Organic compounds
genetic algorithms
correlation coefficients
Mean square error

Keywords

  • Ab initio
  • Computational chemistry
  • Electronic structure
  • Quantum chemistry

ASJC Scopus subject areas

  • Biophysics
  • Molecular Biology
  • Condensed Matter Physics
  • Physical and Theoretical Chemistry

Cite this

Prediction of the vapor pressure of some halogenated methyl-phenyl ether (anisole) compounds using linear and nonlinear QSPR methods. / Goudarzi, Nasser; Goodarzi, Mohammad.

In: Molecular Physics, Vol. 107, No. 15, 01.10.2009, p. 1615-1620.

Research output: Contribution to journalArticle

@article{1e9808dba0344f9eae35baf78901d933,
title = "Prediction of the vapor pressure of some halogenated methyl-phenyl ether (anisole) compounds using linear and nonlinear QSPR methods",
abstract = "In this work, several chemometric methods were applied for the modeling and prediction of the vapor pressure (-Log PL°) of halogenated methyl-phenyl ether (anisole) compounds. A genetic algorithm method designed for the selection of variables in the multiple linear regression (MLR) model and also the PCA-ranking technique were chosen as feature selection methods for building a least square support vector machine (LS-SVM) model to predict-Log PL°. The multiple linear regression method was used to build a linear relationship between molecular descriptors and the-Log PL° of these compounds. The LS-SVM was then utilized to construct the nonlinear quantitative structure-activity relationship models. The results obtained using LS-SVM were compared with MLR; this revealed that the LS-SVM model was much better than the GA-MLR model. The root-mean-square errors of the training set and the test set for the PC-ranking-LS-SVM model were 0. 2912 and 0.2427, and the correlation coefficients were 0.9259 and 0.9112, respectively. This paper provides a new and effective method for predicting-Log PL° of organic compounds, and also reveals that PC-ranking-LS-SVM can be used as a powerful chemometrics tool for quantitative structure-property relationship (QSPR) studies.",
keywords = "Ab initio, Computational chemistry, Electronic structure, Quantum chemistry",
author = "Nasser Goudarzi and Mohammad Goodarzi",
year = "2009",
month = "10",
day = "1",
doi = "10.1080/00268970902994293",
language = "English (US)",
volume = "107",
pages = "1615--1620",
journal = "Molecular Physics",
issn = "0026-8976",
publisher = "Taylor and Francis Ltd.",
number = "15",

}

TY - JOUR

T1 - Prediction of the vapor pressure of some halogenated methyl-phenyl ether (anisole) compounds using linear and nonlinear QSPR methods

AU - Goudarzi, Nasser

AU - Goodarzi, Mohammad

PY - 2009/10/1

Y1 - 2009/10/1

N2 - In this work, several chemometric methods were applied for the modeling and prediction of the vapor pressure (-Log PL°) of halogenated methyl-phenyl ether (anisole) compounds. A genetic algorithm method designed for the selection of variables in the multiple linear regression (MLR) model and also the PCA-ranking technique were chosen as feature selection methods for building a least square support vector machine (LS-SVM) model to predict-Log PL°. The multiple linear regression method was used to build a linear relationship between molecular descriptors and the-Log PL° of these compounds. The LS-SVM was then utilized to construct the nonlinear quantitative structure-activity relationship models. The results obtained using LS-SVM were compared with MLR; this revealed that the LS-SVM model was much better than the GA-MLR model. The root-mean-square errors of the training set and the test set for the PC-ranking-LS-SVM model were 0. 2912 and 0.2427, and the correlation coefficients were 0.9259 and 0.9112, respectively. This paper provides a new and effective method for predicting-Log PL° of organic compounds, and also reveals that PC-ranking-LS-SVM can be used as a powerful chemometrics tool for quantitative structure-property relationship (QSPR) studies.

AB - In this work, several chemometric methods were applied for the modeling and prediction of the vapor pressure (-Log PL°) of halogenated methyl-phenyl ether (anisole) compounds. A genetic algorithm method designed for the selection of variables in the multiple linear regression (MLR) model and also the PCA-ranking technique were chosen as feature selection methods for building a least square support vector machine (LS-SVM) model to predict-Log PL°. The multiple linear regression method was used to build a linear relationship between molecular descriptors and the-Log PL° of these compounds. The LS-SVM was then utilized to construct the nonlinear quantitative structure-activity relationship models. The results obtained using LS-SVM were compared with MLR; this revealed that the LS-SVM model was much better than the GA-MLR model. The root-mean-square errors of the training set and the test set for the PC-ranking-LS-SVM model were 0. 2912 and 0.2427, and the correlation coefficients were 0.9259 and 0.9112, respectively. This paper provides a new and effective method for predicting-Log PL° of organic compounds, and also reveals that PC-ranking-LS-SVM can be used as a powerful chemometrics tool for quantitative structure-property relationship (QSPR) studies.

KW - Ab initio

KW - Computational chemistry

KW - Electronic structure

KW - Quantum chemistry

UR - http://www.scopus.com/inward/record.url?scp=68849113311&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=68849113311&partnerID=8YFLogxK

U2 - 10.1080/00268970902994293

DO - 10.1080/00268970902994293

M3 - Article

VL - 107

SP - 1615

EP - 1620

JO - Molecular Physics

JF - Molecular Physics

SN - 0026-8976

IS - 15

ER -