Binary classification of chalcone derivatives with LDA or KNN based on their antileishmanial activity and molecular descriptors selected using the Successive Projections Algorithm feature-selection technique

Mohammad Goodarzi, Wouter Saeys, Mario Cesar Ugulino De Araujo, Roberto Kawakami Harrop Galvão, Yvan Vander Heyden

Research output: Contribution to journalArticlepeer-review

23 Scopus citations

Abstract

Chalcones are naturally occurring aromatic ketones, which consist of an α-, β-unsaturated carbonyl system joining two aryl rings. These compounds are reported to exhibit several pharmacological activities, including antiparasitic, antibacterial, antifungal, anticancer, immunomodulatory, nitric oxide inhibition and anti-inflammatory effects. In the present work, a Quantitative Structure-Activity Relationship (QSAR) study is carried out to classify chalcone derivatives with respect to their antileishmanial activity (active/inactive) on the basis of molecular descriptors. For this purpose, two techniques to select descriptors are employed, the Successive Projections Algorithm (SPA) and the Genetic Algorithm (GA). The selected descriptors are initially employed to build Linear Discriminant Analysis (LDA) models. An additional investigation is then carried out to determine whether the results can be improved by using a non-parametric classification technique (One Nearest Neighbour, 1NN). In a case study involving 100 chalcone derivatives, the 1NN models were found to provide better rates of correct classification than LDA, both in the training and test sets. The best result was achieved by a SPA-1NN model with six molecular descriptors, which provided correct classification rates of 97% and 84% for the training and test sets, respectively.

Original languageEnglish (US)
Pages (from-to)189-195
Number of pages7
JournalEuropean Journal of Pharmaceutical Sciences
Volume51
Issue number1
DOIs
StatePublished - 2014

Keywords

  • Antileishmanial activity
  • Genetic Algorithm
  • Linear Discriminant Analysis
  • One Nearest Neighbour
  • Successive Projections Algorithm

ASJC Scopus subject areas

  • Pharmaceutical Science

Fingerprint

Dive into the research topics of 'Binary classification of chalcone derivatives with LDA or KNN based on their antileishmanial activity and molecular descriptors selected using the Successive Projections Algorithm feature-selection technique'. Together they form a unique fingerprint.

Cite this