[Back]


Publications in Scientific Journals:

K. Varmuza, P. Filzmoser, B. Liebmann, M. Dehmer:
"Redundancy analysis for characterizing the correlation between groups of variables - Applied to molecular descriptors";
Chemometrics and Intelligent Laboratory Systems, 117 (2012), 31 - 41.



English abstract:
Redundancy analysis (RA) estimates the extent of linear relationships between blocks of variables that are given for a set of objects (samples). RA has only rarely been used in chemometrics. Basic principles and limits of RA are discussed, and RA is briefly compared with canonical correlation analysis (CCA) and partial least-squares (PLS2) regression. The significance of a redundancy index is estimated by permutation tests. For PLS2, an index determining the similarity of variable blocks can be derived that is equivalent to the canonical measure of correlation, CMC.

RA is applied to a set of 3708 molecular descriptors (created by software Dragon) for 6458 chemical structures (AMES database). The 27 descriptor groups are characterized by their redundancy indices, which allow a comparison of their multivariate information content. The results guide the selection of the most different descriptor groups, which perform better in a discrimination task (classification of mutagenicity) than the entire groups.


"Official" electronic version of the publication (accessed through its Digital Object Identifier - DOI)
http://dx.doi.org/10.1016/j.chemolab.2011.05.013

Electronic version of the publication:
http://www.sciencedirect.com/science/article/pii/S0169743911001158


Created from the Publication Database of the Vienna University of Technology.