Abstract
A new approach to the problem of the index description of molecular systems is presented. The capabilities of iterated line (edge) graphs for characterizing the properties of saturated hydrocarbons were investigated. It is demonstrated that a single selected molecular descriptor (graph-theoretical (topological) or informational), calculated for a sequence of nested line graphs, provides a fairly reliable progressive set of regression equations. Hence, the presented approach solves the problem of descriptor set reduction at least partially. The corresponding software package (QUASAR) has been implemented in the Python 3 programming language. The physico-chemical properties of octane isomers were chosen as a test example. Among the properties under investigation are boiling point, critical temperature, critical pressure, enthalpy of vaporization, enthalpy of formation, surface tension, and viscosity. Simple linear regression equations that include one, two, or three parameters have been obtained. The predictive ability of the equations has been investigated using internal validation tests. Both leave-one-out (LOO) validation and Y-scrambling tests indicate that the obtained equations are adequate. For instance, the best regression equation for boiling point is characterized by a determination coefficient R2 = 0.943 and a LOO value Q2 = 0.918, while for the Y-scrambling test Q2y-scr