recognition of images
V.G. Baranov, V.R. Milov, J.H. Zaripova, A.Yu. Epshtein
In recent years analytical information processing tools usually include the methods of data mining. Among those methods classi-fication procedures fill a highly important place. This article presents an approach to the intellectualization of classification pro-cedures. With that end in view the performance of applied classification procedures is estimated. Those estimations are formed by experts using either the results of solving the applied tasks or on basis of statistical modelling. Сlassification tasks are described by series of characteristics, including the number of classes, the number of attributes, the size of the learning sampling, etc. As a result of learning for different types of tasks the preferable types of classiers and their learning procedures are determined.
In this article the special attention is paid to the methods of classifier performance estimation. For the task of binary classification the dependences of alpha error likelihood and beta error likelihood from the threshold value in decision rule are obtained. Using those likelihoods receiver operating characteristics (ROC) graphs are plotted. For the classifier performance comparing the use of the area under an ROC Curve (AUC) is suggested. In addition to it, an integral parameter of classifier precision is suggested, based on the value of empirically determined risk, averaged by the optimal threshold value. The comparing of classifiers based on linear discriminant analysis, quadratic discriminant analysis, probabilistic neural network, and radial basis function network is carried out by modeling.