Objects selection in multi-images by using hierarchical artificial neuronets

350 rub

Journal Neurocomputers №7 for 2010 г.

Article in number:

Keywords: hierarchical artificial neuronets object selection images recognition

Authors:

A. V. Timofeev, O. A. Derin

Abstract:

The approaches used for image recognition, are divided into algorithmic, analyzing the scene by algorithms and based on the description of the objects on the prototype, and on "neural networks", using artificial neural network, trained on sample images and do not require the algorithm. Each of these approaches has its advantages, in particular, an algorithmic approach allows the use of accumulated human expertise and neural network ? does not require a mathematical model of recognizable complex scene and the algorithm creation. The combination of these approaches and their advantages of traditional methods is impossible. Contemporary trends of the theory of image processing is a joint analysis of problems of image recognition and generation. A method for constructing a hierarchical neural network (HNN) recognition of objects in multi - images (MI), whose structure inverted structural diagram of a multimedia system ? a computer graphics processor (GP). Described the structural scheme of the HNN, performs a reverse sequence of actions as compared with GPs, described algorithms of its elements. This structure was implemented in the language C++ program as filter of the DirectX multimedia subsystem of the operating system Windows. HNN has been tested to detect MI CCTV systems on one railway station at Saint-Petersburg in order to detect suspicious persons in the waiting room. The system was developed under the R&D performed SPIIRAS (St. Petersburg). HNN showed high robustness with respect to the background image and the probability of detecting objects ? namely, the probability of recognition of people / vehicles / animals is not less than 0.8, including control the trajectory of motion ? not less than 0.75, the probability of detection of objects left behind ? not less than 0,6. Based on the proposed structure of the HNN is possible to construct three-dimensional video sensor of the surrounding space. The combination of the detected objects in neighboring frames in the MI allows for moving the robot to estimate distances to these objects and build a relief of the surrounding space. In this case, the stereobase is the distance traveled by the robot, i.e. stereobase is dynamic and can be changed by correcting the speed. It is proposed to divide the segmentation of the HNN on the supporting performing segmentation of the next image from scratch, without relying on the results of previous calculations (based on clustering by K-mean), and intermediate segmentation results of the previous correction algorithms (based on an algorithm of competitive learning neural network). An example of the results of the prototype soft-hardware device three-dimensional view, which provides automatic detection of surrounding objects and circumvent obstacles the mobile platform. The conclusion about the appropriateness of the proposed structure of the HNN for the selection of moving objects in the MI.

Pages: 45-56

References

Timofeev, A. V., Kosovskaya, T. M., Conditions of Effectiveness of Pattern Recognition Problem Solution Using Logical Level Descriptions of Classes // International Scientific Journal «Information Theories and Applications» (IJ ITA) 2008. V.15 P. 572-576.
Seul, M., O-Gorman, L., and Sammon, M. J., Practical algorithms for image analysis. Description, examples, and code. Cambrige University Press. 2000.
Mercier, G., Madani, K., A new on-line CMAC Algorithm for Real Time Applications // R.I. Informatics congress. St. Petersburg. May 1994. P. 37-42.
Хайкин С. Нейронные сети: полный курс: Пер. с англ. М.: Вильямc. 2006.
Кирьянов Д. В., Кирьянова Е. Н. Вычислительная физика. М.: Полибук Мультимедиа. 2006.
Змеу К. В., Ноткин Б. С., Дьяченко П. А.Безмодельное прогнозирующее инверсное нейроуправление // Мехатроника, автоматизация, управление. 2006. № 9. C. 8-15.
Волков В. В., Луизов А. В., Овчинников Б. В., Травникова. Н. П. Эргономика зрительной деятельности человека. Л.: Машиностроение. 1989.
Vatterli, M., Kovacevic,Е.. Wavelets and Subband Coding. PrenticeHall. New Jersey. 1995.
Тимофеев А. В., Дерин О. А. Анализ сложных мультиизображений в режиме реального времени // Россия. 2008.
№ 10. СПб.: Приборостроение. С. 25-30.
Андреев В. А., Дерин О. А., Гуленко И. Е., Тимофеев А. В. Нейросетевые технологии анализа мультиизображений и методы видеозахвата и анимации движений // Мат. Международной научно-технической мультиконференции «Актуальные проблемы информационно-компьютерных технологий, мехатроники и робототехники - 2009 (ИКТМР-2009)», с. Дивноморское Геленджикского района Краснодарского края, Россия, 28 сентября - 3 октября 2009 г. 2009. С. 298-301.
Тимофеев А. В., Дерин О. А. Нейросетевое распознавание сложных стерео и мультиизображений // Доклады 5-й научной конференции «Управление и информационные технологии» (УИТ-2008). (СПб. 14-16 октября 2008). Том. 1. С. 150-153.
Gall, D. J., The MPEG Video Compression Algorithm // Signal Processing Image Communication. 1992. Vol. 4, No. 2.
P. 129 - 140.
Тимофеев А. В., Дерин О. А. Нейросетевые технологии обнаружения и оценки потенциальной террористической опасности // Сб. трудов Третьей Всероссийской научно-практической конференции «Перспективные системы и задачи управления» Домбай, Россия. 7-13 апреля 2008. С. 45-48.
Тимофеев А., Дерин О. Распознавание сложных стерео и мульти-изображений в реальном времени // Сб. тр. XIVth Intelligent Technologies and Applications. Болгария, Варна. 2008. V. 1. P. 149-152.
Тимофеев А. В., Дерин О. А. Трехмерный сенсор на основе стереозрения и лазерного дальномера // Мат. 5-й научно-технической конференции «Мехатроника, автоматизация, управление» (МАУ-2008, Санкт-Петербург, 14-16 октября 2008 г.). 2008. С. 298-301.