350 rub
Journal Science Intensive Technologies №8 for 2009 г.
Article in number:
SPECTRUM SIGNAL PROCESSING IN THE HEARING ANALYZER
Authors:
A.S. Kolokolov, V.M. Krol, A.Yu. Mestcherakov, I.A. Lubinski, V.P. Yachno
Abstract:
On the basis of the analysis of the data physiologists and psychophysics the possible hearing mechanisms providing stability of perception of speech in the presence of frequency distortions and background noise were revealed. At the heart of them three kinds of a bandpass filtering of an acoustical spectrum are used. The first two types of a bandpass filtering are realized by means of lateral inhibition processes with symmetric and asymmetrical neuron inhibition connections. Because of this filtering local maxima and sharp slopes are emphasized and brought to light in a spectral pattern of a sound. It is supposed that these selected spectral cues contain the important information for phoneme recognition. The third type of a bandpass filtering is realized on the basis of effect of the delayed inhibition and allows to detect local no uniformities of a speech spectral pattern in time. It is supposed that these detected hearing system sharp spectrum changes are used for the segmentation of a speech signal on the consecutive quasistationary segments representing single phonemes of a speech signal, and also for speech events duration estimation. It is well known that duration plays the important role in allocation in speech of stressed vowels, and also in division of explosive and fricative consonants. The study results can be used by working out the methods of signal processing in frequency domain as applied to problems of front end processing and analysis of speech and acoustic signals.
Pages: 42-47
References
  1. Применение цифровой обработки сигналов / под ред. Э. Оппенгейма. М.: Мир. 1980. 552 с.
  2. Рабинер Л.Р. Цифровая обработка речевых сигналов. М.: Радио и связь. 1981. 495с.
  3. Оппенгейм А., Шафер Р. Цифровая обработка сигналов. М.:Техносфера. 2006. 848с.
  4. Springer Handbook of Speech Processing / J. Benesty, M.M. Sondhi, Y. Huang (Eds.). Springer-Verlag Berlin Heidelberg. 2008.
  5. Левинсон С.Е. Структурные методы автоматического распознавания речи // ТИИЭР. 1985. Т. 73. № 11. С.100 - 128.
  6. Zue V.W., Cole R.A. Experiments on spectrogram reading / In Proc. ICASSP-79. 1979. P.116 - 119.
  7. Цзуе В.В. Лингвистический подход к автоматическому распознаванию речевых сигналов // ТИИЭР. 1985. Т.73. № 11. C.75 - 91.
  8. Хэррис Ф.Дж. Использование окон при гармоническом анализе методом дискретного преобразования Фурье // ТИИЭР. 1978. Т.66. C. 60 - 96.
  9. Фант Г. Акустическая теория речеобразования. М.: Наука. 1964.
  10. Lippmann P.L. Speech recognition by machines and humans.Speech Communication. 1997. V.22. P.1-15.
  11. Колоколов А.С. Предварительная обработка сигнала для распознавания речи // АиТ. 2002. № 3. С. 190 - 198.
  12. Traunmüller H. Analytical expressions for the tonotopic sensory scale // J. Acoust. Soc. Amer. 1990. V.88. N.1. P.97 - 100.
  13.  Zwicker E., Terhardt E. Analytical expressions for critical-band rate and critical bandwidth as a function of frequency //
    J. Acoust. Soc. Amer. 1980. V.68. N.5. P.1523 - 1525.
  14. Физиологиясенсорныхсистем. Ч.2. Всерии «Руководствопофизиологии» / подред. Я.А. Альтман, А.В. Бару,
    И.А. Вартанянидр.Л.: Наука. 1972.
  15. Чистович Л.А., Венцов А.В., Гранстрем М.П. и др.Физиология речи. Восприятие речи человеком / В серии «Руководство по физиологии». Л.: Наука. 1976.
  16. Позин Н.В. Моделирование нейронных структур. М.: Наука. 1967.
  17. Любинский И.А., Позин Н.В., Яхно В.П. Анализ моделей однородного нейронного слоя с латеральными связями // АиТ. 1967. №10. С.168 - 181.
  18. Speech Processing in the Auditory System / Greenberg S., Ainsworth W.A., Popper A.N., Fay A.A. (Eds.). Springer. 2004.