350 rub
Journal №1 for 2012 г.
Article in number:
A study of speech signal informative cues
Authors:
A.S. Kolokolov, I.A. Lubinski, A.Yu. Mestcherakov, V.P. Yachno
Abstract:
A computer program for study of speech signal informative features is developed. At the core of the program is used the procedure of analysis-synthesis, which is complemented by a graphical editor of the signal dynamic spectrogram. Because of the editing there can be deleted the selected cursor frequency components of the speech signal. The dynamic spectrogram editing is implemented as a sequential procedure. At each step this procedure there can be suppressed by a variety of rectangular fragments of spectrum allocated to the frequency - time area. So removing by the spectrum editing certain frequency components of speech and listening signal after editing, one can draw at conclusions about the importance of certain frequency components in the perception of sound and bring to light its informative cues. By means of the developed program a study of informative cues of a speech signal is conducted. The results of the study showed that the phonetic information about the vowel is redundant and transmitted independently on several relatively narrow non-overlapping frequency bands. On this basis, concluded that the traditional description of vowels in terms of the first 2-3 formants needs revision. It is shown that phonetic quality of the fricative consonant can sharply change when its low-frequency components are removed.
Pages: 41-47
References
  1. Сапожков М.А., Михайлов В.Г. Вокодерная связь. М.: Радио и связь. 1983.
  2. Чистович Л.А., Венцов А.В., Гранстрем М.П. и др. Физиология речи. Восприятие речи человеком // В серии «Руководство по физиологии». Л.: Наука. 1976.
  3. Цзуе В.В. Лингвистический подход к автоматическому распознаванию речевых сигналов. ТИИЭР. 1985. Т. 73. № 11. С. 75-91.
  4. Zue V.W., Cole R.A. Experiments on spectrogram reading. Proc. ICASSP-79. 1979. P. 116-119.
  5. Potter R.K., Kopp G.A., Green H.C. Visible speech. Van Nostrand. NY. 1947.
  6. Рабинер Л.Р., Шафер Р.В. Цифровая обработка речевых сигналов.М.: Радио и связь. 1981.
  7. Фланаган Дж. Анализ, синтез и восприятие речи. М.: Связь 1968.
  8. Хэррис Ф.Дж. Использование окон при гармоническом анализе методом дискретного преобразования Фурье // ТИИЭР. 1978. Т. 66. С. 60-96.
  9. Дергач М.Ф. Статистика восприятия глухих взрывных и щелевых согласных в зависимости от их длительности // Вопросы статистики речи. Л.: ЛГУ. 1962. С. 40-45.
  10. Дукельский Н.И. Принципы сегментации речевого потока. М.-Л.: АН СССР. 1962. С. 253.
  11. Cooper F.S., Delattre P.C., Liberman A.M., et al. Some experiments on the perception of synthetic speech sounds // J. Acoust. Soc. Amer. 1952. V. 24. P. 597-606.
  12. Колоколов А.С., Крольи В.М., Любинский И.А., Мещеряков А.Ю., Яхно В.П. Способ исследования информативных признаков речевого сигнала // Наукоёмкие технологии. 2010. № 8. Т. 11. С. 10-15.