350 rub
Journal Science Intensive Technologies №8 for 2010 г.
Article in number:
A method for study of speech signal informative cues
Authors:
A.S. Kolokolov, V.M. Krol, A.Yu. Mestcherakov, I.A. Lubinski, V.P. Yachno
Abstract:
The method for research of informative phonetic cues of the speech signal, combining the analysis and synthesis procedures is offered. It allows to check up influence of separate spectrum fragments of a speech signal on its perception. The method is based on the editing of the dynamic spectrogram of the speech signal, which concludes with removing the separate spectrum fragment and the subsequent restoration of the edited signal in time domain. The conclusion about importance of a concrete fragment of a spectrum in perception of a speech signal becomes on the basis of listening of the initial and restored signals. Spectrum editing is realised in the form of consecutive procedure on which each step the various rectangular fragments of a spectrum allocated for planes "frequency - time" can be suppressed. Operation of removal of fragments of the dynamic spectrogram can be added by insert possibility in the set area of the spectrogram of a fragment from the same or other spectrogram. However it is obvious that this operation is limited only to an insert of noise fragments. It is caused by that at an insert of quasiharmonious fragments in quasiharmonious segments of a signal discrepancy of frequencies of their basic tones leads to specific distortions. They are shown in occurrence beats with the frequency equal to a difference of the basic tones, listened in the restored signal. Their presence increases roughness of sounding of a signal and can complicate listening interpretation. The proposed method can be easy realized by means of modern computer engineering on the basis of fast Fourier transform and represents convenient and flexible tool for speech signal research.
Pages: 10-15
References
  1. Фант Г.Акустическая теория речеобразования. М.: Наука. 1964.
  2. Чистович Л.А., Венцов А.В., Гранстрем М.П. и др. Физиология речи. Восприятие речи человеком / В серии «Руководство по физиологии». Л., Наука. 1976.
  3. Zue V.W., Cole R.A.Experiments on spectrogram reading // Proc. ICASSP-79. 1979. P.116-119.
  4. ЦзуеВ.В.Лингвистическийподходкавтоматическому распознаванию речевых сигналов // ТИИЭР. 1985. Т. 73. № 11. С.75-91.
  5. Potter R.K., Kopp G.A., Green H.C.Visible speech. VanNostrand. NY. 1947.
  6. Рабинер Л.Р., Шафер Р.В. Цифровая обработка речевых сигналов. М.: Радио и связь. 1981.
  7. Фланаган Дж.Анализ, синтез и восприятие речи. М.: Связь. 1968.
  8. Дергач М.Ф.Статистика восприятия глухих взрывных и щелевых согласных в зависимости от их длительности // Вопросы статистики речи. Л.: ЛГУ. 1962. С. 40-45.
  9. Дукельский Н.И. Принципы сегментации речевого потока. М., Л.: АНСССР. 1962.
  10. Cooper F.S., Delattre P.C., Liberman A.M., et al. Some experiments on the perception of synthetic speech sounds // J. Acoust. Soc. Amer. 1952. V.24. P. 597-606.
  11. Хэррис Ф.Дж. Использование окон при гармоническом анализе методом дискретного преобразования Фурье // ТИИЭР. 1978. Т.66. С.60-96.