Journal Achievements of Modern Radioelectronics №4 for 2021 г.
Article in number:
Method for automatic evaluation of the voice signals quality with low bit rate voice coding
Type of article: scientific article
DOI: https://doi.org/10.18127/j20700784-202104-06
UDC: 621.395
Authors:

V.A. Aladinskiy¹, S.V. Kuzminskiy²

1  LTD «STC» (St. Petersburg, Russia)

2 FGUE «18 CNII» DoD RF (Moscow, Russia)

Abstract:

Evaluation of the VSQ is an important problem. In known methods for еvaluation the quality of analog or digital voice signals with pulse-code modulation (PCM) are used. When automatic quality assessing, it is assumed that the parameters of voice signal are determined. Then the corresponding subjective assessment of VSQ is selected. Objective methods for quality assessing are obtained by comparing the reference signal and the signal transmitted through the communication channel. To eliminate the identified contradictions, it is proposed to use an objective method that provides the assessing of VSQ in the absence of a reference signal and without conversion to the PCM format, i.e. based on the analysis of the LBR voice digital stream. The analyzed digital stream is considered as a system of random variables, which is characterized by a set of mean values and a covariance matrix that make up the pattern of input implementation. The reference pattern also includes a set of mean values and a covariance matrix. The initial data for the formation of a reference pattern are digital streams formed according to a given protocol of LBR voice coding, having the highest VSQ. The proposed method is based on calculating the divergence between the reference pattern and input implementation pattern. At the training stage, several distorted samples with a known number of bit errors are formed from the reference sample. When comparing their patterns, the divergence values are calculated, which are put in compliance with the VSQ assessment. Two approaches have been identified for making the compliance: experimental and experimental-analytical. The choice of approach is determined by the presence of initial data characterizing the VSQ for particular vocoder. Then generated compliance is interpolated. At the assessment stage for each implementation is calculated the divergence between the explored pattern and the reference pattern, which is formed from the training sample. Based on the divergence, the VSQ assessment is calculated.

To test the proposed method a reference sample of digital streams obtained by compressing voice signals with a LPC-10-2400 vocoder was formed. Based on the initial data, which are presented in Recommendation-R F.1112-1, the compliance between the values of sound intelligibility and divergence was compiled by experimental-analytical approach. The obtained correspondence was interpolated by the least squares method using a third degree polynomial.

Pages: 40-47
For citation

Aladinskiy V.A., Kuzminskiy S.V. Method for automatic evaluation of the voice signals quality with low bit rate voice coding. Achievements of modern radioelectronics. 2021. V. 75. № 4. P. 40–47. DOI: https://doi.org/10.18127/j20700784-202104-06 [in Russian]

References
  1. GOST R 50840-95 Peredacha rechi po traktam svyazi. Metody otsenki kachestva, razborchivosti i uznavaemosti. M.: Gosstandart Rossii. 1995. [in Russian]
  2. Recommendation ITU-T P.800. Methods for subjective determination of transmission quality. 1996. URL: https://www.itu.int/rec/ dologin_pub.asp?lang=e&id=T-REC-P.800-199608-I!!PDF-E&type=items (data obrashcheniya: 18.11.2020).
  3. Recommendation ITU-T P.830. Subjective performance assessment of telephone-band and wide-band digital codecs. 1996. URL: https://www.itu.int/rec/dologin_pub.asp?lang=e&id=T-REC-P.830-199602-I!!PDF-E&type=items (data obrashcheniya: 18.11.2020).
  4. GOST R 51061-97 Sistemy nizkoskorostnoy peredachi rechi po tsifrovym kanalam. Parametry kachestva rechi i metody izmereniy. M.: Gosstandart Rossii. 1997. [in Russian]
  5. GOST 16600-72 Peredacha rechi po traktam radiotelefonnoy svyazi. Trebovaniya k razborchivosti rechi i metody artikulyatsionnykh izmereniy. M.: Standartinform. 2007. [in Russian]
  6. Recommendation ITU-T P.862. Perceptual Evaluation of Speech Quality. Geneva. 2001. URL: https://www.itu.int/rec/dologin_pub.asp?lang= e&id=T-REC-P.862-200102-I!!SOFT-ZST-E&type=items (data obrashcheniya: 20.11.2020).
  7. Recommendation ITU-T P.863. Perceptual objective listening quality assessment. 2018. URL: https://www.itu.int/rec/dologin_pub.asp? lang=e&id=T-REC-P.863-201803-I!!EPB-E&type=items (data obrashcheniya: 20.11.2020).
  8. Recommendation ITU-R F.1112-1. Digitized speech transmissions for systems operating below about 30 MHz. Radiocommunication Study Group 8, Question ITU-R 164/9. 1995. URL: https://www.itu.int/dms_pubrec/itu-r/rec/f/R-REC-F.1112-1-199510-I!!PDF-E.pdf (data obrashcheniya: 20.11.2020).
  9. Shelukhin O.I., Luk'yantsev N.F. Tsifrovaya obrabotka i peredacha rechi. Pod red. O.I. Shelukhina. M.: Radio i svyaz'. 2000.  [in Russian]
  10. Mikhaylov V.G., Zlatoustova L.V. Izmerenie parametrov rechi. Pod red. M.A. Sapozhkova. M.: Radio i svyaz'. 1987. [in Russian]
  11. Tu Dzh., Gonsales R. Printsipy raspoznavaniya obrazov. Per. s angl. M.: Mir. 1978. [in Russian]
  12. Fomin Ya.A., Tarlovskiy G.R. Statisticheskaya teoriya raspoznavaniya obrazov. M.: Radio i svyaz'. 1986. [in Russian]
  13. GOST R ISO 9921-2013 Otsenka rechevoy svyazi. M.: Standartinform. 2014. [in Russian]
  14. Aladinskiy V.A., Kuz'minskiy S.V. Metod formirovaniya priznakov raspoznavaniya protokolov nizkoskorostnogo kodirovaniya rechi. Naukoemkie tekhnologii. 2015. № 12. S. 20–25. [in Russian]
Date of receipt: 09.03.2021
Approved after review: 24.03.2021
Accepted for publication: 01.04.2021