The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
This paper uses correlation properties to develop two efficient coding schemes for speech line spectrum frequency (LSF) parameters. The first scheme (1D KL) exploits intraframe correlation and is based on the one-dimensional Karhunen-Loeve (KL) transform. The second scheme, which requires some coding delay in order to also exploit interframe correlation, uses either a two-dimensional KL transform (2D KL) in the frequency domain or a one-dimensional KL transform combined with DPCM in the time domain. Moreover, because the KL transform is optimal only globally and is therefore sensitive to changes in the input data statistics, two further adaptive transform coding systems are also investigated. The performance of all systems is evaluated at different bit rates and compared. The results show that using the KL transform to exploit intraframe and interframe correlation yields gains of 3 and 4 bits per speech frame, respectively.
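The intraframe idea behind the 1D KL scheme can be illustrated with a minimal sketch. It is not the paper's implementation: the synthetic frames, the vector dimension, the Jacobi eigen-solver, and every function name below are assumptions chosen only to show how a KL transform, built from the covariance of training vectors, decorrelates the components of each frame. Quantization and bit allocation are left out.

```python
import math
import random


def mean_and_covariance(frames):
    """Return the mean vector and sample covariance matrix of the frames."""
    n, p = len(frames), len(frames[0])
    mean = [sum(f[j] for f in frames) / n for j in range(p)]
    cov = [[sum((f[i] - mean[i]) * (f[j] - mean[j]) for f in frames) / (n - 1)
            for j in range(p)] for i in range(p)]
    return mean, cov


def jacobi_eigen(sym, max_sweeps=50, tol=1e-24):
    """Eigen-decompose a symmetric matrix with cyclic Jacobi rotations.

    Returns (eigenvalues, vecs); column k of vecs is the k-th eigenvector.
    """
    p = len(sym)
    a = [row[:] for row in sym]
    v = [[float(i == j) for j in range(p)] for i in range(p)]
    for _sweep in range(max_sweeps):
        off = sum(a[i][j] ** 2 for i in range(p) for j in range(p) if i != j)
        if off < tol:
            break
        for i in range(p - 1):
            for j in range(i + 1, p):
                if a[i][j] == 0.0:
                    continue
                # Angle that zeroes a[i][j]: tan(2*theta) = 2*a_ij / (a_jj - a_ii),
                # taken on the branch with |theta| <= pi/4.
                d = a[j][j] - a[i][i]
                sign = 1.0 if d >= 0.0 else -1.0
                theta = 0.5 * math.atan2(2.0 * a[i][j] * sign, abs(d))
                c, s = math.cos(theta), math.sin(theta)
                for k in range(p):  # column update: A <- A J, V <- V J
                    a[k][i], a[k][j] = c * a[k][i] - s * a[k][j], s * a[k][i] + c * a[k][j]
                    v[k][i], v[k][j] = c * v[k][i] - s * v[k][j], s * v[k][i] + c * v[k][j]
                for k in range(p):  # row update: A <- J^T A
                    a[i][k], a[j][k] = c * a[i][k] - s * a[j][k], s * a[i][k] + c * a[j][k]
    return [a[k][k] for k in range(p)], v


def klt_basis(cov):
    """Return eigenvalues (descending) and the matching eigenvectors as rows."""
    vals, vecs = jacobi_eigen(cov)
    p = len(vals)
    order = sorted(range(p), key=lambda k: -vals[k])
    return [vals[k] for k in order], [[vecs[r][k] for r in range(p)] for k in order]


def klt_forward(frame, mean, basis):
    """Project a mean-removed frame onto the KL basis."""
    return [sum(b[j] * (frame[j] - mean[j]) for j in range(len(frame))) for b in basis]


def klt_inverse(coeffs, mean, basis):
    """Rebuild a frame from its KL coefficients."""
    return [mean[j] + sum(coeffs[k] * basis[k][j] for k in range(len(coeffs)))
            for j in range(len(mean))]


def synthetic_frames(n=200, p=4, seed=0):
    """Hypothetical stand-in for LSF vectors with correlated neighbouring components."""
    rng = random.Random(seed)
    frames = []
    for _frame in range(n):
        x = [rng.gauss(0.0, 1.0)]
        for _comp in range(p - 1):
            x.append(0.9 * x[-1] + 0.3 * rng.gauss(0.0, 1.0))
        frames.append(x)
    return frames


frames = synthetic_frames()
mean, cov = mean_and_covariance(frames)
eigvals, basis = klt_basis(cov)
coeffs = [klt_forward(f, mean, basis) for f in frames]
```

In a coder along these lines, the coefficients in `coeffs` would be quantized instead of the raw components. Because most of the variance concentrates in the leading coefficients, the trailing ones can be given fewer bits, which is the kind of saving the abstract attributes to exploiting intraframe correlation.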
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Hai Le VU, "Efficient Transform Coding Schemes for Speech LSFs" in IEICE TRANSACTIONS on Fundamentals,
vol. E82-A, no. 4, pp. 580-587, April 1999.
Abstract: In this paper, the correlation properties are used to develop two efficient encoding schemes for speech line spectrum frequency (LSF) parameters. The first scheme (1D KL), which exploits the intraframe correlation, is based on one-dimensional Karhunen-Loeve (KL) transformation; the second scheme, which requires some coding delays to further utilize the interframe correlation, uses two-dimensional (2D KL) transform in the frequency domain or one-dimensional KL transform co-operating with DPCM in the time domain. Moreover, since the KL transform is globally optimal, which is sensitive to the change of input data statistics, further two adaptive transform coding systems are also investigated in this paper. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the gain of using KL transformation to exploit the intraframe and interframe correlation is 3 and 4 bits/speech frame, respectively.
URL: https://global.ieice.org/en_transactions/fundamentals/10.1587/e82-a_4_580/_p
@ARTICLE{e82-a_4_580,
author={Hai Le VU},
journal={IEICE TRANSACTIONS on Fundamentals},
title={Efficient Transform Coding Schemes for Speech LSFs},
year={1999},
volume={E82-A},
number={4},
pages={580-587},
abstract={In this paper, the correlation properties are used to develop two efficient encoding schemes for speech line spectrum frequency (LSF) parameters. The first scheme (1D KL), which exploits the intraframe correlation, is based on one-dimensional Karhunen-Loeve (KL) transformation; the second scheme, which requires some coding delays to further utilize the interframe correlation, uses two-dimensional (2D KL) transform in the frequency domain or one-dimensional KL transform co-operating with DPCM in the time domain. Moreover, since the KL transform is globally optimal, which is sensitive to the change of input data statistics, further two adaptive transform coding systems are also investigated in this paper. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the gain of using KL transformation to exploit the intraframe and interframe correlation is 3 and 4 bits/speech frame, respectively.},
month={April},}
TY - JOUR
TI - Efficient Transform Coding Schemes for Speech LSFs
T2 - IEICE TRANSACTIONS on Fundamentals
SP - 580
EP - 587
AU - Hai Le VU
PY - 1999
JO - IEICE TRANSACTIONS on Fundamentals
VL - E82-A
IS - 4
JA - IEICE TRANSACTIONS on Fundamentals
Y1 - April 1999
AB - In this paper, the correlation properties are used to develop two efficient encoding schemes for speech line spectrum frequency (LSF) parameters. The first scheme (1D KL), which exploits the intraframe correlation, is based on one-dimensional Karhunen-Loeve (KL) transformation; the second scheme, which requires some coding delays to further utilize the interframe correlation, uses two-dimensional (2D KL) transform in the frequency domain or one-dimensional KL transform co-operating with DPCM in the time domain. Moreover, since the KL transform is globally optimal, which is sensitive to the change of input data statistics, further two adaptive transform coding systems are also investigated in this paper. The performance of all systems for different bit rates is investigated and adequate comparisons are made. It is shown that the gain of using KL transformation to exploit the intraframe and interframe correlation is 3 and 4 bits/speech frame, respectively.
ER -