The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
Le traitement des sous-titres (de cinéma) qui ajoute un texte descriptif sur une séquence d'images est une fonction de manipulation vidéo importante qu'un éditeur vidéo doit prendre en charge. Cet article propose une approche efficace de domaine compressé MC-DCT pour insérer la légende dans le flux vidéo compressé MPEG. Il ajoute essentiellement les blocs DCT de l'image de légende aux blocs DCT correspondants des trames d'entrée un par un dans le domaine MC-DCT comme dans [6]. Cependant, la force de l'image de légende est ajustée dans le domaine DCT pour empêcher les coefficients DCT résultants de dépasser la valeur maximale autorisée en MPEG. Afin d'ajuster la force de l'image de légende de manière adaptative, nous devons connaître la valeur exacte en pixels de l'image d'entrée. C'est une tâche difficile dans le domaine DCT. Nous proposons un schéma d'approximation pour les valeurs de pixels dans lequel la valeur DC d'un bloc est utilisée comme valeur de pixel attendue pour tous les pixels de ce bloc. Bien que cette approximation puisse conduire à certaines erreurs dans la zone de légende, elle fournit toujours une qualité d'image relativement élevée dans la zone sans légende, alors que le temps de traitement est environ 4.9 fois plus rapide que la méthode de décodage-sous-titrage-réencodage.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copier
Jongho NANG, Seungwook HONG, Ohyeong KWON, "An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain" in IEICE TRANSACTIONS on Communications,
vol. E84-B, no. 8, pp. 2292-2300, August 2001, doi: .
Abstract: The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
URL: https://global.ieice.org/en_transactions/communications/10.1587/e84-b_8_2292/_p
Copier
@ARTICLE{e84-b_8_2292,
author={Jongho NANG, Seungwook HONG, Ohyeong KWON, },
journal={IEICE TRANSACTIONS on Communications},
title={An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain},
year={2001},
volume={E84-B},
number={8},
pages={2292-2300},
abstract={The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.},
keywords={},
doi={},
ISSN={},
month={August},}
Copier
TY - JOUR
TI - An Efficient Caption Insertion Scheme for MPEG Video in MC-DCT Compressed Domain
T2 - IEICE TRANSACTIONS on Communications
SP - 2292
EP - 2300
AU - Jongho NANG
AU - Seungwook HONG
AU - Ohyeong KWON
PY - 2001
DO -
JO - IEICE TRANSACTIONS on Communications
SN -
VL - E84-B
IS - 8
JA - IEICE TRANSACTIONS on Communications
Y1 - August 2001
AB - The (cinema) caption processing that adds descriptive text on a sequence of frames is an important video manipulation function that a video editor should support. This paper proposes an efficient MC-DCT compressed domain approach to insert the caption into the MPEG compressed video stream. It basically adds the DCT blocks of the caption image to the corresponding DCT blocks of the input frames one by one in the MC-DCT domain as in [6]. However, the strength of the caption image is adjusted in the DCT domain to prevent the resulting DCT coefficients from exceeding the maximum value allowed in MPEG. In order to adjust the strength of the caption image adaptively we need to know the exact pixel value of the input image. This is a difficult task in DCT domain. We propose an approximation scheme for the pixel values in which the DC value of a block is used as the expected pixel value for all pixels in that block. Although this approximation may lead to some errors in the caption area, it still provides a relatively high image quality in the non-caption area, whereas the processing time is about 4.9 times faster than the decode-captioning-reencode method.
ER -