Temporal Video Scene Segmentation By Fused Bags-of-Features

  • Rodrigo Mitsuo Kishi USP-UFMS
  • Tiago Henrique Trojahn USP-IFSP
  • Rudinei Goularte USP

Resumo

Temporal segmentation of video into semantically coherent scenes is a fundamental step to enhance video operations like browsing, retrieval and recommendation. Available automatic scene segmentation methods in the literature are still far, in terms of efficacy, from reasonable practical application requirements. Towards to lowering this gap, this paper presents a new multimodal early fusion based scene segmentation method, which extends the classical and powerful singlemodal bags-of-features latent semantics discriminative capability to a multimodal paradigm. This approach was designed to refine the latent semantics from singlemodal data by identifying and representing audiovisual patterns while still preserving singlemodal visual/aural words patterns. Experiments have been performed over a publicly available dataset where the proposed method achieved higher average values for the FCO metric than previous state-of-the-art approaches.
Publicado
2018-10-16
Como Citar
KISHI, Rodrigo Mitsuo; TROJAHN, Tiago Henrique; GOULARTE, Rudinei. Temporal Video Scene Segmentation By Fused Bags-of-Features. Anais do Simpósio Brasileiro de Sistemas Multimídia e Web (WebMedia), [S.l.], p. 173-180, out. 2018. Disponível em: <https://sol.sbc.org.br/index.php/webmedia/article/view/4573>. Acesso em: 14 maio 2024.

##plugins.generic.recommendByAuthor.heading##

<< < 1 2