• 沒有找到結果。

REAL-TIME VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON CHANGE DETECTION AND BACKGROUND UPDATING

N/A
N/A
Protected

Academic year: 2021

Share "REAL-TIME VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON CHANGE DETECTION AND BACKGROUND UPDATING"

Copied!
1
0
0

加載中.... (立即查看全文)

全文

(1)

International Journal of Innovative

Computing, Information and Control ICIC International c°2009 ISSN 1349-4198

Volume 5, Number 7, July 2009 pp. 1797—1810

REAL-TIME VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON CHANGE DETECTION AND BACKGROUND UPDATING

Tsong-Yi Chen1, Thou-Ho (Chao-Ho) Chen1, Da-Jinn Wang2

and Yung-Chuen Chiou1

1Department of Electronic Engineering

National Kaohsiung University of Applied Sciences 415, Chien Kung Rd., Kaohsiung 807, Taiwan

{ chentso; thouho }@cc.kuas.edu.tw

2Department of Information Management

National Kaohsiung Marine University

No.142, Haijhuan Rd., Nanzih District, Kaohsiung City 81143, Taiwan wangdaj@mail.nkmu.edu.tw

Received February 2008; revised July 2008

Abstract. This paper proposes an efficient real-time video object segmentation algo-rithm based on change detection and background updating. The basic idea is to use the change detection technique to analyze temporal information between successive frames for extracting the change region. Then, the combination of frame difference mask and background subtraction mask is employed to quickly acquire the initial object mask and further solve the uncovered background problem and still object problem. Moreover, the novel hierarchical boundary refinement is introduced to overcome the shadow influence and residual background problem. The objective evaluations of the proposed algorithm demonstrate that the spatial accuracy can be maintained above 95% for most normal cases.

Keywords: Video object segmentation, Change detection, Background updating

1. Introduction. Conventional video-coding standards, such as H.261/H.263 and MPEG-1/MPEG-2 that employ the frame-by-frame coding policy, can’t provide high-level fea-ture of video contents. The MPEG-4 coding standard [1] has introduced the concept of a video-object plane (VOP) as the basic coding element for supporting visual multime-dia communication and will be applied to many multimemultime-dia content descriptions [2], and intelligent signal processing, such as, digital video surveillance system and video confer-ence. Each VOP contains the shape and texture information of semantically meaningful object in the scene. In order to encode video objects using object-by-object in video sequences rather than frame-by-frame and achieve the content-based manipulation for video content, automatic video segmentation will play an important role of deriving VOP from video sequences in MPEG-4 video part. For many practicable multimedia applica-tions of real-time demand, a fast and efficient video segmentation algorithm will be very important.

Basically, video object segmentation algorithm can be coarsely classified into two types, semi-automatic and automatic. In semi-automatic algorithm [3,4], user must first defines a high level semantic object of interest to be segmented and detect the object’s boundary in keyframe by way of manual. Then, the extracted object of interest region is used to segment this object occurred in the video sequences. The user interaction based algorithm can give better segmentation results than automatic algorithm, but it may be unsuitable in real-time applications due to the fact that an interest object needs to be specified by

參考文獻

相關文件

 Tying in with the modules and topics in the school-based English Language curriculum, schools are encouraged to make use of the lesson plans in the resource

Vetter, Reanimating Faces in Images and Video, EUROGRAPHICS 2003. Seidel, Exchanging Faces in Images,

– Assume that there is no concept with significan t correlation to Mountain..

– evolve the algorithm into an end-to-end system for ball detection and tracking of broadcast tennis video g. – analyze the tactics of players and winning-patterns, and hence

ƒ Visit the following page and select video #3 (How warrant works) to show students a video introducing derivative warrant, then discuss with them the difference of a call and a

Trace of display recognition and mental state inference in a video labelled as undecided from the DVD [5]: (top) selected frames from the video sampled every 1 second; (middle) head

Secondly, the key frame and several visual features (soil and grass color percentage, object number, motion vector, skin detection, player’s location) for each shot are extracted and

Zhang, “A flexible new technique for camera calibration,” IEEE Tran- scations on Pattern Analysis and Machine Intelligence,