In this paper, we present a video text binarization algorithm using connected component (CC) level filtering. We develop a three-step algorithm: estimate the polarity of text, extract text/non-text CCs with a retinex filtering method, and filter out non-text components. The proposed non-text CCs filtering is considering CCs stroke width and contrast. We conducted experiments on subtitles in TV programs and movies. Experimental results showed that the method outperforms conventional methods in terms of a pixel-level accuracy and a character/word recognition rate.
This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2017R1A1A1A05001063).? This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT & Future Planning (NRF-2017R1A1A1A05001063).