A spatial-temporal approach for video caption detection and recognition
Publication in refereed journal


引用次數
替代計量分析
.

其它資訊
摘要We present a video caption detection and recognition system based on a fuzzy-clustering neural network (FCNN) classier. Using a novel caption-transition detection scheme we locate both spatial and temporal positions of video captions with high precision and efficiency. Then employing several new character segmentation and binarization techniques, we improve the Chinese video-caption recognition accuracy from 13% to 86% on a set of news video captions. As the first attempt on Chinese video-caption recognition, our experiment results are very encouraging.
著者Tang X, Gao XB, Liu JZ, Zhang HJ
期刊名稱IEEE Transactions on Neural Networks
出版年份2002
月份7
日期1
卷號13
期次4
出版社IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
頁次961 - 971
國際標準期刊號1045-9227
電子國際標準期刊號1941-0093
語言英式英語
關鍵詞Chinese caption detection; fuzzy clustering neural networks (FCNNs); video indexing; video OCR; video shot segmentation
Web of Science 學科類別Computer Science; Computer Science, Artificial Intelligence; Computer Science, Hardware & Architecture; Computer Science, Theory & Methods; Engineering; Engineering, Electrical & Electronic

上次更新時間 2020-20-09 於 03:21