Object detection from video tubelets with Convolutional Neural Networks
Refereed conference paper presented and published in conference proceedings

Full Text

Times Cited

Other information
AbstractDeep Convolution Neural Networks (CNNs) have shown impressive performance in various vision tasks such as image classification, object detection and semantic segmentation. For object detection, particularly in still images, the performance has been significantly increased last year thanks to powerful deep networks (e.g. GoogleNet) and detection frameworks (e.g. Regions with CNN features (R-CNN)). The lately introduced ImageNet [6] task on object detection from video (VID) brings the object detection task into the video domain, in which objects' locations at each frame are required to be annotated with bounding boxes. In this work, we introduce a complete framework for the VID task based on still-image object detection and general object tracking. Their relations and contributions in the VID task are thoroughly studied and evaluated. In addition, a temporal convolution network is proposed to incorporate temporal information to regularize the detection results and shows its effectiveness for the task. Code is available at https://github.com/myfavouritekk/vdetlib.
All Author(s) ListKang K., Ouyang W., Li H., Wang X.
Name of Conference2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016
Start Date of Conference26/06/2016
End Date of Conference01/07/2016
Place of ConferenceLas Vegas
Country/Region of ConferenceUnited States of America
Detailed descriptionorganized by IEEE,
Volume Number2016-January
Pages817 - 825
LanguagesEnglish-United Kingdom

Last updated on 2020-02-09 at 01:25