Professor LI Hongsheng
Personal Information
Position and Department
Associate Professor, Department of Electronic Engineering
ORCiD
0000-0002-2664-7975
CUHK Research Outputs
3D Object Detection for Autonomous Driving: A Comprehensive Survey (2023)
Adaptive Zone-Aware Hierarchical Planner for Vision-Language Navigation (2023)
A Simple Baseline for Video Restoration With Grouped Spatial-Temporal Shift (2023)
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection (2023)
CORA: Adapting CLIP for Open-Vocabulary Detection With Region Prompting and Anchor Pre-Matching (2023)
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation (2023)
Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels (2023)
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions (2023)
Learning 3D Representations From 2D Pre-Trained Models via Image-to-Point Masked Autoencoders (2023)
Mining multi-center heterogeneous medical data with distributed synthetic learning (2023)
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers (2023)
PATS: Patch Area Transportation with Subdivision for Local Feature Matching (2023)
Prompt, Generate, Then Cache: Cascade of Foundation Models Makes Strong Few-Shot Learners (2023)
PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection (2023)
ReasonNet: End-to-End Driving With Temporal and Global Reasoning (2023)
Starting from Non-Parametric Networks for 3D Point Cloud Analysis (2023)
Teach-DETR: Better Training DETR With Teachers (2023)
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks (2023)
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks (2022)
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields (2022)
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers (2022)
Efficient Burst Raw Denoising with Variance Stabilization and Multi-frequency Denoising Network (2022)
FlowFormer: A Transformer Architecture for Optical Flow (2022)
Frozen CLIP Models are Efficient Video Learners (2022)
IDR: Self-Supervised Image Denoising via Iterative Data Refinement (2022)
Learning a Structured Latent Space for Unsupervised Point Cloud Completion (2022)
Learning Degradation Representations for Image Deblurring (2022)
MCMAE: Masked Convolution Meets Masked Autoencoders (2022)
MPPNet: Multi-Frame Feature Intertwining with Proxy Points for 3D Temporal Object Detection (2022)
PointCLIP: Point Cloud Understanding by CLIP (2022)
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training (2022)
RBGNet: Ray-based Grouping for 3D Object Detection (2022)
RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization (2022)
ST-Adapter: Parameter-Efficient Image-to-Video Transfer Learning (2022)
Structured Domain Adaptation With Online Relation Regularization for Unsupervised Person Re-ID (2022)
SymReg-GAN: Symmetric Image Registration with Generative Adversarial Networks (2022)
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification (2022)
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (2022)
Towards Robust Face Recognition with Comprehensive Search (2022)
UniFormer: Unified Transformer for Efficient Spatiotemporal Representation Learning (2022)
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP (2022)
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks (2022)
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation (2022)
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization (2021)
A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection (2021)
Container: Context Aggregation Network (2021)
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation (2021)
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network (2021)
DominoSearch: Find layer-wise fine-grained N:M sparse schemes from dense neural networks (2021)
Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation (2021)
Most Relevant Area
Artificial intelligence (Computer science)
Related Area
Digital signal processing (including speech and image processing) (Electronic engineering)
Last updated on 2024-03-09 at 16:09