

Open-Vocabulary Semantic Segmentation with Mask-adapted CLIPĮfficient Semantic Segmentation by Altering Resolutions for Compressed Videos Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation Token Contrast for Weakly-Supervised Semantic Segmentationĭelivering Arbitrary-Modal Semantic Segmentation Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models Interactive Segmentation as Gaussian Process Classification MP-Former: Mask-Piloted Transformer for Image Segmentation Multimodal Industrial Anomaly Detection via Hybrid Fusion Lossy Compression for Robust Unsupervised Time-Series Anomaly Detection Iterative Next Boundary Detection for Instance Segmentation of Tree Rings in Microscopy Images of Shrub Cross Sectionsĭiversity-Measurable Anomaly Detectionīlock Selection Method for Using Feature Norm in Out-of-distribution Detection Texture-guided Saliency Distilling for Unsupervised Salient Object DetectionīEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal FusionĬonQueR: Query Contrast Voxel-DETR for 3D Object Detection(3D 目标检测的Query Contrast Voxel-DETR)ĭetecting Human-Object Contact in Images Uni3D: A Unified Baseline for Multi-dataset 3D Object Detection MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection Virtual Sparse Convolution for Multimodal 3D Object Detection X3KD: Knowledge Distillation Across Modalities, Tasks and Stages for Multi-Camera 3D Object Detection Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View SCOTCH and SODA: A Transformer Video Shadow Detection Framework YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors(YOLOv7) 联邦学习(Federated Learning (#automatic driving) 其他ĬapDet: Unifying Dense Captioning and Open-World Detection PretrainingĮnhanced Training of Query-Based Object Detection via Selective Query Recollection 迁移学习/domain/自适应(Transfer Learning/Domain Adaptation) 37. 持续学习(Continual Learning/Life-long Learning) 36. 模型训练/泛化(Model Training/Generalization)Ģ7. 视觉表征学习(Visual Representation Learning) 25. 图像特征提取与匹配(Image feature extraction and matching) 24. 神经网络可解释性(Neural Network Interpretability) 21. 神经网络结构设计(Neural Network Structure Design)Ģ0. 视觉定位/位姿估计(Visual Localization/Pose Estimation) 17. 图像生成/图像合成(Image Generation/Image Synthesis) 15. 文本检测/识别/理解(Text Detection/Recognition/Understanding) 12.

人脸生成/合成/重建/编辑(Face Generation/Face Synthesis/Face Reconstruction/Face Editing).图像&视频检索/(Image&Video Retrieval/Video Understanding) 视频生成/视频合成(Video Generation/Video Synthesis).图像去阴影/去反射(Image Shadow Removal/Image Reflection Removal).
