mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
-
Updated
Apr 2, 2025 - Python
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Official PyTorch implementation of Fully Attentional Networks
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
Improved Residual Networks (https://arxiv.org/pdf/2004.04989.pdf)
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
Deep Isometric Learning for Visual Recognition (ICML 2020)
Improving Generalization via Scalable Neighborhood Component Analysis
PyTorch reimplementation of the paper "Involution: Inverting the Inherence of Convolution for Visual Recognition" (2D and 3D Involution) [CVPR 2021].
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
This repository contains the ViewFool and ImageNet-V proposed by the paper “ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints” (NeurIPS2022).
Deep Understanding of Traffic Scenes for Autonomous Driving
[TMLR] "Adversarial Feature Augmentation and Normalization for Visual Recognition", Tianlong Chen, Yu Cheng, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zhangyang Wang, Jingjing Liu
Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.
Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024
[ICCV W] Contextual Convolutional Neural Networks (https://arxiv.org/pdf/2108.07387.pdf)
Implementation of the paper "SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition".
Add a description, image, and links to the visual-recognition topic page so that developers can more easily learn about it.
To associate your repository with the visual-recognition topic, visit your repo's landing page and select "manage topics."