Skip to content
Change the repository type filter

All

    Repositories list

    • Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
      Python
      MIT License
      24500Updated Dec 10, 2024Dec 10, 2024
    • flair

      Public
      01910Updated Dec 3, 2024Dec 3, 2024
    • cosmos

      Public
      0500Updated Dec 3, 2024Dec 3, 2024
    • [NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases
      Python
      MIT License
      02220Updated Nov 30, 2024Nov 30, 2024
    • ZerAuCap

      Public
      [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
      Python
      11600Updated Nov 30, 2024Nov 30, 2024
    • [ICLR 2024] Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
      Python
      MIT License
      0000Updated Oct 31, 2024Oct 31, 2024
    • ReNO

      Public
      [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
      Python
      MIT License
      911160Updated Oct 12, 2024Oct 12, 2024
    • Python
      01700Updated Oct 5, 2024Oct 5, 2024
    • EgoCVR

      Public
      [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
      Python
      MIT License
      03110Updated Aug 27, 2024Aug 27, 2024
    • Official PyTorch implementation of "Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models" (ECCV 2024)
      Python
      MIT License
      2800Updated Aug 12, 2024Aug 12, 2024
    • DataDream

      Public
      [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"
      Python
      32640Updated Jul 24, 2024Jul 24, 2024
    • This repository contains the code for our DAGM GCPR 2023 paper "Text-to-feature diffusion for audio-visual few-shot learning"
      Python
      MIT License
      1800Updated Jul 23, 2024Jul 23, 2024
    • [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
      Python
      MIT License
      55310Updated Jul 4, 2024Jul 4, 2024
    • uot-fm

      Public
      Official Repository for "Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation" [ICLR 2024]
      Python
      MIT License
      41210Updated May 15, 2024May 15, 2024
    • Official repository for "Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model" [ICLR 2024 spotlight]
      MIT License
      0700Updated Feb 20, 2024Feb 20, 2024
    • ECCV 2022: Abstracting Sketches through Simple Primitives
      Python
      GNU General Public License v3.0
      52620Updated Jan 19, 2024Jan 19, 2024
    • ProbVLM

      Public
      ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models
      Python
      MIT License
      23620Updated Dec 21, 2023Dec 21, 2023
    • Code for the paper "Addressing caveats of neural persistence with deep graph persistence".
      Python
      0400Updated Nov 29, 2023Nov 29, 2023
    • ReGaDa

      Public
      BMVC 2023: Video-adverb retrieval with compositional adverb-action embeddings
      Python
      GNU General Public License v3.0
      1610Updated Nov 17, 2023Nov 17, 2023
    • CLEVR-X

      Public
      CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
      Python
      BSD 3-Clause "New" or "Revised" License
      12600Updated Oct 27, 2023Oct 27, 2023
    • DeViL

      Public
      GCPR 2023 - DeViL: Decoding Vision features into Language
      Python
      GNU General Public License v3.0
      01100Updated Oct 16, 2023Oct 16, 2023
    • ISCO

      Public
      ICCV 2023: Iterative Superquadric Recomposition of 3D Objects from Multiple Views
      GNU General Public License v3.0
      01210Updated Oct 5, 2023Oct 5, 2023
    • KG-SP

      Public
      PyTorch code of our KG-SP method for Compositional Zero-Shot Learning
      Python
      GNU General Public License v3.0
      01100Updated Aug 10, 2023Aug 10, 2023
    • ZS-A2T

      Public
      [GCPR 2023] Zero-shot Translation of Attention Patterns in VQA Models to Natural Language
      0300Updated Jul 28, 2023Jul 28, 2023
    • Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
      Python
      MIT License
      12700Updated Jul 10, 2023Jul 10, 2023
    • Official repository for the ICCV 2023 paper: "Waffling around for Performance: Visual Classification with Random Words and Broad Concepts"
      Python
      MIT License
      65610Updated Jul 8, 2023Jul 8, 2023
    • czsl

      Public
      PyTorch CZSL framework containing GQA, the open-world setting, and the CGE and CompCos methods.
      Python
      GNU General Public License v3.0
      2711270Updated Jun 22, 2023Jun 22, 2023
    • Official PyTorch implementation of CVPR 2023 MULA Workshop paper "Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval"
      0700Updated Apr 3, 2023Apr 3, 2023
    • This repository contains the code for our CVPR 2022 paper on "Non-isotropy Regularization for Proxy-based Deep Metric Learning".
      Python
      Apache License 2.0
      51410Updated Mar 10, 2023Mar 10, 2023
    • ACVC

      Public
      Official PyTorch implementation of CVPRW 2022 paper "Attention Consistency on Visual Corruptions for Single-Source Domain Generalization"
      Python
      MIT License
      02910Updated Feb 22, 2023Feb 22, 2023