Skip to content

Popular repositories Loading

  1. Tune-A-Video Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.3k 389

  2. Awesome-Video-Diffusion Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    3.9k 219

  3. computer_use_ootb computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python 1.2k 108

  4. Show-o Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1.1k 49

  5. Show-1 Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 952 61

  6. ShowUI ShowUI Public

    Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Jupyter Notebook 868 49

Repositories

Showing 10 of 76 repositories
  • ShowUI Public

    Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    showlab/ShowUI’s past year of commit activity
    Jupyter Notebook 868 Apache-2.0 49 1 0 Updated Jan 24, 2025
  • Awesome-Robotics-Diffusion Public

    (In progress) A curated list of recent robot learning papers incorporating diffusion models for robotics tasks.

    showlab/Awesome-Robotics-Diffusion’s past year of commit activity
    22 1 0 0 Updated Jan 24, 2025
  • Show-o Public

    [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

    showlab/Show-o’s past year of commit activity
    Python 1,138 Apache-2.0 49 35 1 Updated Jan 23, 2025
  • Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

    showlab/Awesome-Video-Diffusion’s past year of commit activity
    3,851 219 1 0 Updated Jan 23, 2025
  • computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    showlab/computer_use_ootb’s past year of commit activity
    Python 1,190 Apache-2.0 108 30 5 Updated Jan 21, 2025
  • Awesome-Unified-Multimodal-Models Public

    📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    showlab/Awesome-Unified-Multimodal-Models’s past year of commit activity
    341 15 0 0 Updated Jan 18, 2025
  • MovieSeq Public

    [ECCV2024] Learning Video Context as Interleaved Multimodal Sequences

    showlab/MovieSeq’s past year of commit activity
    Jupyter Notebook 34 1 0 0 Updated Jan 18, 2025
  • Awesome-GUI-Agent Public

    💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

    showlab/Awesome-GUI-Agent’s past year of commit activity
    444 29 0 0 Updated Jan 17, 2025
  • FQGAN Public

    FQGAN: Factorized Visual Tokenization and Generation

    showlab/FQGAN’s past year of commit activity
    Python 40 0 0 0 Updated Jan 5, 2025
  • Tune-An-Ellipse Public

    [CVPR 2024] Tune-An-Ellipse: CLIP Has Potential to Find What You Want

    showlab/Tune-An-Ellipse’s past year of commit activity
    Python 9 1 2 0 Updated Jan 5, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.