Skip to content

Popular repositories Loading

  1. Awesome-Video-Diffusion Awesome-Video-Diffusion Public

    A curated list of recent diffusion models for video generation, editing, and various other applications.

    5.1k 310

  2. Tune-A-Video Tune-A-Video Public

    [ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

    Python 4.4k 396

  3. Show-o Show-o Public

    [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

    Python 1.7k 72

  4. computer_use_ootb computer_use_ootb Public

    Out-of-the-box (OOTB) GUI Agent for Windows and macOS

    Python 1.7k 166

  5. ShowUI ShowUI Public

    [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

    Python 1.5k 103

  6. Show-1 Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    Python 1.1k 57

Repositories

Showing 10 of 101 repositories
  • Code2Video Public

    A new paradigm for video generation via coding

    showlab/Code2Video’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Sep 29, 2025
  • TrustScorer Public

    ACM MM 2025 Can I Trust You? Advancing GUI Task Automation with Action Trust Score

    showlab/TrustScorer’s past year of commit activity
    5 MIT 0 0 0 Updated Sep 28, 2025
  • Awesome-Unified-Multimodal-Models Public

    đź“– This is a repository for organizing papers, codes and other resources related to unified multimodal models.

    showlab/Awesome-Unified-Multimodal-Models’s past year of commit activity
    696 36 5 0 Updated Sep 27, 2025
  • Awesome-MLLM-Hallucination Public

    đź“– A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

    showlab/Awesome-MLLM-Hallucination’s past year of commit activity
    856 37 1 0 Updated Sep 27, 2025
  • Show-o Public

    [ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

    showlab/Show-o’s past year of commit activity
    Python 1,715 Apache-2.0 72 55 2 Updated Sep 25, 2025
  • macosworld Public
    showlab/macosworld’s past year of commit activity
    Python 14 1 0 0 Updated Sep 22, 2025
  • Show-1 Public

    [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

    showlab/Show-1’s past year of commit activity
    Python 1,133 57 9 7 Updated Sep 13, 2025
  • livecc Public

    LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)

    showlab/livecc’s past year of commit activity
    Python 271 36 7 1 Updated Sep 9, 2025
  • DIM Public

    The official implementation of the paper "Draw-In-Mind: Learning Precise Image Editing via Chain-of-Thought Imagination"

    showlab/DIM’s past year of commit activity
    16 0 2 0 Updated Sep 4, 2025
  • videollm-online Public

    VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)

    showlab/videollm-online’s past year of commit activity
    Python 554 Apache-2.0 54 28 0 Updated Sep 2, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…