Skip to content
@SHI-Labs

SHI Labs

Computer Vision, Machine Learning, and AI Systems & Applications

Pinned Loading

  1. Neighborhood-Attention-Transformer Neighborhood-Attention-Transformer Public

    Neighborhood Attention Transformer, arxiv 2022 / CVPR 2023. Dilated Neighborhood Attention Transformer, arxiv 2022

    Python 1.2k 90

  2. NATTEN NATTEN Public

    Fast Multi-dimensional Sparse Attention

    C++ 739 58

  3. Versatile-Diffusion Versatile-Diffusion Public

    Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

    Python 1.3k 84

  4. Prompt-Free-Diffusion Prompt-Free-Diffusion Public

    Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models, arxiv 2023 / CVPR 2024

    Python 760 38

  5. OneFormer OneFormer Public

    [CVPR 2023] OneFormer: One Transformer to Rule Universal Image Segmentation

    Jupyter Notebook 1.7k 152

  6. Compact-Transformers Compact-Transformers Public

    Escaping the Big Data Paradigm with Compact Transformers, 2021 (Train your Vision Transformers in 30 mins on CIFAR-10 with a single GPU!)

    Python 542 86

Repositories

Showing 10 of 65 repositories
  • NATTEN Public

    Fast Multi-dimensional Sparse Attention

    SHI-Labs/NATTEN’s past year of commit activity
    C++ 739 MIT 58 16 6 Updated Apr 14, 2026
  • MapReduce-LoRA Public

    [CVPR'26 Highlight] MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models

    SHI-Labs/MapReduce-LoRA’s past year of commit activity
    Python 4 MIT 0 0 0 Updated Apr 10, 2026
  • physical-ai-bench Public

    [CVPR 2026 Oral] PAI-Bench: A Comprehensive Benchmark for Physical AI

    SHI-Labs/physical-ai-bench’s past year of commit activity
    Python 68 MIT 3 1 0 Updated Apr 9, 2026
  • Forget-Me-Not Public

    Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models, 2023

    SHI-Labs/Forget-Me-Not’s past year of commit activity
    Python 138 MIT 8 7 0 Updated Oct 22, 2025
  • VisPer-LM Public

    [NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation

    SHI-Labs/VisPer-LM’s past year of commit activity
    Python 72 1 2 0 Updated Oct 17, 2025
  • T2I-Copilot Public

    T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)

    SHI-Labs/T2I-Copilot’s past year of commit activity
    Jupyter Notebook 49 MIT 3 0 0 Updated Oct 6, 2025
  • SHI-Labs/shi-labs.github.io’s past year of commit activity
    CSS 0 0 0 0 Updated Oct 5, 2025
  • IMG-Multimodal-Diffusion-Alignment Public

    IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance, ICCV 2025

    SHI-Labs/IMG-Multimodal-Diffusion-Alignment’s past year of commit activity
    Python 30 3 1 0 Updated Oct 1, 2025
  • StyleNAT Public

    New flexible and efficient image generation framework that sets new SOTA on FFHQ-256 with FID 2.05, 2022

    SHI-Labs/StyleNAT’s past year of commit activity
    Python 102 MIT 13 0 0 Updated Jun 26, 2025
  • SHI-Labs/Slow-Fast-Video-Multimodal-LLM’s past year of commit activity
    Python 28 1 2 0 Updated Apr 8, 2025

Top languages

Loading…

Most used topics

Loading…