    Repositories list

    • STTN

      Public
      [ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting
      Jupyter Notebook
      MIT License
      83548111 Updated Jun 18, 2025
    • FTVSR

      Public
      [ECCV'22] FTVSR: Learning Spatiotemporal Frequency-Transformer for Compressed Video Super-Resolution
      Python
      MIT License
      13175170 Updated Oct 22, 2024
    • VQD-SR

      Public
      [ICCV'23] VQD-SR: Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
      Python
      44950 Updated Jun 19, 2024
    • [CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
      Python
      MIT License
      25453170 Updated Jun 5, 2024
    • [TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)
      Python
      Apache License 2.0
      79526161 Updated May 8, 2024
    • Stark

      Public
      [ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
      Python
      MIT License
      151708710 Updated Apr 13, 2024
    • TracKit

      Public
      [ECCV'20] Ocean: Object-aware Anchor-Free Tracking
      Python
      MIT License
      96618281 Updated Aug 7, 2023
    • JavaScript
      01200 Updated Jun 17, 2023
    • [TMM 2023] Language-Guided Face Animation by Recurrent StyleGAN-based Generator
      Python
      MIT License
      01110 Updated Apr 23, 2023
    • [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation
      Python
      MIT License
      21100 Updated Apr 3, 2023
    • STTR

      Public
      [ACCV'22] Fine-Grained Image Style Transfer with Visual Transformers
      Python
      MIT License
      61900 Updated Dec 6, 2022
    • soho

      Public
      [CVPR'21 Oral] Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
      Python
      2020990 Updated Sep 30, 2022
    • TTSR

      Public
      [CVPR'20] TTSR: Learning Texture Transformer Network for Image Super-Resolution
      Python
      MIT License
      11779030 Updated Jul 24, 2022
    • TTVSR

      Public
      [CVPR'22 Oral] TTVSR: Learning Trajectory-Aware Transformer for Video Super-Resolution
      Python
      MIT License
      1322390 Updated Jul 24, 2022
    • CKDN

      Public
      [ICCV'21] CKDN: Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment
      Python
      MIT License
      55960 Updated Apr 9, 2022
    • CyDAS

      Public
      Cyclic Differentiable Architecture Search
      Python
      MIT License
      63610 Updated Feb 14, 2022
    • [CVPR21] LightTrack: Finding Lightweight Neural Network for Object Tracking via One-Shot Architecture Search
      Python
      MIT License
      68459240 Updated Dec 29, 2021
    • tasn

      Public
      Trilinear Attention Sampling Network for Fine-grained Image Recognition
      Python
      39219166 Updated Dec 14, 2021
    • [CVPR'2019] PEN-Net: Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting
      Python
      MIT License
      76361231 Updated Nov 29, 2021
    • A collection of models for image<->text generation in ACM MM 2021.
      Python
      MIT License
      86720 Updated Oct 31, 2021
    • [MM'20] Aesthetic-Aware Image Style Transfer
      Python
      31520 Updated Sep 16, 2021
    • img2poem

      Public
      [MM'18] Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training
      Python
      6028130 Updated Aug 23, 2021
    • AutoML

      Public
      AutoFormer, Cream
      Python
      MIT License
      242100 Updated Jul 4, 2021
    • SiamDW

      Public
      [CVPR'19 Oral] Deeper and Wider Siamese Networks for Real-Time Visual Tracking
      Python
      MIT License
      177761201 Updated May 18, 2021
    • SariGAN

      Public
      [NeurIPS'20] Learning Semantic-aware Normalization for Generative Adversarial Networks
      Python
      25310 Updated May 14, 2021
    • NEAS

      Public
      Python
      41910 Updated May 11, 2021
    • WSOD2

      Public
      [ICCV'19] WSOD^2: Learning Bottom-up and Top-down Objectness Distillation for Weakly-supervised Object Detection
      Python
      MIT License
      45140 Updated Jan 26, 2021
    • [AAAI'20] Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
      Python
      Other
      165001 Updated Feb 16, 2020
    • DBTNet

      Public
      Code for our NeurIPS'19 paper "Learning Deep Bilinear Transformation for Fine-grained Image Representation"
      Python
      1910560 Updated Jan 20, 2020
    • 2D-TAN

      Public
      AAAI2020 - Learning 2D Temporal Localization Networks for Moment Localization with Natural Language
      Python
      31810 Updated Dec 10, 2019