    Repositories list

    • CLIPS

      Public
      An Enhanced CLIP Framework for Learning with Synthetic Captions
      Python
      MIT License
      12200 Updated Dec 17, 2024
    • HTML
      0000 Updated Dec 4, 2024
    • This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine"
      Python
      1723370 Updated Nov 20, 2024
    • Python
      MIT License
      01310 Updated Oct 14, 2024
    • Python
      Other
      14000 Updated Sep 25, 2024
    • JavaScript
      0100 Updated Sep 24, 2024
    • Python
      0700 Updated Sep 4, 2024
    • This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
      112590 Updated Jun 13, 2024
    • AQA-Bench

      Public
      Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
      Python
      MIT License
      0400 Updated Jun 13, 2024
    • CLIPA

      Public
      [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
      Python
      Apache License 2.0
      1330500 Updated Jun 3, 2024
    • This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision"
      Python
      14510 Updated Jun 3, 2024
    • [CVPR 2024] This repository includes the official implementation of our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"
      Python
      03720 Updated May 13, 2024
    • FedConv

      Public
      [TMLR'24] This repository includes the official implementation of our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"
      Python
      MIT License
      02500 Updated Apr 30, 2024
    • EVP

      Public
      [TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
      Python
      MIT License
      43700 Updated Apr 30, 2024
    • AdvXL

      Public
      [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale"
      Python
      11740 Updated Apr 21, 2024
    • MixCon3D

      Public
      [CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
      Python
      33130 Updated Apr 21, 2024
    • HQ-Edit

      Public
      HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
      Python
      Other
      37830 Updated Apr 18, 2024
    • This repository includes the official implementation and dataset of our paper "Compress & Align: Curating Image-Text Data with Human Knowledge".
      0210 Updated Mar 22, 2024
    • [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
      Python
      37500 Updated Nov 28, 2023
    • SwinMM

      Public
      [MICCAI 2023] This repository includes the official implementation of our paper "SwinMM: Masked Multi-view with Swin Transformers for 3D Medical Image Segmentation"
      Python
      610300 Updated Oct 13, 2023
    • [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
      Python
      Apache License 2.0
      11900 Updated Sep 15, 2023
    • DMAE

      Public
      [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"
      Python
      Other
      510330 Updated Jul 24, 2023
    • RobustCNN

      Public
      [ICLR 2023] This repository includes the official implementation of our paper "Can CNNs Be More Robust Than Transformers?"
      Python
      MIT License
      1314300 Updated Jan 23, 2023
    • [ECCV 2022] This repository includes the official implementation of our paper "In Defense of Image Pre-Training for Spatiotemporal Recognition".
      Python
      MIT License
      01910 Updated Dec 22, 2022
    • vit_cert

      Public
      [ECCV 2022] This repository includes the official implementation of our paper "ViP: Unified Certified Detection and Recovery for Patch Attack with Vision Transformers"
      Python
      0300 Updated Jul 21, 2022