Skip to content
View jianjieluo's full-sized avatar
😇
Busssssssssssssssssssssy with graduation matters
😇
Busssssssssssssssssssssy with graduation matters

Block or report jianjieluo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. YehLi/xmodaler YehLi/xmodaler Public

    X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsens…

    Python 1k 111

  2. SCD-Net SCD-Net Public

    [CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion model with additional semantic prior.

    Python 57 5

  3. OpenAI-CLIP-Feature OpenAI-CLIP-Feature Public

    An easy to use, user-friendly and efficient code for extracting OpenAI CLIP (Global/Grid) features from image and text respectively.

    Python 112 6

  4. PCM-Net PCM-Net Public

    [ECCV24] Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning

    Python 3 2

  5. SynthImgCap SynthImgCap Public

    Official webpage for the paper: Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning [ECCV24]

    JavaScript

  6. MineCube MineCube Public

    A Cool Voxel Editor Based on OpenGL 3.3+ !

    C 30 5