  • HUST(Huazhong University of Science and Technology)
  • Wuhan

pipixin321/README.md

Hi there 👋, I'm Huaxin Zhang

I am a Master's student at HUST (Huazhong University of Science and Technology), supervised by Nong Sang.

🔭 Research-wise, I mainly focus on:

  • Multi-modal Large Language Models
  • Video Understanding, more specifically Weakly-supervised Temporal Action Localization (WSTAL) and Weakly-supervised Video Anomaly Detection (WSVAD).

😄 I am open to:

  • An internship, job, or PhD offer in computer vision or multimodal LLM research and engineering.

📫 Contact me by:

💬 News:

  • 2024-07-01: We released the code and model for "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM". [project page]
  • 2024-06-10: We released the code and model for "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities". [project page]
  • 2024-01-29: I started my internship at Baidu VIS, doing research on Multi-modal Large Language Models (MLLMs).
  • 2023-12-09: One paper on point-supervised temporal action localization was accepted to AAAI 2024.


Pinned

  1. HR-Pro Public

    Official implementation of HR-Pro (AAAI 2024)

    Python 26 1

  2. HolmesVAD Public

    Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"

    Python 77 3

  3. Arcana Public

    Forked from syp2ysy/Arcana

    Implementation of "Arcana: Improving Multi-modal Large Language Model through Boosting Vision Capabilities"

    Python

  4. GlanceVAD Public

    Official implementation of GlanceVAD

    17

  5. IROS22-FMG-Sensor-Optimization Public

    Forked from JerryX1110/IROS22-FMG-Sensor-Optimization

    [IROS22 Oral] Optimization of Forcemyography Sensor Placement for Arm Movement Recognition https://arxiv.org/abs/2207.10915

    Python

  6. Two-Branch-Network-For-SMPL-based-Action-Recognition Public

    SMPL Action Recognition Implementation

    Python 1