Skip to content
View zsychina's full-sized avatar
🐭
鼠鼠我啊,又要寄了
🐭
鼠鼠我啊,又要寄了
  • Sun Yat-sen University
  • Guangzhou, China
  • 23:40 (UTC +08:00)

Block or report zsychina

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zsychina/README.md

Hi there 👋

  • 🔭 Education Experience: Undergraduate - DLUT AUTOMATION 20FALL, Master - SYSU CS 24FALL
  • 🌱 Research Focus: Reinforcement learning, especially from human feedback
  • 👯 Tech Stack:PyTorch

Looking forward to making friends or cooperating with you!

Pinned Loading

  1. PrefTransPPO PrefTransPPO Public

    Using preference transformer to learning a reward function from dataset, then train an agent with PPO

    Python

  2. ppo-vanilla ppo-vanilla Public

    ppo minimum implementation

    Python

  3. ppo-continuous ppo-continuous Public

    ppo continuous

    Python

  4. sysu-select-course-script sysu-select-course-script Public

    中山大学研究生选课脚本

    Python

  5. GA-PID-Optimize GA-PID-Optimize Public

    遗传算法整定PID参数,大连理工大学'23《现代智能优化算法》X《计算机控制技术课程设计》

    Python 1

  6. ppo-transformer ppo-transformer Public

    GPT-2 structure transformer for sequential decision making in gym environment

    Python