Jia basicv8vc

I like to fine-tune Deep Neural Nets on small datasets.

93 followers · 123 following

Achievements

Organizations

Lists (17)

Alignment

32 repositories

DSP gang

LLM && Agents

31 repositories

LLM inference

LLM internal

How do you think?

LLM PC

14 repositories

LLM pretraining

18 repositories

llm reasoning

LLM tuning

16 repositories

LLM4Sci

side projects --> startup

look-a-look

13 repositories

mamba(ssm)

multimodal

11 repositories

non-Trans LLMs

LLMs with non-Transformer architectures

Triton && MLX && JAX

tts

workflow

Stars

etched-ai / open-oasis

Inference script for Oasis 500M

Python · 1,693 stars · 146 forks · Updated Nov 8, 2024

edwko / OuteTTS

Interface for OuteTTS models.

Python · 807 stars · 65 forks · Updated Jan 8, 2025

bklieger-groq / g1

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python · 4,141 stars · 376 forks · Updated Dec 6, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python · 8,039 stars · 615 forks · Updated Jan 2, 2025

xyflow / xyflow

React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…

TypeScript · 26,843 stars · 1,737 forks · Updated Jan 10, 2025

danijar / elements

Building blocks for productive research

Python · 47 stars · 6 forks · Updated Jan 11, 2025

zenbase-ai / core

Prompt engineering, automated.

Jupyter Notebook · 260 stars · 21 forks · Updated Nov 22, 2024

ax-llm / ax

The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.

TypeScript · 1,236 stars · 89 forks · Updated Jan 10, 2025

roboflow / sports

computer vision and sports

Python · 2,655 stars · 304 forks · Updated Aug 19, 2024

goombalab / hydra

Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"

Python · 118 stars · 9 forks · Updated Aug 3, 2024

OpenBMB / CPO

Python · 18 stars · 1 fork · Updated Jul 16, 2024

OpenBMB / VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation models

Python · 1,103 stars · 91 forks · Updated Jun 13, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python · 13,052 stars · 913 forks · Updated Oct 22, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook · 7,261 stars · 464 forks · Updated Nov 6, 2024

microsoft / LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python · 3,791 stars · 285 forks · Updated Jan 11, 2025

google-deepmind / opro

official code for "Large Language Models as Optimizers"

Python · 486 stars · 53 forks · Updated Dec 4, 2024

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python · 1,820 stars · 123 forks · Updated Oct 30, 2024

facebookresearch / chameleon

Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.

Python · 1,903 stars · 112 forks · Updated Jul 29, 2024

ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models

This repository collects all relevant resources about interpretability in LLMs

304 stars · 19 forks · Updated Nov 1, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python · 21,564 stars · 2,125 forks · Updated Jan 11, 2025

TransformerLensOrg / TransformerLens

A library for mechanistic interpretability of GPT-style language models

Python · 1,733 stars · 316 forks · Updated Jan 10, 2025

neelnanda-io / 1L-Sparse-Autoencoder

Python · 114 stars · 12 forks · Updated Oct 28, 2023

EleutherAI / sae

Sparse autoencoders

Python · 402 stars · 53 forks · Updated Dec 18, 2024

jbloomAus / SAELens

Training Sparse Autoencoders on Language Models

Jupyter Notebook · 564 stars · 134 forks · Updated Jan 12, 2025

ai-safety-foundation / sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

Python · 207 stars · 40 forks · Updated Jul 20, 2024

openai / sparse_autoencoder

Python · 400 stars · 40 forks · Updated Jul 19, 2024

ApolloResearch / e2e_sae

Sparse Autoencoder Training Library

Python · 37 stars · 8 forks · Updated Oct 29, 2024

zou-group / textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Python · 1,982 stars · 170 forks · Updated Dec 15, 2024

zyushun / Adam-mini

Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793

Python · 375 stars · 14 forks · Updated Dec 5, 2024

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook · 3,454 stars · 348 forks · Updated Jan 6, 2025