Lists (17)
Sort Name ascending (A-Z)
Alignment
DSP gang
LLM && Agents
LLM inference
LLM internal
How do you think?LLM PC
LLM pretraining
llm reasoning
LLM tuning
LLM4Sci
side projects --> startuplook-a-look
mamba(ssm)
multimodal
non-Trans LLMs
非Transformer结构的LLMsTriton && MLX && JAX
tts
text to speechworkflow
效率maxStars
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
React Flow | Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely cust…
The unofficial DSPy framework. Build LLM powered Agents and "Agentic workflows" based on the Stanford DSP paper.
Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模型的中英双语多模态大模型系列
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
General technology for enabling AI capabilities w/ LLMs and MLLMs
official code for "Large Language Models as Optimizers"
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
This repository collects all relevant resources about interpretability in LLMs
A modular graph-based Retrieval-Augmented Generation (RAG) system
A library for mechanistic interpretability of GPT-style language models
Training Sparse Autoencoders on Language Models
Sparse Autoencoder for Mechanistic Interpretability
TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793