LLM
TigerBot: A multi-language multi-task LLM
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
Meta-Transformer for Unified Multimodal Learning
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LAVIS - A One-stop Library for Language-Vision Intelligence
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
The official GitHub page for the survey paper "A Survey of Large Language Models".
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
ImageBind One Embedding Space to Bind Them All
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet and so on. 文本生成模型,实现了包括LLaMA,ChatGLM,BLOOM,GPT2,Seq2Seq,BART,T5,UDA等模型的训练和预测,开箱即用。
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Run any open-source LLMs, such as Llama, Mistral, as OpenAI compatible API endpoint in the cloud.
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Let your Claude able to think
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…
Train transformer language models with reinforcement learning.