- Bangalore
Lists (10)
Sort Name ascending (A-Z)
Stars
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Set of demo to try Isaac ROS with Isaac SIM
Framework for enhancing LLMs for RAG tasks using fine-tuning.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
The world's largest GitHub Repository for LLMs + Robotics
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[CoRL 2023] This repository contains data generation and training code for Scaling Up & Distilling Down
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Generating Robotic Simulation Tasks via Large Language Models
ROSA 🤖 is an AI Agent designed to interact with ROS1- and ROS2-based robotics systems using natural language queries. ROSA helps robot developers inspect, diagnose, understand, and operate robots.
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
Action Chunking Transformer implementation for low cost robot
Pytorch implementation of Stable Vector Fields on Lie Groups through Diffeomorphism
Pytorch implementation of diffusion models on Lie Groups for 6D grasp pose generation https://sites.google.com/view/se3dif/home
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning
C++ Implementation of a Multibody Vehicle Dynamics Simulation