Skip to content

CsabaConsulting/InspectorGadgetApp

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Your Generative AI Companion: Open Multi-Modal Personal Assistant

Notes:

  1. A demo video of the app: https://www.youtube.com/watch?v=kCtHH6XG5as
  2. This project was submitted to the Gemini API Developer Competition.
  3. The cloud functions needed for the backend are in the functions subfolder of the new repository.
  4. The judging is now over and this repository got moved over to Open Multi-Modal Personal Assistant.
  5. The repository state at the time of the submission was the last commit on August 12th before I recorded the demo: https://github.com/Open-Multi-Modal-Personal-Assistant/OpenMMPA/tree/b7de11fa3d9122fd96b85e7c708802b371bb15c9
  6. After the Made By Google '24 someone may compare Open MMPA to Gemini Live, however we must notice that:
    • Open MMPA targets embedded form factors
    • Open MMPA also features a local Vector Database for history and personal RAG
    • Open MMPA may lack some multi modal capabilities and integrations, but those are all planned
    • Open MMPA is open source

Unleash the Power of Generative AI on Your Devices

Forget clunky AI pins! Your smartphone or smartwatch already has the potential to be your personal AI powerhouse. This open-source project harnesses the full might of generative AI, multi-modal capabilities, and RAG (Retrieval Augmented Generation) to turn your existing devices into intelligent companions.

Key Features:

  • Multi-Platform: Experience seamless AI assistance on iOS and Android, thanks to the versatility of Flutter.
  • Voice Mastery: Communicate effortlessly with native Android STT/TTS or unlock even more languages with Google Chirp's advanced speech recognition.
  • Personalized to You: Your AI remembers your preferences and past conversations, providing tailored recommendations and assistance through on-device RAG technology.
  • Versatile Modes:
    • Natural Voice Chat: Engage in dynamic conversations like you would with a friend.
    • Translation Mode: Break down language barriers with real-time translation powered by Chirp.
    • Multi-modal Exploration: Ask questions about anything you see through your camera, unlocking a new dimension of interaction.

Advanced Capabilities:

  • Leverage the Gemini 1.5 Flash & Pro Models: Experience cutting-edge AI language understanding and generation.
  • Tap into Powerful Tools:
    • Location-aware responses
    • Real-time weather forecasts
    • Sunrise and sunset information
    • Web research through Tavily (API key required)
    • Up-to-date fiat and crypto currency exchange rates

On the Horizon:

  • Web search via DuckDuckGo Assist
  • Business insights with Alpha Vantage (API key required)
  • Song lyric lookup

Future Possibilities:

  • Expand your AI's toolkit with SerpAPI integration
  • Calendar integration
  • Email integration
  • SMS / Text message, call, and Contacts integration

Join the AI Revolution:

Transform your mobile device into a personalized AI assistant. Contribute to this open-source project and shape the future of AI on your terms!

Remember: API keys for Tavily and Alpha Vantage are required to access their respective features.

Let's build a smarter, more connected future together!

About

Open Multi-Modal Personal Assistant

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published