SpeX is a groundbreaking AI-powered smart glasses solution designed to empower visually impaired individuals with the ability to navigate the world independently. With the help of cutting-edge AI technologies, SpeX acts as the "eyes" for its users, offering real-time visual assistance, auditory feedback, geographical tracking, distance measurement, and proactive alerts. πβ¨
Visually impaired individuals often face difficulties in navigating their surroundings, engaging in social interactions, and accessing written content. Traditional tools fall short of providing real-time visual assistance, geographical tracking, and obstacle detection, which limits their independence. SpeX tackles these challenges by integrating AI-driven object detection, face recognition, text-to-speech conversion, real-time tracking, distance measurement, and alert systems into a wearable smart glasses solution. πΆββοΈπ¬
SpeX aims to offer visually impaired users a wearable device that provides:
- Real-time object detection π
- Face recognition for better social interaction π₯
- Text recognition and reading π
- Real-time geographical tracking π
- Distance measurement for obstacle detection π
- Real-time alerts for safety
β οΈ - Voice-based interaction for a seamless experience π€
This innovative solution promotes independence, safety, and social integration. π€
- Object Detection (YOLO): Detects and locates objects in real-time to help users navigate and avoid obstacles π§±π§.
- Face Recognition (OpenCV): Recognizes faces for better social interaction and personalized experience π€.
- Text Recognition (Tesseract OCR): Converts printed text (e.g., signs, labels, books) into spoken words πποΈ.
- Real-Time Geographical Tracking: Provides location data and tracking to enhance situational awareness πΊοΈ.
- Distance Measurement: Measures distance to nearby objects and obstacles to prevent collisions π
β οΈ . - Real-Time Alerts: Alerts users to sudden environmental changes or potential dangers, ensuring safety π¨.
- Voice Interaction: Hands-free voice commands for easy and efficient interaction π€π£οΈ.
- Webcam: Captures real-time visual input π·
- Microphone: Records voice commands for interaction π€
- Speaker/Headphones: Outputs auditory feedback and alerts π
- Raspberry Pi 0: The heart of the system, providing cost-effective power β‘
- Bluetooth Earphones: For portable, hands-free audio π§
- Numpy: Handles numerical computations and data manipulation π’
- OpenCV: Real-time image processing for face and object recognition πΈ
- Mediapipe: Provides ML-based gesture and face tracking π€³
- Pytesseract: Extracts text from images for real-time reading π
- Transformers: Pre-trained models for NLP tasks like text summarization π
- Torch: Deep learning framework for AI model training and deployment π€
- Ultralytics (YOLO): Real-time object detection for accurate identification π¦Έ
- Face_recognition: Detects and recognizes faces with deep learning π§
- Speechrecognition: Converts speech to text for voice command processing π£οΈ
- Pytz: Handles time zones for real-time assistance β°
- Geopy: Provides geolocation for tracking and navigation π
- Pyttsx3: Converts text to speech for delivering feedback and alerts π§
-
Make sure you have Python 3.11 installed for optimal performance π.
-
Clone the repository:
git clone https://github.com/harshendram/speXweb.git
-
Navigate to the project directory:
cd speXweb
-
Install required dependencies:
pip install -r requirements.txt
-
Run the application:
python speX.py
SpeX provides real-time assistance through voice commands and proactive alerts. Here's how you can interact with the system:
- "What is the distance to [object]?" β Measures the distance to the specified object or obstacle.
- "Identify objects around me." β Detects objects in the user's vicinity.
- "Who is that?" β Recognizes faces.
- "Read this text." β Converts visible text into speech.
- "Where am I?" β Provides real-time geographical location data.
- Proactive Alerts β Alerts users of sudden changes or hazards (e.g., "Obstacle detected 2 meters ahead!").
SpeX revolutionizes the lives of visually impaired individuals by offering:
- Autonomous Navigation: Avoid obstacles, measure distances, and identify objects independently πΆββοΈ
- Social Interaction: With face recognition, users can interact more easily with people they know π«
- Information Access: Read printed text, signs, and documents without assistance ποΈ
- Geographical Awareness: Real-time tracking and location-based awareness for safe navigation πΊοΈ
- Safety Alerts: Proactive notifications about potential hazards in the environment
β οΈ
SpeX is designed to evolve with the following possibilities:
- Language Expansion: Support for multiple languages to cater to a wider audience π
- Advanced Recognition: Enhanced object and face detection through improved AI models π€
- Assistive Technology Integration: Future integrations with other assistive devices and technologies π
With SpeX, the world is just a little bit brighter. π Letβs make the future accessible, one step at a time. π