Project Archived

This repository has been archived and is no longer actively maintained here. The project has been split and is now maintained under the 1Mind Labs organization.

For the latest updates and continued development, please visit the respective repositories within the 1Mind Labs organization.

Project Description

An automated digital forensics and incident response system designed for anomaly detection and pattern recognition across system data and network activity. The tool integrates AI/ML models to classify system risk levels, identify indicators of compromise (IoCs), and generate actionable insights from forensic disk images, memory dumps, and network traffic.

Built for Smart India Hackathon - 2024

Key Features

Automated Forensic Data Collection: Automates FTK Imager, Volatility, RegRipper, and Sysinternals Suite through Python libraries (PyEWF, MemProcFS, Regipy, and PSUtil) for forensic images, memory dumps, registry hives, and background processes.
IoC Identification: Utilizes custom YARA rules and MISP databases for detecting file anomalies and correlating known IoCs.
Network Traffic Analysis: Leverages Wireshark and Scapy to analyze packet captures and identify suspicious network activities.
AI/ML Integration: Implements TensorFlow models for anomaly detection and risk classification, offering investigators prioritized analysis of critical artifacts.
Cross-Platform Dashboards: Provides real-time data visualization, interactive timelines, and detailed reports with export options in PDF, JSON, and CSV formats.
Scalable Architecture: Built with FastAPI, Next.js, and Flutter, ensuring high performance and easy deployment across environments.

Additionally, the tool supports live drive detection, allowing investigators to connect drives and perform real-time forensic analysis. It also features a chatbot that provides detailed explanations of detected anomalies, offering further insights.

Project Milestones

Completed:

Automate disk image processing
Analyze system logs using YARA rules
- Display system logs results on the dashboard
Analyze network traffic with Scapy
- Display network traffic results on the dashboard
Develop ML model for risk type and risk level categorization
Develop ML model for network traffic prediction
Generate reports in PDF, CSV, and JSON formats
Build interactive graphs
Detect external drives

In Progress:

Automate memory dumping
Train network traffic ML model to identify more attack patterns
Integrate MISP IoC database for improved threat identification
Analyze registry hives
- Display registry results on the dashboard
- Develop ML model for registry hive analysis
Analyze system running processes
- Display running processes results on the dashboard
- Develop ML model for running processes analysis
Integrate blockchain technology for secure and immutable audit trails

Other:

Support desktop app download
Implement responsive web design
Improve loading animations
Add chatbot functionality and background image

Bugs:

Add YARA analysis support for additional file types: .doc, .docx, .xls, .xlsx, .ppt, .pptx
Implement multithreading for faster file and folder scanning within disk images and memory dumps

Project Architecture

Log Files: Uses the yara-python library to automate YARA rule-based file classification, scanning log files for malicious patterns and indicators of compromise.
Network Captures: Utilizes the Scapy library to automate network traffic analysis, replacing manual tasks typically handled by Wireshark. It looks for specific packets such as HTTP Requests, DNS Queries, and IP Packets to identify anomalies in packet captures.
Registry Hives: Employs the Regipy library to automate the tasks of Regripper, extracting and analyzing Windows registry hives for forensic investigation of system activity.
Running Processes: Leverages PSUtil to automate the functionalities of Sysinternals Suite, monitoring and collecting data on running processes, system performance, and resource usage.
FastAPI: Acts as the core backend framework, handling data ingestion, analysis requests, and communication between the various forensic modules and front-end interfaces, ensuring efficient processing.
Google Gemini: Integrates with the system to analyze processed file data using a prompt, generating detailed summaries of findings through ReportLab, which are then exported in report formats (PDF, CSV, and JSON).
TensorFlow AI/ML Model: Trained to detect anomalies, classify risks, and recognize patterns within the ingested forensic data, supporting advanced automated analysis and decision-making.
Web App: Allows users to input various forensic artifacts such as regular files, folders, memory dumps, or disk images. It provides a real-time interface for investigators to interact with the analysis engine.
Desktop App: Provides cross-platform compatibility with enhanced security features compared to web browsers, enabling secure input and analysis of forensic data with better local system access.

ML Model Design

Network Traffic Classification Model:
- We utilized a pre-trained XGBoost classifier from scikit-learn to evaluate network traffic patterns.
- The model was optimized for multiclass classification using log loss as the evaluation metric.
- Model performance indicates no signs of overfitting, as the training and validation results are closely aligned. (Shown in Graph 1)
- Achieved 98% accuracy on both the training set and the new test dataset, ensuring robust generalization.

XGBoost Multiclass Log Loss stabilizes over boosting rounds for training and validation datasets (Graph 1)

Risk Level and Type Classification Model:
- Implemented an RNN model with tokenization and GloVe embedding (100-dimensional vector embeddings) for text data.
- The LSTM layer is used to capture temporal patterns in the data, enabling better classification of risk types.
- A dropout layer was incorporated to prevent overfitting, leading to close alignment between training and validation data. (Shown in Graph 2 and Graph 3)
- The model achieved 95% accuracy, demonstrating strong performance across the dataset.

Model accuracy over training epochs for both training and validation datasets (Graph 2)

Model loss over training epochs for both training and validation datasets (Graph 3)

Project Vision

The project is designed to address key cybersecurity challenges faced by India, targeting specific threats such as large-scale financial fraud, ransomware attacks on critical infrastructure, and politically motivated cyber-attacks. For instance, India has witnessed increasing attacks on its banking sector, with incidents like the 2018 Cosmos Bank cyber heist, where ₹94 crores were siphoned off through malware. Additionally, high-profile incidents such as the 2020 ransomware attack on the Kundankulam Nuclear Power Plant, which threatened national infrastructure, highlight the critical need for enhanced forensic capabilities. The tool’s AI/ML models are trained on these major incidents, ensuring that it can detect patterns from both historic and emerging threats. By focusing on these pressing issues, the tool offers law enforcement and cybersecurity teams a solution that is directly relevant to India’s current cybersecurity landscape, speeding up investigations and reducing risks.

Looking ahead, the tool is designed with global scalability in mind, aiming to support the investigation of international cybercrimes. It will provide personalized workflows tailored to different types of cyber-attacks, such as cross-border ransomware campaigns or large-scale data breaches. The platform consolidates major forensic tools into one interface, offering customization options for investigators to adapt the tool based on specific cases, making it a comprehensive solution for both domestic and global cybersecurity challenges.

Getting Started

Follow these steps to set up and run the Crypta system on your local machine, or you can watch the demo video.

Installation

Option 1: Docker Setup

Pull the Docker Image:

docker pull areebahmeddd/crypta-backend:latest

Run the Docker Container:

docker run -p 8000:8000 -e GEMINI_API_KEY=your_gemini_api_key areebahmeddd/crypta-backend:latest

Replace your_gemini_api_key with your actual API key.

Option 2: Local Setup

Fork the Repository:
- Go to the Crypta repository and click "Fork" to create a copy under your GitHub account.

Clone the Repository:

git clone https://github.com/<your-username>/Crypta.git

Create a Virtual Environment (Optional but Recommended):
```
python -m venv .venv
```
Activate the Virtual Environment:
- Windows:
```
.venv\Scripts\activate
```
- macOS and Linux:
```
source .venv/bin/activate
```
Install Dependencies:
```
pip install -r requirements.txt
```
Set Up Environment Variables:
- Create a .env file in the project root directory with the following template:
```
GEMINI_API_KEY=your_gemini_api_key
```
Run the Python Application:
```
python app/run.py
```

Usage

After setting up the application using Docker or running it locally, you can verify that it's working by making a simple HTTP GET request.

Using a Web Browser or HTTP Client

Open your web browser or use an HTTP client like Postman and navigate to:
```
http://localhost:8000/
```
You should see a response confirming the backend server is running, such as:
```
{
  "message": "Backend server is running"
}
```
Using curl

Alternatively, you can test the server from the command line with a curl GET request:
```
curl http://localhost:8000/
```
This should return a similar JSON response confirming that the server is active.

Project Preview

Web Application UI

Landing Page

Upload Page

Dashboard Page (File Summary)

Dashboard Page (Vulnerability Summary)

Modal Page (Detected IoCs)

Dashboard Page (Graphs)

Flutter Application UI

Landing Page

Dashboard Page (Overall Summary)

Dashboard Page (Detected IoCs)

License

This project is licensed under the Apache License 2.0.

Authors

Areeb Ahmed
Shivansh Karan
Avantika Kesarwani
Yuktha PS
Shashwat Kumar
Rishi Chirchi

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Project Archived

Project Description

Key Features

Project Milestones

Completed:

In Progress:

Other:

Bugs:

Project Architecture

ML Model Design

Project Vision

Getting Started

Installation

Option 1: Docker Setup

Option 2: Local Setup

Usage

Project Preview

Web Application UI

Flutter Application UI

License

Authors

Files

README.md

Latest commit

History

README.md

File metadata and controls

Project Archived

Project Description

Key Features

Project Milestones

Completed:

In Progress:

Other:

Bugs:

Project Architecture

ML Model Design

Project Vision

Getting Started

Installation

Option 1: Docker Setup

Option 2: Local Setup

Usage

Project Preview

Web Application UI

Flutter Application UI

License

Authors