This repository contains an app that allows users to ask questions about a provided document or engage in a question-answering chatbot conversation. The app is built with Streamlit, Llama-Index, LangChain, Llama-cpp-python, and is powered by the LLM NeuralBeagle14-7B-GGUF by Maxime Labonne.
- No OpenAI API key required.
- Runs locally on Macs (Tested on Apple Silicon Mac with macOS Ventura 13.5.1).
- You can turn off your internet connection.
- Python3.11 or above (should work with Python3.10)
- pip (Python’s package installer)
Follow these steps to install and set up the app:
-
Clone the Repository and Navigate to the Directory:
git clone https://github.com/hbacard/chatdoc-plus.git && cd chatdoc-plus
-
Create a Python Virtual Environment:
python3 -m venv .venv
-
Activate the Environment:
-
On Unix-based systems:
source .venv/bin/activate
-
-
Install Dependencies:
pip install -r requirements.txt
- Note: On Linux there might be an error with the installation of
llama-cpp-python
. You can try these steps:sudo apt-get install build-essential
pip install -r requirements.txt
- Note: On Linux there might be an error with the installation of
-
Download the gguf file for
NeuralBeagle14-7B-GGUF
:python3 download_model.py
-
Enable GPU with llama-cpp-python:
-
On Apple Silicon (METAL):
CMAKE_ARGS="-DLLAMA_METAL=on" FORCE_CMAKE=1 pip install --force-reinstall llama-cpp-python==0.1.83 --no-cache-dir
-
On Linux:
CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 pip install --force-reinstall llama-cpp-python==0.1.83 --no-cache-dir
-
-
Run the App:
streamlit run app.py
Open your browser and navigate to http://localhost:8501 to interact with the app.
- Note: If the app doesn't start you can try these steps:
- Deactivate the virtual environment with
deactivate
- Reactivate it with
source .venv/bin/activate
- Then run
streamlit run app.py
- Deactivate the virtual environment with
The app provides two modes of operation:
- Q&A Chat Bot: When no document is provided, engage in a question-answering conversation with the AI assistant.
- Document Query Mode: Upload a document and ask questions about its content. The AI assistant will process queries using the document as context.
We welcome contributions to improve this app. If you have suggestions or encounter issues, please open an issue or submit a pull request on GitHub.
This project is licensed under the MIT License.
you may need to istall poppler to use the library pdftotext
.
https://stackoverflow.com/questions/45912641/unable-to-install-pdftotext-on-python-3-6-missing-poppler
brew install poppler
brew install pkg-config poppler python
In respect to the original repo by Hugo Bacard this fork has very little differences.
- There is a new page to query and scrape ArXiv for getting a series of pdf articles, based on my previously opened source repo ScrapXiv.
- There is a new page to scrape huggingface and get more models, other than
neuralbeagle14-7b.Q5_K_M.gguf
by Maxime Labonne. - The chat is now in a page that allow to select documents from the downloaded papers.
- The python code is packaged and linted.
- This is an hobby project and the time for active maintenance will be very limited.
- Please feel free to fork and re-use the code, citing the original repo and its LICENCE.