Extract and process abstracts from emails labeled as "Papers" in your Gmail account, and get them cleaned up using the power of gpt-3.5-turbo.
- Automated Email Fetching: Fetches emails from Gmail labeled as "Papers".
- Abstract Extraction: Extracts abstracts from these emails for consolidated reading.
- Natural Language Processing: Uses gpt-3.5-turbo to process and clean up the abstracts.
- Consolidated Output: Saves cleaned abstracts to a file.
To set up and run the project locally:
- Clone this repository:
git clone https://github.com/AlefAlfa/AbstractsExtractor.git
cd AbstractsExtractor
- Install the required packages:
pip install -r requirements.txt
Before you can fetch emails, you need to authenticate with your Gmail using OAuth2. Follow these steps:
- Go to the Google Cloud Platform Console.
- Create a new project.
- Enable the Gmail API for your project.
- Setup OAuth2 credentials.
- Download the
credentials.json
and place it in the root directory of this project. - For a detailed step-by-step tutorial on these, you can refer to this guide.
Once set up, you can run the main script to fetch, extract, and process abstracts:
python main.py
The cleaned abstracts will be saved in an output file in the project directory.
Feel free to fork this project and make any contributions you think will benefit the project. Pull requests are warmly welcome.
This project is licensed under the MIT License. See the LICENSE file for more details.
Thank you for using or contributing to this project!