Make AutoRAG to Monorepo #960

vkehfdl1 · 2024-11-19T07:06:07Z

No description provided.

…ME.md

Added various entries to ignore specific files and directories in both the root directory's .gitignore and the api directory's .dockerignore. Additionally, included a Dockerfile for building a Python 3.10-slim-based API image with specified dependencies and runtime configurations. A docker-compose.yml file was introduced to define services and networks for frontend and API components.

…ents This commit updates the project naming convention in the README file from "AutoRAG API Server" to "AutoRAG-API" for consistency. Additionally, it modifies the version requirement in the `requirements.txt` file for AutoRAG to be greater than or equal to 0.3.8 to ensure compatibility with the latest features.

…o use port 5001 instead of 5000

…ath' field in ParseRequest model

…rectory creation accurately

…unction indentation.

…arser function

…ntly and improve error handling

# Conflicts: # autorag/autorag/vectordb/couchbase.py

working with uvicorn now

.github/workflows/publish.yml

…c version issues (#971) Co-authored-by: jeffrey <[email protected]>

* add delete endpoint and change to .env based operations * add api endpoint for gathering all env settings * load env variable when start each task * change GET /env to return everything (key & values) --------- Co-authored-by: jeffrey <[email protected]>

Co-authored-by: jeffrey <[email protected]>

# Conflicts: # autorag/autorag/vectordb/qdrant.py

…987) * feat: refactor SQL Trial DB from Pandas Trial DB, and Test code * 🚑 fix: Set correct WORK_DIR based on environment variable - Updated the logic in app.py to properly set the `WORK_DIR` based on the environment variable `AUTORAG_API_ENV`. If the environment is 'dev', the `WORK_DIR` will be located at `"../projects"`, otherwise, it will be set to `"projects"`. Additionally, the `.env` file path is now correctly constructed using the determined `WORK_DIR` value. * 🚑 fix: Update method to use model_validate_json in trial_dict['config'] assignment and update set_trial_config for trial_id with TrialConfig model dump JSON. Add get_all_config_ids and get_all_trial_ids SQL query functions. * ✨ feat: Add CORS headers and handle OPTIONS requests This commit introduces the addition of CORS headers in every response and explicit handling of OPTIONS requests in the API server. Includes setting Access-Control-Allow-Origin, Access-Control-Allow-Credentials, Access-Control-Allow-Headers, and Access-Control-Allow-Methods based on the request origin. * ✅ test: add test file for project creation with setup and cleanup fixtures, including logging configurations, environment setup, client creation, and project directory validation * 🚑 fix: Remove unnecessary commented-out properties in Trial class * 🚑 fix: Set correct WORK_DIR based on environment variable AUTORAG_WORK_DIR * ♻️ refactor: Update code in app.py and schema.py for better handling of working directory and model configuration. Fix deprecated usage in test_app.py and enhance testing in test_trial_config.py. * 📝 docs: update README with instructions for running using Docker Compose and monitoring options. * ✨ feat: start parsing documents task with improved import handling This commit introduces changes to the document parsing task initiation. The import statement for `parse_documents` has been updated within the file. Additionally, the logic for initiating the parsing process has been streamlined and improved for better performance and handling of imports. * ✅ test: add tests for project database operations such as initializing DB, setting/getting trials, updating trial configurations, and retrieving trial information by project or ID. * ♻️ refactor: Improve database initialization in SQLiteProjectDB - Refactored the `_init_db` method to enhance database initialization. - Added logging and enhanced debugging statements for better clarity. - Now checks for the existence of the database file and its directory before initializing. - If the database file does not exist, it creates the necessary directory and tables. - Adjusted permissions for directories (777) and the database file (666) accordingly. * 🚑 fix: correct chunking and parsing tasks in trial_tasks.py * 🔧 chore: Update imports and debug logging level in app.py - Updated import statement in app.py to include chunk_documents from trial_tasks module. - Changed the logging level from INFO to DEBUG for more detailed logging information. * ♻️ refactor: refactor parsing endpoint and improve error handling - Refactored the parsing endpoint to handle configuration data retrieval more efficiently. - Improved error handling to provide more informative error messages in case of missing data or failed tasks. * 🚑 fix: Correct chunked data path and task handling in start_chunking function * ✨ feat: Configure not to use uvloop, apply nest_asyncio, and correct import in app.py - Avoid using uvloop by setting asyncio event loop policy to DefaultEventLoopPolicy(). - Apply nest_asyncio after that to prevent conflicts. - Change the import in app.py from `from database.project_db import SQLiteProjectDB` to the correct import. refactor: Update Celery configuration in celery_app.py - Adjust broker and backend URLs to use 'redis://redis:6379/0'. - Modify the timezone to 'Asia/Seoul' for better synchronization. * 🚑 fix: Install system dependencies and pip, adjust Dockerfile for API service - Removed unnecessary comments related to installing pip as it's clear from the command itself - Added installation of 'watchfiles', setting PYTHONPATH and PYTHONUNBUFFERED environment variables - Created a directory for celery beat schedule and added an entrypoint script - Adjusted permissions for the entrypoint script and removed Windows line endings - Updated entrypoint to /entrypoint.sh in the API service section - Added environment variables for watching files, setting time zone, log level, and disabling Python output buffering * 🔧 chore: update subproject commit reference in autorag-frontend * 🔧 chore: add test_projects to .gitignore * add new lines and fix .env.dev * fix chunk_documents --------- Co-authored-by: Seungwoo hong <Seungwoo hong [email protected]> Co-authored-by: jeffrey <[email protected]>

* Change all datetime.now() to the timezone UTC * properly working UTC timezone in the API server --------- Co-authored-by: jeffrey <[email protected]>

…py (#1005) * ✨ feat: Add QA document generation task in trial_tasks.py and schema.py - Added a new field `qa_task_id` in the Trial schema to store the QA task ID. - Introduced `generate_qa_documents` shared task in `trial_tasks.py` for creating QA documents. - Updated imports and added `QACreationRequest` in `trial_tasks.py`. - Included function `run_qa_creation` in `generate_qa_documents` task for generating QA documents with status tracking and database updates. * 🚑 fix: Return full trial config in get_trial_config Adjusts the return statement in `get_trial_config` to return the complete trial configuration instead of just the model dump. * 🔧 chore: update subproject commit in autorag-frontend to 1434e797 --------- Co-authored-by: Seungwoo hong <Seungwoo hong [email protected]>

* Change the WORK_DIR setting * send file directly

…id. (#1011) * get all parsed documents and the parse is not relevant to the trial_id now * add get chunk list at the API server * chunk document at project view * /parse POST with parse_name * QA creation endpoint

bwook00 and others added 18 commits November 18, 2024 21:38

just commit

3d706dc

just commit

deb5d65

Merge branch 'main' into Feature/#956

e154268

add the root directory

b92172d

.gitignore in the autorag source folder

356c253

edit github actions

bc085e4

fix .env .gitignore

a9043fb

add root .gitignore

9a07a44

set PYTHONPATH at test.yml

e2c08fc

change the name of the test_base.py

4ba97d3

change the VERSION path at docs/conf.py

d9fa4b2

Add api to repository

9d90aca

Add api to repository

6f12102

add autorag at pythonpath

8ff9c86

edit gitignore for tracking projects folder

b6c4232

add README.md at projects folder for tracking projects folder

f32cec3

add autorag-frontend as git submodule

e9d5666

Do not run API test at github actions

7d2b8cb

vkehfdl1 marked this pull request as ready for review November 19, 2024 10:14

vkehfdl1 requested review from hongsw and bwook00 November 19, 2024 10:14

vkehfdl1 marked this pull request as draft November 19, 2024 11:21

홍승우 added 8 commits November 19, 2024 21:00

rename: update file path from api/projects/README.md to projects/READ…

51354ed

…ME.md

📝 docs: remove AutoRAG Workflow API documentation and related resources.

c7c0f9b

✨ feat: Add description for tutorial_1 project

55249ea

🔧 chore: update .gitignore to exclude .DS_Store

33cab10

🚑 fix: Update ports and environment variables in docker-compose.yml t…

0b6bfc3

…o use port 5001 instead of 5000

🚑 fix: Update schema.py with corrected field indentation and added 'p…

270638d

…ath' field in ParseRequest model

홍승우 and others added 9 commits November 19, 2024 21:21

🚑 fix: Fix indentation in validate.py for decorator functions.

60cda5b

🚑 fix: refactor authentication decorator in auth.py

62a9eb6

🚑 fix: Correct get_new_trial_dir parameter naming and handle trial di…

c467580

…rectory creation accurately

🚑 fix: Corrected import formatting in qa_create.py and standardized f…

dea7c09

…unction indentation.

✨ feat: Add dashboard module to autorag package and implement async p…

0473cb3

…arser function

🚑 fix: Refactor PandasTrialDB to handle trial operations more efficie…

5359b4d

…ntly and improve error handling

move upload file endpoint

b0b1970

turn evaluate_history.py workable again

cc56006

just reformat and edit ignore files

06284b2

vkehfdl1 linked an issue Nov 20, 2024 that may be closed by this pull request

Add AutoRAG front-end and API server for run AutoRAG #959

Open

jeffrey and others added 3 commits November 20, 2024 09:42

Merge branch 'main' into Feature/#959

31db596

# Conflicts: # autorag/autorag/vectordb/couchbase.py

working with uvicorn now

e9a546c

Merge pull request #966 from Marker-Inc-Korea/Feature/#965

d15ae04

working with uvicorn now

hongsw previously approved these changes Nov 20, 2024

View reviewed changes

.github/workflows/publish.yml Show resolved Hide resolved

vkehfdl1 and others added 2 commits November 20, 2024 16:29

Merge branch 'main' into Feature/#959

81a2f63

Add env variable to locate the project folder and resolve new pydanti…

24db0d7

…c version issues (#971) Co-authored-by: jeffrey <[email protected]>

vkehfdl1 dismissed hongsw’s stale review via 24db0d7 November 23, 2024 12:54

vkehfdl1 and others added 9 commits November 23, 2024 22:11

upload multiple files at once using key 'files' (#981)

23260f4

Co-authored-by: jeffrey <[email protected]>

Merge branch 'main' into Feature/#959

b4d0776

# Conflicts: # autorag/autorag/vectordb/qdrant.py

Make the default timezone at the API server to UTC (#992)

0558ab2

* Change all datetime.now() to the timezone UTC * properly working UTC timezone in the API server --------- Co-authored-by: jeffrey <[email protected]>

Change the api port to 8000 (#1007)

cd530bd

artifacts/content GET endpoint for sending raw_data files (#1008)

9611073

* Change the WORK_DIR setting * send file directly

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make AutoRAG to Monorepo #960

Make AutoRAG to Monorepo #960

vkehfdl1 commented Nov 19, 2024

Make AutoRAG to Monorepo #960

Are you sure you want to change the base?

Make AutoRAG to Monorepo #960

Conversation

vkehfdl1 commented Nov 19, 2024