From 4e72e6fb04b9658b14de97840014fbec9b1cee46 Mon Sep 17 00:00:00 2001 From: Kartik Virendra Rajput <88619994+kartikvirendrar@users.noreply.github.com> Date: Tue, 20 Aug 2024 14:05:22 +0530 Subject: [PATCH 1/4] added release timeline --- html/shoonya.html | 224 ++++++++++++++++++++++++++++++++++++++++++++-- 1 file changed, 219 insertions(+), 5 deletions(-) diff --git a/html/shoonya.html b/html/shoonya.html index 0d551c8..06172ad 100644 --- a/html/shoonya.html +++ b/html/shoonya.html @@ -32,10 +32,13 @@

Shoonya

- Try Demo - Codebase - Tutorial - API Specs + Try Demo + Codebase + Tutorial + API Specs
@@ -65,6 +68,217 @@

Installation Instructions

team.

+
+
+
+
+
+

Shoonya Releases

+

Lorem ipsum dolor sit amet, consectetur adipisicing elit

+
+
+
+
+
+
    +
  • +
    + V 3.0 +

    May 2024

    +

    + Flower Configuration for asynchronous celery tasks logging. + Deletion confirmation endpoint. + Added new notification types. + Access level changes for manager role. + Minor changes to analytics. + Added new project types - OCRSegmentationCategorization & editing + Dataset and Task data privacy changes to hide public URLs. + New endpoint to add a proxy Audio URL service. + Line Charts based performance analytics. + Integration changes to AcousticNormalizedTranscriptionEditing project type. + Exception handling and code refactoring for backend codebase. + Download endpoints optimisations. + Endpoint support to store transliteration logs to blob storage. + User active/inactive integration. + Decentralized User Invitation to workspace managers. + Changes to email templates. +

    +
    +
  • +
  • +
    + V 2.4 +

    October 2023

    +

    + Workspace and Organisation level Payment Reports. + Support for OCR, ASR data types predictions population. + Consider Batch sampling and automatic annotations creation support to pull new data + items. + Reports scheduling feature. + Backend support for Chitralekha UI for any Audio Project Types. + New project types Acoustic Normalized Transcription and Acoustic Normalized + Transcription Editing utilising CL UI. + Transliteration Logging support using Blob storage. + Support e-mail-based async calculation to all reports. + Code Refactoring and reformatting. + User profile picture upload and change feature using blob storage. + Elastic search and Kibana support for logging setup. + Download all projects within a workspace asynchronously using blob storage. + Some bug fixes for annotation filter, reports mail along with others. +

    +
    +
  • +
  • +
    + V 2.3 +

    July 2023

    +

    + Conversation Verification Project Type. + OCR Project Types modifications. + Option to change the stage of a project to Supercheck Stage. + All endpoints for Supercheck workflow. + Bug fixes for Assigning and Unassigning Tasks. + New field domain in TranslationPair datatype. + Superchecker notes. + Reports Bug fixes. + Frozen Users for Workspace. + Re-invite users. + Modification for Login and Change Password. + Automatic Annotation Creation for external data. + Intra-dataset Automation to populate draft_data_json. + More quality parameters in reports (WER, segment length etc). + Workspace-level analytics. +

    +
    +
  • +
  • +
    + V 2.1 +

    April 2023

    +

    + New user roles for Reviewer, Super-checker, and Admin. + Updated endpoints relevant to Reviewer and Admin roles. + Integration with new Indic-Trans-v2 deployed on Dhruva. + New field in the project model to support project stage in place of deprecated field + review enabled. + New annotation and task statuses to support super-checker flow in upcoming versions. + annotation_type field in all annotations to signify which role it belongs to. + New field named revision_loop_count in the task model to support bookkeeping. + New field named super_checker_user in the task model. + Changes to migrate user roles based on the appropriate work they are assigned to. +

    +
    +
  • +
  • +
    + V 2.0 +

    March 2023

    +

    + Design changes to have a status associated with each annotation + New project Types support for: + Domain Classification along with Sentence verification. + Audio Segmentation + Audio Transcription Editing with support for populating predictions. + Glossary support on annotation page of Translation Projects. + Support for reviewers to accept a task with major or minor changes. + Draft and skip option for reviewers. + Tags support to allow noise tagging in all Audio project types. + Improved UI to increase Audio Transcription productivity. + Word count/ Audio duration based public analytics. + Optimized project listing based on recently worked project. + Export fix in conversation translation editing project type. + Support search and filter based task flow for Start Labeling Now button + New task status to reflect whether a task is exported. + All tasks tab for managerial view of a project + Frontend bug fixes for Automate Datasets page. + Filters for Projects listing and Datasets listing pages. + Support to download all annotations from all tasks of a Translation Project. + Endpoint to allow managers to deallocate tasks for any user in the project. +

    +
    +
  • +
  • +
    + V 1.3 +

    November 2022

    +

    + New project type for Single speaker transcription editing. + Public API endpoint for language based Organization Analytics. + Complete support for all levels of review reports. + Annotation Quality Reports. + Backend support to categorize accepted with major/minor changes. + Search support for Dataset Items table. + Bulk delete endpoints for tasks and data items. + Support for Managers, Org Owners to be able to annotate tasks. +

    +
    +
  • +
  • +
    + V 1.2 +

    October 2022

    +

    + Improved User reports for annotations and review. + Patch to update conversation data type to support Machine Translations. + Celery-based implementation for Automated MT function for Conversation Data Type. + Removal of task-lock deprecated functionality. + Workspace-level User analytics for review workflow. + Endpoint to support bulk deletion of data items and all linked tasks. + Organization-level User analytics for review workflow. + Review workflow-based reports for the Analytics tab (publicly accessible endpoint). + Review reports for User-level progress. + Integration with Azure Translate. + TSV support for Projects download. + Endpoint to support Normalized character-level edit distance between sentences. + Support to filter by task status while downloading projects. + New Boolean field in the Users model to support user input for receiving daily + mails. + Endpoint and celery-beat setup for sending daily progress emails to Users + Support for annotation, and review reports for daily progress mails. +

    +
    +
  • +
  • +
    + V 1.1 +

    August 2022

    +

    + Rename Task status rejected to to_be_revised. + Fix a bug in the review feature. + Add reviewer reports for project analytics. + Refactor User field to Annotator in Project model. + Functionality to remove users from workspace. + Code refactoring to take user_id instead of username or email in various endpoints. + Support automated transformation of Sentence Text Datasets to Translation Pairs + Dataset involving functions for Generating Machine Translations using IndicTrans and + Google Translate models. +

    +
    +
  • +
  • +
    + V 1.0 +

    July 2022

    +

    + Supports all the 22 official Indian languages + Currently support Sentence Verification tasks, Context Translation Verification + project types + Provides AI support with translation + Cleaner hierarchy of Organization, Workspace, Projects. + Reports at various levels (Org, workspace, project, user) and multiple dimensions + Allow creation of task chains and custom inputs as required by Language Experts + Enables language coordinators to enable effective collaboration (Shareable Notes, + Drafts) + RTL and Transliteration based support +

    +
    +
  • +
+
+
+
+
+
@@ -77,4 +291,4 @@

Installation Instructions

- \ No newline at end of file + From 924b4cf80e84782df8e82d9ef12286b7b022233e Mon Sep 17 00:00:00 2001 From: Kartik Virendra Rajput <88619994+kartikvirendrar@users.noreply.github.com> Date: Tue, 20 Aug 2024 14:05:53 +0530 Subject: [PATCH 2/4] added release timeline --- styles/ai4b-tools.css | 104 +++++++++++++++++++++++++++++++++++++++++- 1 file changed, 103 insertions(+), 1 deletion(-) diff --git a/styles/ai4b-tools.css b/styles/ai4b-tools.css index 4ae9756..dc6281e 100644 --- a/styles/ai4b-tools.css +++ b/styles/ai4b-tools.css @@ -69,4 +69,106 @@ } } -/* End style for the tools pages */ \ No newline at end of file +.pt-100{ + padding-top:100px; +} +.pb-100{ + padding-bottom:100px; +} +.section-title { + margin-bottom: 60px; + color: var(--color-secondary); +} +.section-title p { + color: #777; + font-size: 16px; +} +.section-title h4 { + text-transform: capitalize; + font-size: 40px; + position: relative; + padding-bottom: 20px; + margin-bottom: 20px; + font-weight: 600; +} +.section-title h4:before { + position: absolute; + content: ""; + width: 60px; + height: 2px; + background-color: rgb(255, 102, 0); + bottom: 0; + left: 50%; + margin-left: -30px; +} +.section-title h4:after { + position: absolute; + background-color: rgb(255, 102, 0); + content: ""; + width: 10px; + height: 10px; + bottom: -4px; + left: 50%; + margin-left: -5px; + border-radius: 50%; +} +ul.timeline-list { + position: relative; + margin: 0; + padding: 0 +} +ul.timeline-list:before { + position: absolute; + content: ""; + width: 2px; + height: 100%; + background-color: rgb(255, 102, 0); + left: 50%; + top: 0; + -webkit-transform: translateX(-50%); + transform: translateX(-50%); +} +ul.timeline-list li { + position: relative; + clear: both; + display: table; +} +.timeline_content { + border: 2px solid rgb(255, 102, 0); + background-color:#fff +} +ul.timeline-list li .timeline_content { + width: 45%; + color: #333; + padding: 30px; + float: left; + text-align: right; +} +ul.timeline-list li:nth-child(2n) .timeline_content { + float: right; + text-align: left; +} +.timeline_content h4 { + font-size: 22px; + font-weight: 600; + margin: 10px 0; +} +ul.timeline-list li:before { + position: absolute; + content: ""; + width: 25px; + height: 25px; + background-color: rgb(255, 102, 0); + left: 50%; + top: 50%; + -webkit-transform: translate(-50%, -50%); + transform: translate(-50%, -50%); + border-radius: 50%; +} +.timeline_content span { + font-size: 18px; + font-weight: 500; + font-family: poppins; + color: rgb(255, 102, 0); +} +/* End style for the tools pages */ From e4ec567b7c19b377b8ed236c4e55d2d7577c2572 Mon Sep 17 00:00:00 2001 From: kartikvirendrar Date: Thu, 22 Aug 2024 16:43:40 +0530 Subject: [PATCH 3/4] created chitralekha page --- html/chitralekha.html | 464 ++++++++++++++++++++++++++++++++++++++ html/shoonya.html | 512 ++++++++++++++++++++++++------------------ styles/ai4b-tools.css | 112 +++++---- 3 files changed, 824 insertions(+), 264 deletions(-) create mode 100644 html/chitralekha.html diff --git a/html/chitralekha.html b/html/chitralekha.html new file mode 100644 index 0000000..dfba3ef --- /dev/null +++ b/html/chitralekha.html @@ -0,0 +1,464 @@ + + + + + + + AI Annotation Tool + + + + + + + + + + + +
+ +
+
+ +
+
+

Chitralekha

+

Chitralekha is an open-source AI-powered video transcreation platform, + developed by AI4Bharat and EkStep. It has an integrated workforce management system, which + enables end-to-end transcreation of a video from one language to another through the stages of + transcription, translation and voice-over for the translated language. +

+ + +
+
+ + +
+ +
+
+
+ + +
+

Installation Instructions

+

Follow the steps below to install and run the AI Annotation Tool:

+
Backend Setup
+
+# Clone the Chitralekha-Backend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Chitralekha-Backend.git
+            
+# Create a virtual environment for the project. Replace  with your preferred environment name.
+python3 -m venv 
+
+# Activate the virtual environment. This ensures that the packages you install are isolated from the global Python environment.
+source /bin/activate
+
+# Install all required Python packages listed in the requirements-dev.txt file.
+pip install -r deploy/requirements-dev.txt
+
+# Set up the environment variables needed for the project by copying the example environment file.
+cp .env.example ./backend/.env
+
+# Generate a new secret key for Django (within the virtual environment):
+# Open a Python shell.
+python backend/manage.py shell
+
+    # Import the utility function to generate a secret key.
+    >> from django.core.management.utils import get_random_secret_key
+
+    # Generate and print a new secret key.
+    >> get_random_secret_key()
+
+# Copy the generated secret key and paste it into the .env file as the value for SECRET_KEY.
+
+# Run Migrations
+# The following steps are required only when you run the project for the first time or after making changes to the models.
+
+    # Check if there are any pending migrations.
+    python backend/manage.py makemigrations 
+
+    # Apply all pending migrations to the database.
+    python backend/manage.py migrate
+
+# Create a superuser for accessing the Django admin interface (required only once).
+python backend/manage.py createsuperuser
+
+# Install and start Redis in a separate terminal.
+# Start Flower for monitoring Celery workers.
+python -m celery -A backend flower
+
+# Start the first Celery worker for the "default" queue.
+python -m celery -A backend worker -Q default --concurrency=1 -l DEBUG
+
+# Start the second Celery worker for the "voiceover" queue.
+python -m celery -A backend worker -Q voiceover --concurrency=1 -l DEBUG
+
+# Run the Django development server.
+python manage.py runserver
+
Frontend Setup
+
+# Clone the Chitralekha-Frontend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Chitralekha-Frontend.git
+
+# Change directory to the newly cloned Chitralekha-Frontend folder.
+cd Chitralekha-Frontend
+
+# Install the necessary dependencies for the project. 
+# The --force flag is used to bypass conflicts with the existing dependencies.
+npm i --force
+
+# Start the development server. This will run the frontend application on a local server.
+npm start
+

If you encounter any issues, please refer to the backend + and frontend documentation or reach out to our support team.

+
+ +
+
+
+
+

Chitralekha Releases

+

Lorem ipsum dolor sit amet, consectetur adipisicing elit

+
+
+
+ +
+
+
+ V 3.1 +

March 2024

+

Bulk Download of VoiceOver Tasks -> Zip file will be emailed to the users.

+

Bug fix in decimal points in Translation Tasks.

+

OnBoarding functionality for new clients.

+

Enable Original Source Translation.

+

User can Preview subtitles in Edit Page.

+
+
+
+
+
+ +
+
+
+
+ V 3.0 +

February 2024

+

Local implementation of Transliteration.

+

Users can add and view glossary.

+

Their added glossary will be automatically replaced in newly created Translation Edit + tasks.

+

Org Owners can upload bulk glossary.

+

Sorted filter list.

+

Added skip tutorial functionality.

+

Description of video as name of file downloaded by NPTEL.

+

Made get updated audio button disabled instead of hiding it.

+

Countdown for remaining time while recording VO task.

+

Testimonials in landing page.

+

Public Charts in landing page.

+

Show decimal values in VoiceOver's duration.

+
+
+
+
+ +
+
+
+ V 2.8 +

January 2024

+

Paginate Reports at Organisation and Project Level.

+

In case of Machine Generated Source type, if Transcription/Translation API fails, Org + Owner/Project Manager can regenerate the response.

+

Show Project Id in Projects list.

+

Org Owners can clone a project into multiple projects (max. 20).

+

Task Id and Project Id are shown in Reports.

+

Organization specific priority upgrade.

+

Org Owner and Project Manager can update roles of members.

+
+
+
+
+
+ +
+
+
+
+ V 2.7 +

January 2024

+

Retained the filters in task list.

+

Newsletter new UI with separate Write and Preview screens.

+

Enable preview for HTML files uploaded in template 3.

+

Org Owner would not have org owner permission if he is a member of project and not + org owner.

+

User can update newsletter email and newsletter categories that they are subscribed + to.

+

Clear Report table when project is changed.

+

Return to task list after completion of a task.

+

Snackbar showing up in the background in Video Update (issue fix).

+

Added category and preview for all the three templates.

+

Newsletter Subject is made customizable.

+
+
+
+
+ +
+
+
+ V 2.6 +

December 2023

+

BG music enabled from FE.

+

Add button in Newsletter -> To add multiple headers and paragraphs in a single + template.

+

Subscribed User can unsubscribe or update their emails in their profile section.

+

Users can unsubscribe directly from the Newsletter email.

+

Sort Tasks list by created_at, updated_at, description in ascending and descending + order. (Default sorting is done by updated_at in descending order.)

+

Fix in Update multi speaker Info in video Update.

+

Support for exporting additional subtitle formats in sbv, TTML, scc, rt.

+

Send Newsletters immediately.

+

All NPTEL users have been subscribed to newsletter.

+

Subscript/superscript for VoiceOver as well.

+

Enable email for users who have already subscribed to daily mails.

+

Label changes for unsubscribe and update email.

+

Toggle for subscription has been removed. It will happen through buttons.

+
+
+
+
+
+ +
+
+
+
+ V 2.5 +

December 2023

+

Accept multiple speaker info in video update.

+

Background music shifted to a new server.

+

New Scheduled Tasks:

+

a. Org owner email notifications -> for new joinees to the organization.

+

b. ETA email notifications -> To all users, reminder ETA mails.

+

Return info of empty cards in Transcription and Translation.

+

Reopen Translation Task.

+

Newsletter -> Users can subscribe to newsletter (in their profile page) and get + updates from our end.

+

Move multiple cards simultaneously.

+

Bug fix in Bulk Task Update.

+

Removed "Original Source" option for the subtitles.

+
+
+
+
+ +
+
+
+ V 2.4 +

September 2023

+

Search by Task Id in Tasks tab.

+

ETA in CSV upload (This field needs to be added in the end. The accepted format is + DD-MM-YYYY. The ETA date should be greater than current date).

+

Show Task queue status separately for Transcription/Translation/VoiceOver.

+

Show "Updated At" in Column selector of Tasks table.

+

Show description by default in tasks Table for NPTEL.

+

Fix for Preview not showing the subtitles for completed tasks.

+

Color of the selected text changed.

+

Shortcut keys changed for few cases.

+

Focus issue for the selected card.

+
+
+
+
+
+ +
+
+
+
+ V 2.3 +

September 2023

+

Background music integration with the target translated video.

+

Add Source Type Original Source for Translation. (Create Translation from YouTube + caption. In this case, Transcription Edit should not be created.)

+

Support for Superscript/subscript in transcription and translation.

+

Introduce Keyboard Shortcuts for performing actions on the edit window.

+

Info of wrong timestamps in Transcription.

+

Reopen already completed Translation. (If a VO task already exists, users will have + to manually delete that before it can be reopened.)

+

Invite multiple members in the organization.

+

Hide Add Org Member Button for project Manager.

+

Search with name or email id in members list of project, while adding members.

+

Incorrect behavior of the options in settings fixed.

+
+
+
+
+ +
+
+
+ V 2.2 +

August 2023

+

Migrate to Dhruva for IndicASR.

+

Find and replace for partial words.

+

Auto logout when page is idle.

+

Info button to show sentences with time issues in Translation.

+

Info button to show empty audios in VoiceOver.

+

Time spent metric for every task.

+

Page mix-up issue fix.

+

Manually created Voice-Over fixes.

+

Make export Async (Send link in email for mp3 and wav. flac can be downloaded + instantly).

+

Fixes in Machine Generated VO. (Call serial TTS API)

+
+
+
+
+
+ +
+
+
+
+ V 2.1 +

July 2023

+

Optimized Editor.

+

Multiple speaker info in Video creation.

+

Speaker tagging:

+

a. Users can choose speaker while doing Transcription only.

+

b. Currently, age is not taken into account.

+

c. If users choose With Speaker Tagging during export of Transcription and + Translation, they will see the names of speakers.

+

d. In VoiceOver, Speech will be generated according to the speaker chosen during + transcription.

+

Export to docx-bilingual for translation.

+

Export VoiceOver to mp4/mp3/wav/flac. (Note: mp3/flac/wav can't be exported for older + VO tasks that are already completed.)

+

Users can update their languages in "My Profile."

+

Create Bulk task for Transcription Review, Translation Edit, Translation Edit and + VoiceOver Edit is enabled. A single user can be assigned bulk tasks.

+

Show only the videos for which tasks are assigned to the editors/reviewers.

+
+
+
+
+ +
+
+
+ V 2.0 +

June 2023

+

VoiceOver Task support.

+

Added Reports for Project and Organization.

+

Upload subtitles to YouTube.

+

API Migration to Dhruva for TTS and NMT.

+

Created Admin Portal (all the features for Admin mode -- creation/updation/deletion + of Organization).

+

Project Settings (Workflow integration and UI change).

+

Enabled RTL and Other User Settings for Transcription and Translation Pages.

+

Added Bulk Actions for creation of Task, Videos, Task Updation, Task/Video Deletion, + SubtitleEx Download.

+

Export Subtitle Feature in different formats.

+

Creation of videos and tasks by uploading CSV.

+

Created Custom Pagination.

+

Enabled Google SSO.

+

Recreated User Profile Section.

+

Made the App optimized and responsive.

+

Paginated Transcription and Translation.

+

Server side Pagination with server side search and filter.

+

Undo Redo feature for transcript/translation editing-review.

+

Bulk Upload of videos at Organization and Project level.

+

Instant Email notification to Editors/Reviewers, when their tasks become active.

+

Periodic Email notifications to Project Manager, about Completed tasks and Active + Tasks.

+
+
+
+
+
+ +
+
+
+
+ V 1.0 +

February 2023

+

Transcription and Translation editing support.

+

Project Management Hierarchy.

+

Maker & Checker process.

+

Transliteration based typing support wherever applicable.

+

Subtitles exporting feature in all popular formats.

+

Video and Audio (mp3) media support.

+

Sign Up/Sign In.

+

Roles based account setup.

+

Word Level Alignment view.

+

Basic Reports at every level.

+

Selection source comparison.

+
+
+
+
+ +
+ +
+ + + + + + + + + \ No newline at end of file diff --git a/html/shoonya.html b/html/shoonya.html index 06172ad..9a4d544 100644 --- a/html/shoonya.html +++ b/html/shoonya.html @@ -39,13 +39,15 @@

Shoonya

class="btn btn-custom">Tutorial API Specs + Blog
- +
@@ -53,229 +55,309 @@

Shoonya

Installation Instructions

-

Follow the steps below to install and run the AI Annotation Tool using Docker:

+

Follow the steps below to install and run the AI Annotation Tool:

+
Backend Setup
-# Step 1: Pull the Docker image
-docker pull annotation-tool:latest
+# Clone the Shoonya-Backend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Shoonya-Backend.git
 
-# Step 2: Run the Docker container
-docker run -d -p 8080:8080 annotation-tool
+# Create a virtual environment for the project. Replace  with your preferred environment name.
+python3 -m venv 
 
-# Step 3: Access the tool in your browser
-Open your browser and go to http://localhost:8080 to start using the tool.
-        
-

If you encounter any issues, please refer to the documentation or reach out to our support - team.

+# Activate the virtual environment. This ensures that the packages you install are isolated from the global Python environment. +source /bin/activate + +# Install all required Python packages listed in the requirements-dev.txt file. +pip install -r deploy/requirements-dev.txt + +# Set up the environment variables needed for the project by copying the example environment file. +cp .env.example ./backend/.env + +# Generate a new secret key for Django (within the virtual environment): +# Open a Python shell. +python backend/manage.py shell + + # Import the utility function to generate a secret key. + >> from django.core.management.utils import get_random_secret_key + + # Generate and print a new secret key. + >> get_random_secret_key() + +# Copy the generated secret key and paste it into the .env file as the value for SECRET_KEY. + +# Docker Installation +# Build the Docker containers as defined in the docker-compose-local.yml file. +docker-compose -f docker-compose-local.yml build + +# Run the containers in detached mode (-d flag). This will start up all the services defined in the Docker Compose file. +docker-compose -f docker-compose-local.yml up -d + +# Run Migrations +# The following steps are required only when you run the project for the first time or after making changes to the models. + + # Check if there are any pending migrations. + docker-compose exec web python backend/manage.py makemigrations + + # Apply all pending migrations to the database. + docker-compose exec web python backend/manage.py migrate + +# Create a superuser for accessing the Django admin interface (required only once). +docker-compose exec web python backend/manage.py createsuperuser + +# Run the Django development server within the Docker container. +docker-compose exec web python backend/manage.py runserver + +
Frontend Setup
+
+# Clone the Shoonya-Frontend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Shoonya-Frontend.git
+
+# Change directory to the newly cloned Shoonya-Frontend folder.
+cd Shoonya-Frontend
+
+# Install the necessary dependencies for the project. 
+# The --force flag is used to bypass conflicts with the existing dependencies.
+npm i --force
+
+# Start the development server. This will run the frontend application on a local server.
+npm start
+

If you encounter any issues, please refer to the backend + and frontend documentation or reach out to our support team.

-
-
-
-
-

Shoonya Releases

-

Lorem ipsum dolor sit amet, consectetur adipisicing elit

-
+
+
+
+

Shoonya Releases

+

Lorem ipsum dolor sit amet, consectetur adipisicing elit

+
+
+
+ +
+
+
+ V 3.0 +

May 2024

+

Flower Configuration for asynchronous celery tasks logging.

+

Deletion confirmation endpoint.

+

Added new notification types.

+

Access level changes for manager role.

+

Minor changes to analytics.

+

Added new project types - OCRSegmentationCategorization & editing.

+

Dataset and Task data privacy changes to hide public URLs.

+

New endpoint to add a proxy Audio URL service.

+

Line Charts based performance analytics.

+

Integration changes to AcousticNormalizedTranscriptionEditing project type.

+

Exception handling and code refactoring for backend codebase.

+

Download endpoints optimisations.

+

Endpoint support to store transliteration logs to blob storage.

+

User active/inactive integration.

+

Decentralized User Invitation to workspace managers.

+

Changes to email templates.

+
-
-
-
    -
  • -
    - V 3.0 -

    May 2024

    -

    - Flower Configuration for asynchronous celery tasks logging. - Deletion confirmation endpoint. - Added new notification types. - Access level changes for manager role. - Minor changes to analytics. - Added new project types - OCRSegmentationCategorization & editing - Dataset and Task data privacy changes to hide public URLs. - New endpoint to add a proxy Audio URL service. - Line Charts based performance analytics. - Integration changes to AcousticNormalizedTranscriptionEditing project type. - Exception handling and code refactoring for backend codebase. - Download endpoints optimisations. - Endpoint support to store transliteration logs to blob storage. - User active/inactive integration. - Decentralized User Invitation to workspace managers. - Changes to email templates. -

    -
    -
  • -
  • -
    - V 2.4 -

    October 2023

    -

    - Workspace and Organisation level Payment Reports. - Support for OCR, ASR data types predictions population. - Consider Batch sampling and automatic annotations creation support to pull new data - items. - Reports scheduling feature. - Backend support for Chitralekha UI for any Audio Project Types. - New project types Acoustic Normalized Transcription and Acoustic Normalized - Transcription Editing utilising CL UI. - Transliteration Logging support using Blob storage. - Support e-mail-based async calculation to all reports. - Code Refactoring and reformatting. - User profile picture upload and change feature using blob storage. - Elastic search and Kibana support for logging setup. - Download all projects within a workspace asynchronously using blob storage. - Some bug fixes for annotation filter, reports mail along with others. -

    -
    -
  • -
  • -
    - V 2.3 -

    July 2023

    -

    - Conversation Verification Project Type. - OCR Project Types modifications. - Option to change the stage of a project to Supercheck Stage. - All endpoints for Supercheck workflow. - Bug fixes for Assigning and Unassigning Tasks. - New field domain in TranslationPair datatype. - Superchecker notes. - Reports Bug fixes. - Frozen Users for Workspace. - Re-invite users. - Modification for Login and Change Password. - Automatic Annotation Creation for external data. - Intra-dataset Automation to populate draft_data_json. - More quality parameters in reports (WER, segment length etc). - Workspace-level analytics. -

    -
    -
  • -
  • -
    - V 2.1 -

    April 2023

    -

    - New user roles for Reviewer, Super-checker, and Admin. - Updated endpoints relevant to Reviewer and Admin roles. - Integration with new Indic-Trans-v2 deployed on Dhruva. - New field in the project model to support project stage in place of deprecated field - review enabled. - New annotation and task statuses to support super-checker flow in upcoming versions. - annotation_type field in all annotations to signify which role it belongs to. - New field named revision_loop_count in the task model to support bookkeeping. - New field named super_checker_user in the task model. - Changes to migrate user roles based on the appropriate work they are assigned to. -

    -
    -
  • -
  • -
    - V 2.0 -

    March 2023

    -

    - Design changes to have a status associated with each annotation - New project Types support for: - Domain Classification along with Sentence verification. - Audio Segmentation - Audio Transcription Editing with support for populating predictions. - Glossary support on annotation page of Translation Projects. - Support for reviewers to accept a task with major or minor changes. - Draft and skip option for reviewers. - Tags support to allow noise tagging in all Audio project types. - Improved UI to increase Audio Transcription productivity. - Word count/ Audio duration based public analytics. - Optimized project listing based on recently worked project. - Export fix in conversation translation editing project type. - Support search and filter based task flow for Start Labeling Now button - New task status to reflect whether a task is exported. - All tasks tab for managerial view of a project - Frontend bug fixes for Automate Datasets page. - Filters for Projects listing and Datasets listing pages. - Support to download all annotations from all tasks of a Translation Project. - Endpoint to allow managers to deallocate tasks for any user in the project. -

    -
    -
  • -
  • -
    - V 1.3 -

    November 2022

    -

    - New project type for Single speaker transcription editing. - Public API endpoint for language based Organization Analytics. - Complete support for all levels of review reports. - Annotation Quality Reports. - Backend support to categorize accepted with major/minor changes. - Search support for Dataset Items table. - Bulk delete endpoints for tasks and data items. - Support for Managers, Org Owners to be able to annotate tasks. -

    -
    -
  • -
  • -
    - V 1.2 -

    October 2022

    -

    - Improved User reports for annotations and review. - Patch to update conversation data type to support Machine Translations. - Celery-based implementation for Automated MT function for Conversation Data Type. - Removal of task-lock deprecated functionality. - Workspace-level User analytics for review workflow. - Endpoint to support bulk deletion of data items and all linked tasks. - Organization-level User analytics for review workflow. - Review workflow-based reports for the Analytics tab (publicly accessible endpoint). - Review reports for User-level progress. - Integration with Azure Translate. - TSV support for Projects download. - Endpoint to support Normalized character-level edit distance between sentences. - Support to filter by task status while downloading projects. - New Boolean field in the Users model to support user input for receiving daily - mails. - Endpoint and celery-beat setup for sending daily progress emails to Users - Support for annotation, and review reports for daily progress mails. -

    -
    -
  • -
  • -
    - V 1.1 -

    August 2022

    -

    - Rename Task status rejected to to_be_revised. - Fix a bug in the review feature. - Add reviewer reports for project analytics. - Refactor User field to Annotator in Project model. - Functionality to remove users from workspace. - Code refactoring to take user_id instead of username or email in various endpoints. - Support automated transformation of Sentence Text Datasets to Translation Pairs - Dataset involving functions for Generating Machine Translations using IndicTrans and - Google Translate models. -

    -
    -
  • -
  • -
    - V 1.0 -

    July 2022

    -

    - Supports all the 22 official Indian languages - Currently support Sentence Verification tasks, Context Translation Verification - project types - Provides AI support with translation - Cleaner hierarchy of Organization, Workspace, Projects. - Reports at various levels (Org, workspace, project, user) and multiple dimensions - Allow creation of task chains and custom inputs as required by Language Experts - Enables language coordinators to enable effective collaboration (Shareable Notes, - Drafts) - RTL and Transliteration based support -

    -
    -
  • -
+
+
+ +
+
+
+
+ V 2.4 +

October 2023

+

Workspace and Organisation level Payment Reports.

+

Support for OCR, ASR data types predictions population.

+

Consider Batch sampling and automatic annotations creation support to pull new data + items.

+

Reports scheduling feature.

+

Backend support for Chitralekha UI for any Audio Project Types.

+

New project types Acoustic Normalized Transcription and Acoustic Normalized + Transcription Editing utilizing CL UI.

+

Transliteration Logging support using Blob storage.

+

Support e-mail-based async calculation to all reports.

+

Code Refactoring and reformatting.

+

User profile picture upload and change feature using blob storage.

+

Elastic search and Kibana support for logging setup.

+

Download all projects within a workspace asynchronously using blob storage.

+

Some bug fixes for annotation filter, reports mail along with others.

+
+
+
+
+ +
+
+
+ V 2.3 +

July 2023

+

Conversation Verification Project Type.

+

OCR Project Types modifications.

+

Option to change the stage of a project to Supercheck Stage.

+

All endpoints for Supercheck workflow.

+

Bug fixes for Assigning and Unassigning Tasks.

+

New field domain in TranslationPair datatype.

+

Superchecker notes.

+

Reports Bug fixes.

+

Frozen Users for Workspace.

+

Re-invite users.

+

Modification for Login and Change Password.

+

Automatic Annotation Creation for external data.

+

Intra-dataset Automation to populate draft_data_json.

+

More quality parameters in reports (WER, segment length, etc.).

+

Workspace-level analytics.

+
+
+
+
+
+ +
+
+
+
+ V 2.1 +

April 2023

+

New user roles for Reviewer, Super-checker, and Admin.

+

Updated endpoints relevant to Reviewer and Admin roles.

+

Integration with new Indic-Trans-v2 deployed on Dhruva.

+

New field in the project model to support project stage in place of deprecated field + review enabled.

+

New annotation and task statuses to support super-checker flow in upcoming versions. +

+

annotation_type field in all annotations to signify which role it belongs to.

+

New field named revision_loop_count in the task model to support bookkeeping.

+

New field named super_checker_user in the task model.

+

Changes to migrate user roles based on the appropriate work they are assigned to.

+
+
+
+
+ +
+
+
+ V 2.0 +

March 2023

+

Design changes to have a status associated with each annotation.

+

New project Types support for:

+

- Domain Classification along with Sentence verification.

+

- Audio Segmentation.

+

- Audio Transcription Editing with support for populating predictions.

+

Glossary support on annotation page of Translation Projects.

+

Support for reviewers to accept a task with major or minor changes.

+

Draft and skip option for reviewers.

+

Tags support to allow noise tagging in all Audio project types.

+

Improved UI to increase Audio Transcription productivity.

+

Word count/Audio duration based public analytics.

+

Optimized project listing based on recently worked project.

+

Export fix in conversation translation editing project type.

+

Support search and filter based task flow for Start Labeling Now button.

+

New task status to reflect whether a task is exported.

+

All tasks tab for managerial view of a project.

+

Frontend bug fixes for Automate Datasets page.

+

Filters for Projects listing and Datasets listing pages.

+

Support to download all annotations from all tasks of a Translation Project.

+

Endpoint to allow managers to deallocate tasks for any user in the project.

+
+
+
+
+
+ +
+
+
+
+ V 1.3 +

November 2022

+

New project type for Single speaker transcription editing.

+

Public API endpoint for language-based Organization Analytics.

+

Complete support for all levels of review reports.

+

Annotation Quality Reports.

+

Backend support to categorize accepted with major/minor changes.

+

Search support for Dataset Items table.

+

Bulk delete endpoints for tasks and data items.

+

Support for Managers, Org Owners to be able to annotate tasks.

+
+
+
+
+ +
+
+
+ V 1.2 +

October 2022

+

Improved User reports for annotations and review.

+

Patch to update conversation data type to support Machine Translations.

+

Celery-based implementation for Automated MT function for Conversation Data Type.

+

Removal of task-lock deprecated functionality.

+

Workspace-level User analytics for review workflow.

+

Endpoint to support bulk deletion of data items and all linked tasks.

+

Organization-level User analytics for review workflow.

+

Review workflow-based reports for the Analytics tab (publicly accessible endpoint). +

+

Review reports for User-level progress.

+

Integration with Azure Translate.

+

TSV support for Projects download.

+

Endpoint to support Normalized character-level edit distance between sentences.

+

Support to filter by task status while downloading projects.

+

New Boolean field in the Users model to support user input for receiving daily mails. +

+

Endpoint and celery-beat setup for sending daily progress emails to Users.

+

Support for annotation and review reports for daily progress mails.

+
+
+
+
+
+ +
+
+
+
+ V 1.1 +

August 2022

+

Rename Task status rejected to to_be_revised.

+

Fix a bug in the review feature.

+

Add reviewer reports for project analytics.

+

Refactor User field to Annotator in Project model.

+

Functionality to remove users from workspace.

+

Code refactoring to take user_id instead of username or email in various endpoints. +

+

Support automated transformation of Sentence Text Datasets to Translation Pairs + Dataset involving functions for Generating Machine Translations using IndicTrans and + Google Translate models.

+
+
+
+
+ +
+
+
+ V 1.0 +

July 2022

+

Supports all the 22 official Indian languages.

+

Currently supports Sentence Verification tasks, Context Translation Verification + project types.

+

Provides AI support with translation.

+

Cleaner hierarchy of Organization, Workspace, Projects.

+

Reports at various levels (Org, workspace, project, user) and multiple dimensions.

+

Allows creation of task chains and custom inputs as required by Language Experts.

+

Enables language coordinators to enable effective collaboration (Shareable Notes, + Drafts).

+

RTL and Transliteration based support.

+
+
@@ -291,4 +373,4 @@

July 2022

- + \ No newline at end of file diff --git a/styles/ai4b-tools.css b/styles/ai4b-tools.css index dc6281e..bbec4bb 100644 --- a/styles/ai4b-tools.css +++ b/styles/ai4b-tools.css @@ -112,63 +112,77 @@ margin-left: -5px; border-radius: 50%; } -ul.timeline-list { - position: relative; - margin: 0; - padding: 0 -} -ul.timeline-list:before { - position: absolute; - content: ""; - width: 2px; - height: 100%; - background-color: rgb(255, 102, 0); - left: 50%; - top: 0; - -webkit-transform: translateX(-50%); - transform: translateX(-50%); -} -ul.timeline-list li { - position: relative; - clear: both; - display: table; -} -.timeline_content { - border: 2px solid rgb(255, 102, 0); - background-color:#fff -} -ul.timeline-list li .timeline_content { - width: 45%; - color: #333; - padding: 30px; - float: left; - text-align: right; -} -ul.timeline-list li:nth-child(2n) .timeline_content { - float: right; - text-align: left; -} -.timeline_content h4 { + +.timeline-content h4 { font-size: 22px; font-weight: 600; margin: 10px 0; } -ul.timeline-list li:before { - position: absolute; - content: ""; - width: 25px; - height: 25px; - background-color: rgb(255, 102, 0); - left: 50%; - top: 50%; - -webkit-transform: translate(-50%, -50%); - transform: translate(-50%, -50%); - border-radius: 50%; +.timeline-content p{ + margin: 0; } -.timeline_content span { + +.timeline-content span { font-size: 18px; font-weight: 500; font-family: poppins; color: rgb(255, 102, 0); } + +.timeline-row { + display: flex; + text-align: center; +} + +.timeline-col { + flex: 1; + position: relative; + padding: 10px; +} + +.timeline-col-right { + border-right: solid 2px rgb(255, 102, 0); +} + +.timeline-col-left { + border-left: solid 2px rgb(255, 102, 0); +} + +.timeline-content { + border: solid 2px rgb(255, 102, 0); + margin: 0.5% 5%; + padding: 4%; + width: fit-content; +} + +.timeline-col-right .timeline-content { + text-align: right; + margin-left: auto; +} + +.timeline-col-left .timeline-content { + text-align: left; + margin-right: auto; +} + +.timeline-circle { + width: 20px; + height: 20px; + background-color: rgb(255, 102, 0); + border-radius: 50%; + position: absolute; + top: 50%; + transform: translate(-50%, -50%); +} + +.timeline-col-right .timeline-circle { + right: -1px; + transform: translate(50%, -50%); +} + +.timeline-col-left .timeline-circle { + left: -1px; + transform: translate(-50%, -50%); +} + /* End style for the tools pages */ From 31357897cf7ad7c641bc7e2a1076b1f1ce528243 Mon Sep 17 00:00:00 2001 From: kartikvirendrar Date: Thu, 22 Aug 2024 16:58:44 +0530 Subject: [PATCH 4/4] created anudesh page --- html/anudesh.html | 203 ++++++++++++++++++++++++++++++++++++++++++ html/chitralekha.html | 2 +- html/shoonya.html | 2 +- 3 files changed, 205 insertions(+), 2 deletions(-) create mode 100644 html/anudesh.html diff --git a/html/anudesh.html b/html/anudesh.html new file mode 100644 index 0000000..109e32f --- /dev/null +++ b/html/anudesh.html @@ -0,0 +1,203 @@ + + + + + + + AI Annotation Tool + + + + + + + + + + + +
+ +
+
+ +
+
+

Anudesh

+

Anudesh is an open source platform to annotate Large language model's data at scale, built with a vision to enhance digital presence of under-represented languages in India. +

+ + +
+
+ + +
+ +
+
+
+ + +
+

Installation Instructions

+

Follow the steps below to install and run the AI Annotation Tool:

+
Backend Setup
+
+# Clone the Anudesh-Backend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Anudesh-Backend.git
+            
+# Create a virtual environment for the project. Replace  with your preferred environment name.
+python3 -m venv 
+
+# Activate the virtual environment. This ensures that the packages you install are isolated from the global Python environment.
+source /bin/activate
+
+# Install all required Python packages listed in the requirements-dev.txt file.
+pip install -r deploy/requirements-dev.txt
+
+# Set up the environment variables needed for the project by copying the example environment file.
+cp .env.example ./backend/.env
+
+# Generate a new secret key for Django (within the virtual environment):
+# Open a Python shell.
+python backend/manage.py shell
+
+    # Import the utility function to generate a secret key.
+    >> from django.core.management.utils import get_random_secret_key
+
+    # Generate and print a new secret key.
+    >> get_random_secret_key()
+
+# Copy the generated secret key and paste it into the .env file as the value for SECRET_KEY.
+
+# Docker Installation
+# Build the Docker containers as defined in the docker-compose-local.yml file.
+docker-compose -f docker-compose-local.yml build
+
+# Run the containers in detached mode (-d flag). This will start up all the services defined in the Docker Compose file.
+docker-compose -f docker-compose-local.yml up -d
+
+# Run Migrations
+# The following steps are required only when you run the project for the first time or after making changes to the models.
+
+    # Check if there are any pending migrations.
+    docker-compose exec web python backend/manage.py makemigrations 
+
+    # Apply all pending migrations to the database.
+    docker-compose exec web python backend/manage.py migrate
+
+# Create a superuser for accessing the Django admin interface (required only once).
+docker-compose exec web python backend/manage.py createsuperuser
+
+# Run the Django development server within the Docker container.
+docker-compose exec web python backend/manage.py runserver
+
+
Frontend Setup
+
+# Clone the Anudesh-Frontend repository from GitHub to your local machine.
+git clone https://github.com/AI4Bharat/Anudesh-Frontend.git
+
+# Change directory to the newly cloned Anudesh-Frontend folder.
+cd Anudesh-Frontend
+
+# Install the necessary dependencies for the project. 
+# The --force flag is used to bypass conflicts with the existing dependencies.
+npm i --force
+
+# Start the development server. This will run the frontend application on a local server.
+npm run dev
+

If you encounter any issues, please refer to the backend + and frontend documentation or reach out to our support team.

+
+ +
+
+
+
+

Anudesh Releases

+

Lorem ipsum dolor sit amet, consectetur adipisicing elit

+
+
+
+ +
+
+
+ V 1.0 +

2024

+

+
+
+
+
+
+ + + +
+ +
+ + + + + + + + + \ No newline at end of file diff --git a/html/chitralekha.html b/html/chitralekha.html index dfba3ef..242ff34 100644 --- a/html/chitralekha.html +++ b/html/chitralekha.html @@ -73,7 +73,7 @@

Chitralekha

- +
diff --git a/html/shoonya.html b/html/shoonya.html index 9a4d544..8d06e7a 100644 --- a/html/shoonya.html +++ b/html/shoonya.html @@ -47,7 +47,7 @@

Shoonya

- +