STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #3

jim-gyas · 2024-11-14T05:10:35Z

Description:

Develop a pipeline to process audio transcription data by parsing the catalog, segmenting audio, and generating inference transcriptions. Validate and correct the transcriptions using an LLM to align them with reference texts while preserving context. Save corrected transcriptions and metadata to a structured CSV, with logs capturing errors and issues.

Resources:

Inference Transcription and Transfer text: news-catalog

Completion Criteria:

Successfully parse the catalog, extract metadata, download audio files, and split them into segments based on duration limits.
Generate inference transcriptions for each audio segment and validate them against the reference transcriptions.
Correct inference transcriptions using an LLM, aligning them with reference texts while retaining context and accuracy.
Save corrected transcriptions and metadata to a CSV file, with detailed logs of errors and issues during the process.

Implementation:

Updated Subtasks:

This updated approach reflects the intermediate creation of a Transfer Text CSV file using the transfer_text function and its role in correcting the split audio inference transcriptions.

Card Reviewer: @kaldan007

Reviewed.

The text was updated successfully, but these errors were encountered:

jim-gyas · 2024-11-19T05:17:52Z

API account has insufficient credit to access the Claude API.

jim-gyas · 2024-11-20T04:39:55Z

@kaldan007 and @gangagyatso4364, could you please review my card?

gangagyatso4364 · 2024-11-20T05:23:44Z

add the step to make cer comparison between all three kinds of transcript generated at last.

gangagyatso4364 · 2024-11-20T05:24:30Z

looks good go ahead.

jim-gyas · 2024-11-27T05:19:23Z

@gangagyatso4364, could you please review my updated card?

jim-gyas · 2024-11-27T06:30:28Z

jim-gyas · 2024-11-28T06:36:03Z

file_name,url,inference_transcript,audio_duration,corrected_transcript,is_changed
STT_NW0802_0001_216_to_2528,https://d38pmlk0v88drf.cloudfront.net/wav16k/STT_NW0802_0001_216_to_2528.wav,ཕྱི་ལོ་ཉིས་སྟོང་ཉི་ཤུ་རྩ་བཞི་ལོའི་ཟླ་བ་བརྒྱད་པའི་ནང་།,2.312,དགའ་ལྡན་ཁྲི་ཐོག་ཁང་དང་མཁས་མང་བློ་གསལ་བྱེ་བའི་གླིང་གྲྭ་ཚང་བཅས་ནས་བརྟག་ཞུ་ཕུལ་དོན་བཞིན། བོད་མིའི་བླ་ན་མེད་པའི་དབུ་ཁྲིད་སྤྱི་ནོར་གྱི་གོང་ས་སྐྱབས་མགོན་ཆེན་པོ་མཆོག་ནས་རྒྱ་ག,True
STT_NW0802_0002_2798_to_7662,https://d38pmlk0v88drf.cloudfront.net/wav16k/STT_NW0802_0002_2798_to_7662.wav,རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་གོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།,4.864,"Here is the corrected transcription:

    རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་གོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།",True
STT_NW0802_0003_7886_to_12686,https://d38pmlk0v88drf.cloudfront.net/wav16k/STT_NW0802_0003_7886_to_12686.wav,ཕྱི་ཟླ་བརྒྱད་པའི་ནང་རྒྱ་ནག་གཞུང་གི་མགོ་ལོག་མངའ་ཁུལ་གྱི་ས་གནས་གང་སར་དམ་བསྒྲགས་ཤུགས་ཆེར་ཆེ་ཡོད་པ་དང་།,4.8,"Here is the corrected transcription:

    དགའ་ལྡན་ཁྲི་ཐོག་ཁང་དང་མཁས་མང་བློ་གསལ་བྱེ་བའི་གླིང་གྲྭ་ཚང་བཅས་ནས་བརྟག་ཞུ་ཕུལ་དོན་བཞིན། བོད་མིའི་བླ་ན་མེད་པའི་དབུ་ཁྲིད་སྤྱི་ནོར་གླིང་གྲྭ་ཚང་གི་མཁན་པོ་གསར་པའི་ཁྲི་འདོན་",True
STT_NW0802_0004_12814_to_15726,https://d38pmlk0v88drf.cloudfront.net/wav16k/STT_NW0802_0004_12814_to_15726.wav,ལྷག་པར་དགན་སྡེ་ཁག་ཏུ་ཆོས་ཕྱོགས་ཀྱི་བྱེད་སྒོ་ལ་དམ་སྒྲགས་དང་།,2.912,"Here is the transcription with spelling corrections:

    ལྷག་པར་དགན་སྡེ་ཁག་ཏུ་ཆོས་ཕྱོགས་ཀྱི་བྱེད་སྒོ་ལ་དམ་སྒྲགས་དང་། 
    དགའ་ལྡན་ཁྲི་ཐོག་ཁང་དང་མཁས་མང་བློ་གསལ་བྱེ་བའི་གླིང་གྲྭ་ཚང་བཅས་ནས་བརྟག་ཞུ་ཕུལ་དོན་བཞིན། བོད་མིའི་བླ་ན་མེད་པ",True

jim-gyas · 2024-12-02T06:07:04Z

Transfer_text Based Correction

No Spelling Mistake in Inference Transcript:

filename: STT_NW0802_0001_216_to_2528,
inference_transcript: ཕྱི་ལོ་ཉིས་སྟོང་ཉི་ཤུ་རྩ་བཞི་ལོའི་ཟླ་བ་བརྒྱད་པའི་ནང་།,
corrected_transcript: ཕྱི་ལོ་ཉིས་སྟོང་ཉི་ཤུ་རྩ་བཞི་ལོའི་ཟླ་བ་བརྒྱད་པའི་ནང་།,
is_changed: False

Correction of Spelling Mistake:

filename: STT_NW0802_0002_2798_to_7662,
inference_transcript: རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་གོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།,
corrected_transcript: རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་མགོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།
is_changed: True

Reason:

Inference_transcript:
རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་གོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།
(This text uses གོ་ལོག་ཁུལ.)
corrected_transcript:
རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་མགོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་བདུན་ཅུ་འཁོ་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།
(This text uses མགོ་ལོག་ཁུལ.)

Difference:
In inference_transcript, the phrase གོ་ལོག་ཁུལ is used, whereas in corrected_transcript, it is replaced with མགོ་ལོག་ཁུལ.

jim-gyas · 2024-12-04T08:37:08Z

རྒྱ་ནག་གཞུང་གིས་མཚོ་སྔོན་ཞིང་ཆེན་མགོ་ལོག་ཁུལ་བཙུགས་ནས་ལོ་འཁོར་ ༧༠ འཁོར་བའི་མཛད་སྒོ་འཚོགས་ཡོད་པ་བཞིན།

jim-gyas self-assigned this Nov 14, 2024

jim-gyas transferred this issue from OpenPecha/transcription-aligner Nov 20, 2024

kaldan007 transferred this issue from OpenPecha/stt-split-audio Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #3

STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #3

jim-gyas commented Nov 14, 2024 •

edited

Loading

jim-gyas commented Nov 19, 2024

jim-gyas commented Nov 20, 2024

gangagyatso4364 commented Nov 20, 2024 •

edited

Loading

gangagyatso4364 commented Nov 20, 2024

jim-gyas commented Nov 27, 2024

jim-gyas commented Nov 27, 2024

jim-gyas commented Nov 28, 2024

jim-gyas commented Dec 2, 2024

jim-gyas commented Dec 4, 2024 •

edited

Loading

STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #3

STT0073: LLM-Based Correction of Inference Transcriptions Using Claude AI #3

Comments

jim-gyas commented Nov 14, 2024 • edited Loading

Description:

Resources:

Completion Criteria:

Implementation:

Updated Subtasks:

Card Reviewer: @kaldan007

jim-gyas commented Nov 19, 2024

API account has insufficient credit to access the Claude API.

jim-gyas commented Nov 20, 2024

gangagyatso4364 commented Nov 20, 2024 • edited Loading

gangagyatso4364 commented Nov 20, 2024

jim-gyas commented Nov 27, 2024

jim-gyas commented Nov 27, 2024

jim-gyas commented Nov 28, 2024

jim-gyas commented Dec 2, 2024

Transfer_text Based Correction

No Spelling Mistake in Inference Transcript:

Correction of Spelling Mistake:

jim-gyas commented Dec 4, 2024 • edited Loading

jim-gyas commented Nov 14, 2024 •

edited

Loading

gangagyatso4364 commented Nov 20, 2024 •

edited

Loading

jim-gyas commented Dec 4, 2024 •

edited

Loading