A Python-based tool for comparing translated .docx documents against their original versions. It highlights differences, calculates similarity metrics, and generates detailed comparison reports, including suggested corrections.

CoderPena/TranslationComparator

Translation Comparison and Accuracy Verifier

Description

This program compares an English translation of a document with its original Portuguese version to verify translation accuracy. It is written in Python and uses python-docx (imported as docx), deep_translator, difflib, and tqdm to extract content, translate text, and generate detailed reports highlighting differences.

Features

  • Text Extraction: Extracts content from .docx files, including paragraphs and table data.
  • Automated Translation: Utilizes the deep_translator library with GoogleTranslator for translating Portuguese text into English.
  • Similarity Analysis: Scores each pair of original and translated texts and flags pairs that fall below a configurable similarity threshold.
  • Report Generation: Produces comprehensive reports detailing differences, including context and suggested corrections.
  • Customizable Threshold: Allows users to define the similarity threshold to refine comparison sensitivity.
  • Progress Tracking: Displays progress bars for text processing and comparisons using tqdm.
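
The similarity check can be illustrated with difflib from the standard library. The following is a minimal sketch, not the project's actual code: the function names `similarity` and `find_mismatches`, the positional pairing of segments, and the default threshold of 0.8 are all assumptions made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; 1.0 means the normalized strings are identical."""
    # Lowercasing and stripping are illustrative normalization choices.
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def find_mismatches(reference, candidate, threshold=0.8):
    """Pair segments by position and return those scoring below the threshold.

    `threshold=0.8` is a hypothetical default, not a value taken from the project.
    """
    mismatches = []
    for index, (ref, cand) in enumerate(zip(reference, candidate)):
        score = similarity(ref, cand)
        if score < threshold:
            mismatches.append((index, score, ref, cand))
    return mismatches


if __name__ == "__main__":
    reference = ["The contract starts in January.", "Payment is due in thirty days."]
    candidate = ["The contract starts in January.", "Completely unrelated sentence here"]
    for index, score, ref, cand in find_mismatches(reference, candidate):
        print(f"segment {index}: similarity {score:.2f}")
        print(f"  expected: {ref}")
        print(f"  found:    {cand}")
```

Raising the threshold makes the comparison stricter, so more segments are reported; lowering it reports only segments that differ substantially.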

Setup

Prerequisites:

  • Python 3.7+
  • Install the required libraries:
    pip install python-docx deep-translator tqdm
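
With python-docx installed, extracting paragraph and table text from a .docx file might look like the sketch below. The helper names `extract_text` and `load_segments` are hypothetical, and the project's own extraction code may differ. Only the `Document`, `paragraphs`, `tables`, `rows`, `cells`, and `text` members of python-docx are used.

```python
def extract_text(document):
    """Collect non-empty paragraph and table-cell text from a Document-like object.

    Body paragraphs are gathered first, then table cells, so the result does not
    preserve the exact interleaving of paragraphs and tables in the file.
    """
    segments = []
    for paragraph in document.paragraphs:
        text = paragraph.text.strip()
        if text:
            segments.append(text)
    for table in document.tables:
        for row in table.rows:
            for cell in row.cells:
                text = cell.text.strip()
                if text:
                    segments.append(text)
    return segments


def load_segments(path):
    """Open a .docx file with python-docx and return its text segments."""
    # Imported here so extract_text stays usable without python-docx installed.
    from docx import Document

    return extract_text(Document(path))
```

Calling `load_segments("original.docx")` (the file name is a placeholder) would return a list of strings ready for translation and comparison.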
    
