ocr result appear other language word #251

zhujun5164 · 2024-11-27T06:36:17Z

Hi

I have set the langs to 'zh' when using the OCR, but the recognition result contains Japanese words or words from other languages. How can I fix this, or limit the OCR result to my own word dictionary?

thx

zhujun5164 · 2024-11-27T06:51:27Z

code

from PIL import Image
from surya.ocr import run_ocr
from surya.model.detection.model import load_model as load_det_model, load_processor as load_det_processor
from surya.model.recognition.model import load_model as load_rec_model
from surya.model.recognition.processor import load_processor as load_rec_processor
from surya.recognition import batch_recognition

# The input is a single image containing the text to recognize
image = Image.open('text.png')
langs = ["en"] # Replace with your languages - optional but recommended
# Detection model and processor are loaded here but not used below
det_processor, det_model = load_det_processor(), load_det_model()
rec_model, rec_processor = load_rec_model(), load_rec_processor()

# Run recognition directly on the image, with one language list per image
predictions = batch_recognition([image], [langs], rec_model, rec_processor)
print(predictions)

image

output
([' ସଙ୍କିତ '], [0.77392578125])

EHadoux · 2024-12-11T10:36:40Z

Well, I actually came here to say the same, but with English. I deal with financial reports, and it often replaces the £ with "છ" or "רא" or "મ" or "દ".
I must say in all fairness that the rest is great.
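
Until there is a way to restrict the output, one possible workaround is to post-process the recognized text and flag characters from scripts you do not expect. The sketch below is not part of surya; `unexpected_chars` and `allowed_scripts` are hypothetical names, and it relies only on Python's standard `unicodedata` module. It assumes that the first word of a character's Unicode name (for example LATIN, CJK, ORIYA, GUJARATI, HEBREW) is a good enough proxy for its script.

```python
import unicodedata

def unexpected_chars(text, allowed_scripts=("LATIN",)):
    """Return the letters and marks in `text` that fall outside the allowed scripts.

    Hypothetical helper, not a surya API. `allowed_scripts` holds prefixes of
    Unicode character names, e.g. ("LATIN",) for English or ("CJK", "LATIN")
    for Chinese mixed with Latin text.
    """
    flagged = []
    for ch in text:
        # Only letters (L*) and combining marks (M*) are checked; digits,
        # punctuation, whitespace, and symbols such as "£" are always accepted.
        if unicodedata.category(ch)[0] not in ("L", "M"):
            continue
        # Unnamed characters get an empty name and are therefore flagged.
        name = unicodedata.name(ch, "")
        if not name.startswith(tuple(allowed_scripts)):
            flagged.append(ch)
    return flagged
```

For example, `unexpected_chars("Revenue છ120m")` returns `['છ']`, so that line could be sent for re-recognition or manual review. With `allowed_scripts=("CJK", "LATIN")` the check catches hiragana and katakana, but it cannot tell Japanese kanji from Chinese characters, because both are encoded as CJK ideographs.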
