This is a Java and Python program that bypasses the captcha on the VAHAN-NIC site and retrieves user information from it.
Logic:
- Download the captcha image
- Read it with Tesseract OCR
- Use Selenium to interact with the page and fill in the fields
- Retrieve and display the data
There are sites on the internet that do this job without asking for the captcha. This can be used with data extraction to make it faster.
For Java, you may use Tess4J to write the OCR program.