Need python3 scripts.
* We need this to be fast so threading is ideal so we run on cpu with multiple cores.
1: ocr 2 images.
Input: csv and directory of images
Read a row from the csv
From two of the fields in the row are image names that we will use to ocr each image.
Get the output strings from the ocr and test using regex to identify key words.
If we find a match then we need to mark row with and extra field if match occurred.
2: rename files
input: csv and directory of images
read a row from the csv
if field match is true in row then we need to rename files.
Hi.
I've got some experience in this. I developed an e-filing system for a legal company I worked for as my first major python project and can easily reuse my code to do this. Are you aware of the limitations for OCR though? Depending on how accurate you want the system to be you might need some person going through them afterwards. I worked with it a lot and only found our pattern about just over 90% of the time and a certain amount of the found pattern still had one to two characters off. Its just the nature of it.
Just so you know I would be using a python script and tesseract for the OCR.