Configure sroie_data_preprocessing.py for expand CLASS_LIST #27
Comments
Hello again. My changed parameters for the expanded SROIE_CLASS_LIST are as follows:
After that, I changed the processing code as follows:
My result was created as follows, which is in
I think something is not right. Am I wrong? Thank you in advance.
Sorry for my delayed response. Are you currently working on a custom dataset or simply expanding the category types of SROIE?
If you are modifying the SROIE dataset, one approach is to retrieve the OCR content of the key fields using string similarity. This may return multiple matches for a single field. To pick the right one, you can rely on the coordinates. For instance, a field such as "tax" will often have an adjacent string that contains the keyword "TAX". Note that the accuracy of the matched labels can significantly affect the final performance. If it is feasible, I highly recommend manually labeling the OCR results for better performance.
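The approach described above could be sketched roughly as follows. This is only an illustration, not code from the repository: the helper names (`similarity`, `find_candidates`, `pick_by_keyword`), the dictionary layout of an OCR line, and the 0.8 threshold are all assumptions. It uses Python's standard `difflib.SequenceMatcher` for the similarity score and the vertical distance to a keyword line to choose between multiple matches.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_candidates(value, ocr_lines, threshold=0.8):
    """Return every OCR line whose text is similar enough to the key value.

    `ocr_lines` is assumed to be a list of dicts with "text", "x" and "y"
    keys (a hypothetical layout chosen for this sketch).
    """
    return [ln for ln in ocr_lines if similarity(value, ln["text"]) >= threshold]


def pick_by_keyword(candidates, ocr_lines, keyword):
    """Prefer the candidate that is vertically closest to a keyword line.

    If no line contains the keyword, the first candidate is returned.
    Returns None when there are no candidates at all.
    """
    anchors = [ln for ln in ocr_lines if keyword.lower() in ln["text"].lower()]

    def distance(cand):
        return min((abs(cand["y"] - a["y"]) for a in anchors), default=float("inf"))

    return min(candidates, key=distance, default=None)


# Hypothetical receipt: the value "0.60" appears twice, once next to "TAX".
ocr_lines = [
    {"text": "TOTAL", "x": 10, "y": 100},
    {"text": "12.35", "x": 200, "y": 100},
    {"text": "TAX", "x": 10, "y": 120},
    {"text": "0.60", "x": 200, "y": 120},
    {"text": "0.60", "x": 200, "y": 300},
]
candidates = find_candidates("0.60", ocr_lines)
best = pick_by_keyword(candidates, ocr_lines, "TAX")
```

Here both "0.60" lines pass the similarity check, and the one on the same row as "TAX" is selected. Wrong matches at this step become wrong training labels, which is why manual labeling is still the safer option.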
Thank you for your interest. I am working on my own dataset, not the original SROIE dataset.
Actually, I have built my own dataset on SROIE's dataset structure. I mean I have an image, a box txt file, and a JSON key txt file. Everything is okay on my side: the bbox coordinates and the OCR result text are correct. Now I am trying to convert my custom SROIE-structured dataset for your model by using According to this introduction
Thank you again. I do not want to use regex for matching. Could that be a problem for getting the best result? And is the following code modification correct?
In def ground_truth_extraction,
I think your code can handle the case well. You may set different
Thank you so much again for your great project and support. I will try.
Hello again. I need help with expanding CLASS_LIST. First, thank you in advance for your support.
I have configured
SROIE_CLASS_LIST = ["others", "company", "address", "document_number", "date_time", "total", "tax"]
A sample box file is as follows:
A sample key file is as follows:
Given the data above, how should I modify the following code?
Also, I do not want to use regex to fix the data pattern. I would like to match on just the raw text.
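For reference, regex-free labeling against the expanded class list could look roughly like the sketch below. It is not the repository's `ground_truth_extraction`. It assumes the box file follows the usual SROIE layout of eight comma-separated coordinates followed by the text, and that the key file is a JSON object whose keys are the class names. The helpers `normalize`, `parse_box_line` and `label_line`, the sample values, and the `min_partial_len` rule are hypothetical.

```python
import json

SROIE_CLASS_LIST = ["others", "company", "address", "document_number",
                    "date_time", "total", "tax"]


def normalize(text):
    """Collapse whitespace and ignore case; no pattern rewriting."""
    return " ".join(text.split()).casefold()


def parse_box_line(line):
    """Split 'x1,y1,x2,y2,x3,y3,x4,y4,text' into (coords, text).

    maxsplit=8 keeps any commas that belong to the text itself.
    """
    parts = line.rstrip("\n").split(",", 8)
    return [int(p) for p in parts[:8]], parts[8]


def label_line(text, key_fields, min_partial_len=4):
    """Return the class whose key value equals or contains the raw text.

    Containment covers values spread over several OCR lines (for example
    an address); very short texts must match exactly to limit false hits.
    """
    norm = normalize(text)
    for name in SROIE_CLASS_LIST[1:]:
        value = normalize(str(key_fields.get(name, "")))
        if not value:
            continue
        if norm == value or (len(norm) >= min_partial_len and norm in value):
            return name
    return "others"


# Made-up sample data in the assumed formats.
key_fields = json.loads(
    '{"company": "ACME SDN BHD",'
    ' "address": "12 JALAN ABC, 50000 KUALA LUMPUR",'
    ' "document_number": "INV-001",'
    ' "date_time": "2023-01-05 10:30",'
    ' "total": "12.35", "tax": "0.60"}'
)
box_lines = [
    "10,10,200,10,200,30,10,30,ACME SDN BHD",
    "10,40,200,40,200,60,10,60,12 JALAN ABC, 50000 KUALA LUMPUR",
    "10,70,60,70,60,90,10,90,TAX",
    "150,70,200,70,200,90,150,90,0.60",
    "150,100,200,100,200,120,150,120,12.35",
]
labels = [label_line(parse_box_line(ln)[1], key_fields) for ln in box_lines]
```

One caveat with pure text matching: when two fields share the same value (for example an identical total and tax), the first class in the list wins, so coordinates or manual labels are still needed to separate them.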