4 results found
-
Enhance OCR accuracy to distinguish currency symbol and number
Currently, in Document Matching, Form Extraction, etc., a currency symbol such as "¥" is recognized as a number"1" or "7" when extracting the amount of money, especially if it is handwritten.
Ideally, improve the accuracy of OCR to distinguish between currency symbol and number.17 votes -
OCR Health Score
Implement a confidence scoring system to provide users with insights into the accuracy of OCR-extracted text. I envision this feature offering both document-level and word/number-level confidence scores, empowering users to evaluate the reliability of the extracted data and make informed decisions.
Granular Confidence Levels: Clearly define confidence score ranges (e.g., high, medium, low) and provide corresponding probability values for better interpretability.
Visual Indicators: Incorporate visual cues (e.g., color-coding, icons) to quickly convey confidence levels, enhancing user experience.5 votes -
Export OCR PDF file >> keep OCR
If I, after running OCR on a PDF, export the pdf to my computer, the text is unrecognized and I can't search it anymore using for example Acrobat. It would be nice and usefull if a PDF that was text recognised, stays that way after export.
1 vote -
Run OCR only for the portions of that document which do not contain computer generated text
OCR currently overwrites all computer generated text, sometimes with worse results. OCR may be required if a portion of the document contains computer generated text but a portion does not. In these instances it would be ideal to only run use the OCR to recognize the text that is not yet included in the text layer of the file.
3 votes
- Don't see your idea?