Core

Hello there,

We are constantly looking for ways to improve DataSnipper, and we believe that the best way to do this is by listening to our users. We would love to hear your ideas and suggestions for how we can make the product even better.

Whether you have a feature request or a general suggestion, we want to hear it all. Your feedback is valuable to us, and it will help us prioritize our roadmap and make sure that we are building the right things for our users.
If you want to report a bug, please use this link https://knowledge.datasnipper.com/kb-tickets/new to reach out to our support team. They will be able to help you troubleshoot the issue and ensure that it gets resolved quickly.

9 results found

  1. Currently, in Document Matching, Form Extraction, etc., a currency symbol such as "¥" is recognized as the digit "1" or "7" when extracting an amount of money, especially if it is handwritten.
    Ideally, OCR accuracy would be improved so that currency symbols are distinguished from digits.

    6 votes

  2. There are many types of documents that use alternative decimal or thousands separators, such as tax forms or payroll registers. These forms or system print-outs often use pre-set bars or grids to separate figures. It would be extremely helpful if DataSnipper could detect the logic behind these formats and extract the information correctly.

    4 votes
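
To make the request in idea 2 concrete, here is a minimal Python sketch of separator-aware parsing. It is not DataSnipper's implementation: the function names `parse_amount` and `from_grid_cells`, their parameters, and the assumption that the separators (or the number of decimal boxes in a grid) are already known for a given form are all hypothetical.

```python
from decimal import Decimal


def parse_amount(text: str, decimal_sep: str = ",", thousands_sep: str = ".") -> Decimal:
    """Parse a printed amount, given the separators the form is assumed to use."""
    cleaned = text.strip().replace(" ", "")
    if thousands_sep:
        # Grouping characters carry no value, so drop them first.
        cleaned = cleaned.replace(thousands_sep, "")
    # Decimal() expects "." as the decimal point.
    cleaned = cleaned.replace(decimal_sep, ".")
    return Decimal(cleaned)


def from_grid_cells(cells: list, decimals: int = 2) -> Decimal:
    """Rebuild an amount from one-digit-per-box grid cells.

    Assumes the last `decimals` boxes hold the fractional digits.
    """
    digits = "".join(c.strip() for c in cells)
    if not digits:
        raise ValueError("no digits found in grid cells")
    # scaleb shifts the decimal point left by `decimals` places.
    return Decimal(digits).scaleb(-decimals)


print(parse_amount("1.234.567,89"))
print(parse_amount("1'234.50", decimal_sep=".", thousands_sep="'"))
print(from_grid_cells(["1", "2", "3", "4", "5"]))
```

Detecting which convention a document uses is the hard part of the request and is not attempted here; the sketch only shows that extraction becomes straightforward once the convention is known.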

  3. Implement a confidence scoring system to provide users with insights into the accuracy of OCR-extracted text. I envision this feature offering both document-level and word/number-level confidence scores, empowering users to evaluate the reliability of the extracted data and make informed decisions.

    Granular Confidence Levels: Clearly define confidence score ranges (e.g., high, medium, low) and provide corresponding probability values for better interpretability.
    Visual Indicators: Incorporate visual cues (e.g., color-coding, icons) to quickly convey confidence levels, enhancing user experience.

    3 votes
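
The confidence levels described in idea 3 could be sketched as follows. The thresholds (0.90 and 0.70), the `Word` type, and the choice of a plain mean for the document-level score are assumptions made for illustration only; they are not taken from DataSnipper or from any particular OCR engine.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical cut-offs; a real product would need to calibrate these.
HIGH_THRESHOLD = 0.90
MEDIUM_THRESHOLD = 0.70


@dataclass
class Word:
    text: str
    confidence: float  # probability in [0, 1], as reported by the OCR engine


def confidence_level(probability: float) -> str:
    """Map a probability to a coarse label suitable for colour-coding."""
    if probability >= HIGH_THRESHOLD:
        return "high"
    if probability >= MEDIUM_THRESHOLD:
        return "medium"
    return "low"


def document_confidence(words: List[Word]) -> float:
    """Document-level score as the mean word confidence (0.0 if no words)."""
    if not words:
        return 0.0
    return sum(w.confidence for w in words) / len(words)


words = [Word("Total", 0.98), Word("1,234", 0.62)]
for w in words:
    print(w.text, w.confidence, confidence_level(w.confidence))
print("document:", confidence_level(document_confidence(words)))
```

A mean can hide a single badly read figure, so a word-level view would likely matter more than the document-level score when reviewing amounts.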

  4. If I run OCR on a PDF and then export the PDF to my computer, the text is no longer recognized and I can't search it anymore using, for example, Acrobat. It would be nice and useful if a PDF that has been text-recognized stayed that way after export.

    1 vote

  5. Currently, it is only possible to authenticate to the OCR endpoint using an API key. Other (more secure) authentication methods can be configured on these resources within Azure. It would be beneficial if DataSnipper supported changing the authentication method that it uses.

    1 vote

    This has been deferred (not planned for the next 6 months).


    Please continue to share this idea; we will continue to monitor for votes and comments!

  6. OCR currently overwrites all computer-generated text, sometimes with worse results. OCR may still be required when one portion of a document contains computer-generated text and another does not. In these instances, it would be ideal to use OCR only to recognize text that is not yet included in the text layer of the file.

    3 votes

  7. Is it feasible to implement an automatic dash-to-zero conversion for numerical data extracted from tables? By identifying tables with primarily numerical values, the system could infer that dashes represent zeros and replace them accordingly. This feature would enhance user experience, especially when dealing with large datasets.

    1 vote
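
The inference described in idea 7 could be sketched roughly as below. This is an illustration, not an existing DataSnipper feature: the function name `dashes_to_zeros`, the 0.5 ratio, and the choice to decide per column rather than per table (so that a label column is left alone) are all assumptions.

```python
import re

# Hyphen-minus, en dash, em dash: the cell contents treated as "dash".
DASH_CELLS = {"-", "\u2013", "\u2014"}
# Digits with optional separators, sign, accounting parentheses, percent sign.
NUMERIC = re.compile(r"^\(?-?[\d.,]*\d\)?%?$")


def is_numeric(cell: str) -> bool:
    return bool(NUMERIC.match(cell.strip()))


def dashes_to_zeros(table, min_numeric_ratio=0.5):
    """Replace dash-only cells with "0" in columns that are mostly numeric."""
    n_cols = max((len(row) for row in table), default=0)
    numeric_cols = set()
    for col in range(n_cols):
        values = [row[col].strip() for row in table if col < len(row)]
        # Judge the column on its non-empty, non-dash cells only.
        values = [v for v in values if v and v not in DASH_CELLS]
        if values and sum(is_numeric(v) for v in values) / len(values) >= min_numeric_ratio:
            numeric_cols.add(col)
    return [
        ["0" if i in numeric_cols and cell.strip() in DASH_CELLS else cell
         for i, cell in enumerate(row)]
        for row in table
    ]


table = [
    ["Item", "2023", "2022"],
    ["Revenue", "1,200", "-"],
    ["Other income", "\u2013", "350"],
    ["-", "75", "80"],
]
for row in dashes_to_zeros(table):
    print(row)
```

In this sample the dash in the first (label) column is kept, because that column is not mostly numeric, while the dashes in the two year columns become "0".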

  8. OCR - Punctuation standardisation - apostrophes & quote marks

    When running OCR on documents, some punctuation, particularly apostrophes & quote marks, comes out differently depending on the font, e.g. Group's vs. Group’s.

    This causes issues, in particular when using the financial statement suite & version compare features, as a lot of false changes get flagged, which muddies the waters significantly. If logic could be built in so that the characters used for apostrophes & quote marks are consistent across the board, that would be ideal. I think this setting would be best as the default, with an option to switch back to…

    1 vote
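
The standardisation requested in idea 8 can be illustrated with a short Python sketch that folds typographic apostrophes and quote marks to their straight ASCII forms before comparison. The name `normalize_quotes` and the exact set of characters covered are assumptions for illustration; this is not how DataSnipper handles punctuation.

```python
# Map curly and low-9 quote variants to straight ASCII quotes.
PUNCTUATION_MAP = str.maketrans({
    "\u2018": "'",  # left single quotation mark
    "\u2019": "'",  # right single quotation mark (typographic apostrophe)
    "\u201A": "'",  # single low-9 quotation mark
    "\u201B": "'",  # single high-reversed-9 quotation mark
    "\u201C": '"',  # left double quotation mark
    "\u201D": '"',  # right double quotation mark
    "\u201E": '"',  # double low-9 quotation mark
    "\u201F": '"',  # double high-reversed-9 quotation mark
})


def normalize_quotes(text: str) -> str:
    """Return text with apostrophes and quote marks in one consistent form."""
    return text.translate(PUNCTUATION_MAP)


old_version = "the Group's \"net\" assets"
new_version = "the Group\u2019s \u201cnet\u201d assets"
print(old_version == new_version)
print(normalize_quotes(old_version) == normalize_quotes(new_version))
```

Applying the same normalisation to both versions before comparing them would stop purely typographic differences from being flagged as changes.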

    This has been deferred (not planned for the next 6 months).


    Please continue to share this idea; we will continue to monitor for votes and comments!

  9. Add a button, with an alert shown when a PDF document is uploaded, to run or save the document as Optimized, eliminating metadata in the PDF that can cause issues. This would work similarly to how you run OCR by selecting a button.

    1 vote

    This has been deferred (not planned for the next 6 months).


    Please continue to share this idea; we will continue to monitor for votes and comments!
