Treffer: Enhanced personally identifiable information data masker using natural language processing and computer vision.
Weitere Informationen
The "Enhanced PII Data Masker" work outlines a solution that utilizes NLP and CV to change the approach in security concerning documents. It employs Python, spaCy, OpenCV, and Django to detect and redact sensitive data with high accuracy irrespective of the document's format. Having provided various examples of generic and specific solutions, assessment of experiment results proves the effectiveness of the system to protect sensitive data and, consequently, augment confidentiality and privacy of most digital documents in a diverse range of domains. Farther to extend the functionality of the system, some features are added such as Usability of GUI interface is improved, Feedback mechanism for fine tuning the redaction algorithm, Batch processing. All these improvements combined give a solid answer to not only nuances recognition of private information but also its uncompromised occlusion, which is engaging two important imperatives of the modern informational security. [ABSTRACT FROM AUTHOR]
Copyright of AIP Conference Proceedings is the property of American Institute of Physics and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)