Migrating Your Privacy Workflows from Amazon Comprehend to Private AI

AWS Comprehend

At Private AI, we pride ourselves on making scalable solutions for large companies, small businesses, and government institutions that need to securely handle the potentially sensitive data and information that is flowing through their infrastructure. We understand the importance of maintaining data privacy and meeting regulatory compliance standards, and our goal is to ensure that … Read more

A Comparison of the Approaches to Generative AI in the US and EU

Approaches to Generative AI

Generative AI has become a cornerstone of modern technology, with potential applications ranging from content creation and customer service to scientific research and healthcare. However, as these powerful tools proliferate, they present a diverse set of risks and opportunities that require prudent oversight. Two major players in the global landscape, the United States and the … Read more

Privacy Attacks against Data and AI Models (Part 3)

Attacks on AI Models

Privacy Attacks against Data and AI Models Part 1 of this series covers the benefits of AI models in healthcare and the data sources available for training such models. Part 2 addresses privacy compliance challenges as well as challenges surrounding privacy-preserving techniques employed during AI development. This last part of our series gets a bit … Read more

Risks of Noncompliance and Challenges around Privacy-Preserving Techniques (Part 2)

Safeguarding Health Data

Risks of Noncompliance and Challenges around Privacy-Preserving Techniques Despite the promising signs of AI used in healthcare that we explored in Part 1 of this series, ethical concerns persist regarding the potential misuse of these innovations and safeguarding health data. For instance, drug discovery AI systems have demonstrated remarkable efficiency aiding harmful discoveries as well, … Read more

Benefits of AI in Healthcare and Data Sources (Part 1)

Safeguarding health data used in Machine Learning

Benefits of AI in Healthcare and Data Sources AI has already brought marvellous advancements in the healthcare sector but awareness is on the rise around the risks associated with it as well. In this series of posts, we are concerned with the ethical and legal compliance issue of protecting privacy and safeguarding health data used in … Read more

Enhancing Data Lake Security: A Guide to PII Scanning in S3 buckets

Data Lake Security

Introduction Data lakes are messy with vast amounts of data scattered everywhere as they serve as repositories for a myriad of structured and unstructured data. Amidst this chaos lies a treasure trove of valuable and sometimes sensitive information, including data that can directly or indirectly identify individuals, known as Personally Identifiable Information (PII). As organizations … Read more

Navigating GDPR Compliance in the Life Cycle of LLM-Based Solutions

GDPR Compliance for LLM-Based Solutions

In today’s data-driven landscape, the use of AI-based solutions, such as ChatGPT, has become increasingly prevalent. These solutions leverage the power of artificial intelligence to analyze data, generate insights, and facilitate interactions with users. However, with the rise of AI technologies, it is crucial to consider the implications for data protection and privacy, particularly in … Read more

What’s New in Version 3.8

Version 3.8 Updates

Hello, dear community! We continue to see exciting improvements released since 3.7. Here is a synopsis of highlights from the version 3.8 updates release. Translated Redaction Labels Private AI supports text processing in multiple languages, and redaction markers are now also available in multiple languages. See the Supported Language documentation for more information on which languages … Read more

How to Protect Your Business from Data Leaks: Lessons from Toyota and the Department of Home Affairs

Data Leak

Data leaks are a serious threat to any business, and particularly those handling sensitive personal information, such as customer’s financial or health records. We discussed the costs of data breaches previously in this blog post. But even when no malicious actors are involved, data leaks can expose businesses to legal liability, reputational damage, and loss … Read more

Download the Free Report

Request an API Key

Fill out the form below and we’ll send you a free API key for 500 calls (approx. 50k words). No commitment, no credit card required!

Language Packs

Expand the categories below to see which languages are included within each language pack.
Note: English capabilities are automatically included within the Enterprise pricing tier. 

French
Spanish
Portuguese

Arabic
Hebrew
Persian (Farsi)
Swahili

French
German
Italian
Portuguese
Russian
Spanish
Ukrainian
Belarusian
Bulgarian
Catalan
Croatian
Czech
Danish
Dutch
Estonian
Finnish
Greek
Hungarian
Icelandic
Latvian
Lithuanian
Luxembourgish
Polish
Romanian
Slovak
Slovenian
Swedish
Turkish

Hindi
Korean
Tagalog
Bengali
Burmese
Indonesian
Khmer
Japanese
Malay
Moldovan
Norwegian (Bokmål)
Punjabi
Tamil
Thai
Vietnamese
Mandarin (simplified)

Arabic
Belarusian
Bengali
Bulgarian
Burmese
Catalan
Croatian
Czech
Danish
Dutch
Estonian
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Italian
Japanese
Khmer
Korean
Latvian
Lithuanian
Luxembourgish
Malay
Mandarin (simplified)
Moldovan
Norwegian (Bokmål)
Persian (Farsi)
Polish
Portuguese
Punjabi
Romanian
Russian
Slovak
Slovenian
Spanish
Swahili
Swedish
Tagalog
Tamil
Thai
Turkish
Ukrainian
Vietnamese

Rappel

Testé sur un ensemble de données composé de données conversationnelles désordonnées contenant des informations de santé sensibles. Téléchargez notre livre blanc pour plus de détails, ainsi que nos performances en termes d’exactitude et de score F1, ou contactez-nous pour obtenir une copie du code d’évaluation.

99.5%+ Accuracy

Number quoted is the number of PII words missed as a fraction of total number of words. Computed on a 268 thousand word internal test dataset, comprising data from over 50 different sources, including web scrapes, emails and ASR transcripts.

Please contact us for a copy of the code used to compute these metrics, try it yourself here, or download our whitepaper.