Is Consent Required for Processing Personal Data via LLMs?

Under the General Data Protection Regulation (GDPR), consent is only one of the legal bases that can be used to process personal data, including when using Large Language Models (LLMs) to boost the efficiency of processing personal information. While obtaining consent has its advantages, for example the possibility of a clear audit trail if consent … Read more

The evolving landscape of data privacy legislation in healthcare in Germany

The healthcare sector has witnessed a remarkable evolution in data privacy legislation from the 1970s to the present, mirroring the technological innovations of the time. The journey we trace here using Germany as an example illustrates how laws have struggled to adapt to protect and make usable sensitive health information against the backdrop of digital … Read more

The CIO’s and CISO’s Guide for Proactive Reporting and DLP with Private AI and Elastic

Being able to manage the data and information within a company’s infrastructure is critical for properly assessing when sensitive information is either being mismanaged or to report an “all clear” when company policies are being followed as intended. As you may be already aware, Private AI provides PII detection and redaction services to enable companies … Read more

The Evolving Landscape of Health Data Protection Laws in the United States

The healthcare sector in the United States has seen a profound transformation in its approach to data privacy, paralleling significant technological advancements, in particular the electronic health record (EHR). This article explores the trajectory of health data protection legislation in the U.S., highlighting key developments in EHR development and adoption and ongoing challenges in balancing … Read more

How to Safely Redact PII from Segment Events using Destination Insert Functions and Private AI API

Twilio Segment is a great centralized platform for storing customer data. However, it’s unlikely Segment will be your only application that needs the data. Most of us will need to download customer information from Segment, store it in the data lake, propagate the data to reporting software, and populate a few backend databases. All of … Read more

WHO’s AI Ethics and Governance Guidance for Large Multi-Modal Models operating in the Health Sector – Data Protection Considerations

The WHO’s publication of its Ethics and governance of artificial intelligence for health: Guidance on large multi-modal models from January 18, 2024 is an almost 100 page document, including over 20 pages of footnotes, which takes a deep dive into the risks and challenges of Large Multi-Modal Models (LMMs) developed, provided, and/or deployed for health … Read more

How to Protect Confidential Corporate Information in the ChatGPT Era

Many of the security measures businesses put in place to protect the personal information of their customers and employees will also help safeguard confidential corporate information. For example, access controls and incident protection systems will help avoid this information getting into the hands of internal and external threat agents. However, if that’s all they do, … Read more

Unlocking the Power of Retrieval Augmented Generation with Added Privacy: A Comprehensive Guide

RAG is a popular approach that improves the accuracy of LLMs by utilizing a knowledge base. In this blog post, we illustrate how to implement RAG without compromising the privacy of your data. What is RAG Large language models, such as OpenAI’s gpt-4-turbo and Anthropic’s Claude-3,are very powerful assistants that help us carry out different … Read more

Leveraging ChatGPT and other AI Tools for Legal Services

In recent years, the emergence of artificial intelligence (AI) and machine learning technologies has created new possibilities for various fields, including the legal sector. ChatGPT, an innovative AI language model created by OpenAI, is at the forefront of this transformation. By leveraging the abilities of ChatGPT, legal professionals can modernize their practice, streamline research, and … Read more

Download the Free Report

Request an API Key

Fill out the form below and we’ll send you a free API key for 500 calls (approx. 50k words). No commitment, no credit card required!

Language Packs

Expand the categories below to see which languages are included within each language pack.
Note: English capabilities are automatically included within the Enterprise pricing tier. 

French
Spanish
Portuguese

Arabic
Hebrew
Persian (Farsi)
Swahili

French
German
Italian
Portuguese
Russian
Spanish
Ukrainian
Belarusian
Bulgarian
Catalan
Croatian
Czech
Danish
Dutch
Estonian
Finnish
Greek
Hungarian
Icelandic
Latvian
Lithuanian
Luxembourgish
Polish
Romanian
Slovak
Slovenian
Swedish
Turkish

Hindi
Korean
Tagalog
Bengali
Burmese
Indonesian
Khmer
Japanese
Malay
Moldovan
Norwegian (Bokmål)
Punjabi
Tamil
Thai
Vietnamese
Mandarin (simplified)

Arabic
Belarusian
Bengali
Bulgarian
Burmese
Catalan
Croatian
Czech
Danish
Dutch
Estonian
Finnish
French
German
Greek
Hebrew
Hindi
Hungarian
Icelandic
Indonesian
Italian
Japanese
Khmer
Korean
Latvian
Lithuanian
Luxembourgish
Malay
Mandarin (simplified)
Moldovan
Norwegian (Bokmål)
Persian (Farsi)
Polish
Portuguese
Punjabi
Romanian
Russian
Slovak
Slovenian
Spanish
Swahili
Swedish
Tagalog
Tamil
Thai
Turkish
Ukrainian
Vietnamese

Rappel

Testé sur un ensemble de données composé de données conversationnelles désordonnées contenant des informations de santé sensibles. Téléchargez notre livre blanc pour plus de détails, ainsi que nos performances en termes d’exactitude et de score F1, ou contactez-nous pour obtenir une copie du code d’évaluation.

99.5%+ Accuracy

Number quoted is the number of PII words missed as a fraction of total number of words. Computed on a 268 thousand word internal test dataset, comprising data from over 50 different sources, including web scrapes, emails and ASR transcripts.

Please contact us for a copy of the code used to compute these metrics, try it yourself here, or download our whitepaper.