Our classifier uses a traditional language model architecture. It receives input text and tokenizes it. The model then turns each token into an embedding: a vector of numbers representing that token's meaning.
These embeddings are passed through the neural network, producing an output embedding. A classifier head transforms the output embedding into a binary prediction: 0 for human-written text and 1 for AI-generated text.
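The pipeline above can be sketched in a few lines. This is a minimal, illustrative stand-in, not the production model: the toy tokenizer, the random parameters, and mean pooling in place of the real neural network are all assumptions made for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB_SIZE, EMB_DIM = 100, 16

# Illustrative parameters; a real model learns these during training.
embedding_table = rng.normal(size=(VOCAB_SIZE, EMB_DIM))
head_weights = rng.normal(size=EMB_DIM)
head_bias = 0.0

def tokenize(text: str) -> list[int]:
    # Toy tokenizer: map each whitespace-separated word to a vocab id.
    return [sum(w.encode()) % VOCAB_SIZE for w in text.split()]

def classify(text: str) -> int:
    token_ids = tokenize(text)
    # Look up an embedding vector for each token.
    embeddings = embedding_table[token_ids]
    # Stand-in for the neural network: mean-pool the token embeddings
    # into a single output embedding.
    output_embedding = embeddings.mean(axis=0)
    # Classifier head: linear layer + sigmoid, thresholded at 0.5.
    logit = output_embedding @ head_weights + head_bias
    prob_ai = 1.0 / (1.0 + np.exp(-logit))
    return int(prob_ai >= 0.5)  # 0 = human label, 1 = AI label
```

In a real detector the pooling step would be a full transformer encoder and the head would be trained jointly with it, but the shape of the computation is the same: tokens in, one embedding out, one bit of prediction at the end.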
The initial model was already quite effective, but we wanted to maximize accuracy and minimize false positives (human-authored documents incorrectly flagged as AI-generated). To do this, we developed a training algorithm specifically for AI detection models.
The initial dataset did not give the model enough signal to go from 99% to 99.999% accuracy. The model learns the broad patterns in the data quickly, but it needs to see hard edge cases to precisely distinguish human from AI text.
We solved this by using the model itself to search large datasets for false positives and augmenting the initial training set with these hard examples before retraining. After several such cycles, the resulting model exhibits a near-zero false positive rate as well as improved overall performance on held-out evaluation sets.
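The mining-and-retraining loop can be sketched as follows. Everything here is a toy illustration under stated assumptions: documents are reduced to a single score, the "model" is just a decision threshold, and the retraining rule (placing the threshold between the hardest examples of each class) stands in for real gradient training.

```python
def predict(threshold: float, x: float) -> int:
    # 1 = flagged as AI, 0 = human.
    return int(x >= threshold)

def retrain(train_set: list[tuple[float, int]]) -> float:
    # Toy "training": put the threshold midway between the highest-scoring
    # human example and the lowest-scoring AI example seen so far.
    human = [x for x, y in train_set if y == 0]
    ai = [x for x, y in train_set if y == 1]
    return (max(human) + min(ai)) / 2

def mine_hard_examples(threshold, human_corpus, train_set, rounds=3):
    for _ in range(rounds):
        # Search a large corpus of known-human documents for false
        # positives: documents the current model wrongly flags as AI.
        false_positives = [x for x in human_corpus if predict(threshold, x) == 1]
        if not false_positives:
            break
        # Augment the training set with these hard examples (label 0 = human)
        # and retrain before the next round.
        train_set = train_set + [(x, 0) for x in false_positives]
        threshold = retrain(train_set)
    return threshold

# Easy initial data places the boundary too low...
train = [(0.1, 0), (0.2, 0), (0.9, 1), (1.0, 1)]
t0 = retrain(train)            # 0.55
corpus = [0.3, 0.4, 0.6, 0.7]  # human-written docs, some near the boundary
# ...so mining pushes it past the hard human examples.
t1 = mine_hard_examples(t0, corpus, train)
```

After mining, the documents at 0.6 and 0.7 that the initial threshold misclassified are no longer flagged, which is the toy analogue of the false positive rate dropping across cycles.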