All Stories
December 2023
-
LLMs for Evaluating LLMs
A good watch on LLMs as Evaluators
-
Karpathy on Hallucinations
Dream machines: it's a feature, not a bug
-
LLMonitor Benchmarks
Weekly benchmarks of popular LLMs using real-world prompts
-
Adversarial dataset creation challenge for text-to-image generators
Novel and long tail failure modes of text-to-image models
-
Frontier Group launches for AI Safety
The Big Guns get safer together
-
AI Engineer Summit
Recordings now on YouTube
April 2023
-
How To Avoid Leaking PII to ChatGPT
A proof-of-concept JavaScript tool to prevent IP address leakage in ChatGPT interactions.
-
ChatGPT bug bounty program doesn’t cover AI security
AI Security: The Limits of Bug Bounty Programs and the Need for Non-ML Red Teaming
-
Defending Against Deepfakes
Have you agreed on a safe word with your loved ones yet?
-
Chat Markup Language (ChatML)
Establishing Conversational Roles and Addressing Syntax-Level Prompt Injections
Page 4 of 11