All Stories

April 2023

Guardrails: Reducing Risky Outputs

Enhancing LLM Output Predictability and Safety with Structured Validation
April 15, 2023
AI Security is Probabilistic Security

Emergent Challenges: Prompt Injections and Ensuring AI Security in an Unpredictable Landscape
April 15, 2023
Obi-ChatGPT - You’re My Only Hope!

Funny Jailbreak of the Week
April 08, 2023
Eight Things to Know about Large Language Models

LLMs as Colleagues? 8 Observations and Future Workplace Implications
April 08, 2023
Reverse Engineering Neural Networks

Building Trust in AI: Seeking Mechanistic Interpretability for AI Explainability and Safety
April 08, 2023
We accidentally invented computers that can lie to us

Hallucinations as Bugs: AI's Double-edged Sword in Disruptive Technology and Society
April 08, 2023
Slip Through OpenAI Guardrails by Breaking up Tasks

Evading AI Guardrails: Crafting Malware with ChatGPT's Assistance
April 08, 2023
Use ChatGPT to examine every npm and PyPI package for security issues

AI-driven Socket identifies and analyzes 227 vulnerable or malicious packages in npm and PyPI repositories.
April 01, 2023
Introducing Microsoft Security Copilot

A closed-loop learning system for enterprise Security Operations Centers
April 01, 2023
Constitutional AI

Scaling Supervision for Improved Transparency and Accountability in Reinforcement Learning from Human Feedback Systems
April 01, 2023

