Everyone's Talking About Mythos. My Wife Is Still Talking About the TV.
Anthropic's Mythos Preview report measures the model I can actually run. So this weekend, I ran it on my house. Here is what Opus 4.6 did.
Tag
12 posts
Anthropic's Mythos Preview report measures the model I can actually run. So this weekend, I ran it on my house. Here is what Opus 4.6 did.
I spoke at Palm Beach State College's Cybersecurity Symposium on how security teams use AI in practice and what students can do right now to prepare.
Findings from 153 participants classifying AI-generated phishing: technique-level bypass rates, overconfidence patterns, and what security training misses.
Real-time 1v1 ranked matches, a new unlock ladder, and a terminal AI that will not stop talking. Threat Terminal v2.0 goes live tonight.
What is changing in Threat Terminal v2: complete UI overhaul, persistent progression, daily challenges, ranked PvP, badges, and a coin economy.
Preliminary descriptive patterns from 100 participants and 1,612 classified emails in Threat Terminal, before formal statistical analysis begins.
Pilot data from 56 participants in Threat Terminal reveals which phishing techniques humans miss most when AI eliminates writing quality as a signal.
Phishing emails with no urgency, no threats, and no red flags bypass humans at three times the rate of credential harvesting. Training has it backwards.
AI eliminated the grammar errors and broken formatting phishing training taught people to spot. The detection problem is now fundamentally different.
How I built a controlled phishing dataset with the Claude API: batching by technique, automated review, and handling rate limits at scale.
Decisions, pivots, and problems behind designing a phishing research study, and why the constraints produced a cleaner methodology than planned.