AI Tech Digest — May 06, 2026
The AI Tech Digest is evolving. We’re shifting from industry news to what matters to builders: new tools, trending open-source projects, and the best from the AI developer community. If you’re looking for funding rounds and CEO drama, this isn’t the place anymore.
Top Stories
Vercel Open-Sources deepsec: AI-Powered Vulnerability Hunting at Repo Scale
Vercel Labs released deepsec, a CLI-first, open-source security harness that uses coding agents (Claude and Codex) to find vulnerabilities in large codebases. The tool runs on your own infrastructure, supports sandbox-based scaling, and targets repositories where traditional SAST/DAST tools fall short.
The architecture separates concerns cleanly: regex matchers surface suspicious patterns, then an AI agent (Claude by default, Codex as an alternative backend) investigates, enriches findings with context, and triages by severity. It ships with a deepsec triage --severity HIGH command for filtering and deepsec revalidate for confirming fixes. For large-scale scanning, Vercel’s AI Gateway provides unified auth across both agent backends from a single API key.
Why it matters: AI-powered vulnerability scanning is now a CLI command. It arrives as Anthropic’s Mythos model demonstrates that AI agents can find zero-day vulnerabilities in production software, and deepsec puts that capability into everyday CI/CD workflows. The pluggable agent backend (swap Claude for Codex with a flag) means you’re not locked into one provider.
OpenAI Ships GPT-5.5 Instant as New ChatGPT Default
OpenAI rolled out GPT-5.5 Instant, replacing GPT-5.3 Instant as the default model for ChatGPT’s free tier and API. The update promises fewer hallucinations on sensitive topics, more concise responses, and fewer gratuitous emojis. The model scored 81.2 on the AIME 2025 math test, up from 65.4 on the previous default.
Available in ChatGPT starting May 5 and in the API alongside the Pro and Thinking variants released last month. The rollout is phased, so some users may see it before others.
Why it matters: Hundreds of millions of ChatGPT users interact with this model daily. For developers building on the API, GPT-5.5 Instant at the default tier closes a quality gap with the Pro variant, so it’s worth benchmarking if you’ve been paying for higher tiers. The “fewer hallucinations on sensitive topics” claim addresses one of the most common complaints about chat models in production.
Anthropic & OpenAI’s Enterprise JVs Are Already Shopping for Acquisitions
Just one day after Anthropic’s $1.5B joint venture with Blackstone, Hellman & Friedman, and Goldman Sachs launched (and OpenAI finalized its $10B rival JV), Reuters reports both ventures are already in talks to acquire AI services and consulting firms. The stated goal: embed technical teams directly inside client organizations to build customized AI solutions rather than selling software licenses.
The moves follow a “forward-deployed engineering” model where AI company staff work inside portfolio companies of the PE backers. Most of the capital raised is expected to fund acquisitions of engineering services firms.
Why it matters: “Just ship an API” isn’t enough for enterprise AI adoption. Both Anthropic and OpenAI are acknowledging that deploying AI into real business processes still requires human services work, the same labor-intensive integration that has kept traditional consulting firms in business. If you’re building AI services or consulting, these JVs are about to become both your biggest customers and your most well-funded competitors.
- Reuters: JVs in talks to buy AI services firms
- SiliconANGLE: Enterprise AI adoption push
- CNBC: Anthropic teams with Goldman, Blackstone
Noteworthy Releases & Updates
754 Cybersecurity Skills Drop for AI Agent Platforms
A new open-source repository, mukul975/Anthropic-Cybersecurity-Skills, ships 754 structured cybersecurity skills spanning 26 security domains, each mapped to five industry frameworks: MITRE ATT&CK, NIST CSF 2.0, MITRE ATLAS, D3FEND, and NIST AI RMF. The skills follow the agentskills.io open standard (YAML frontmatter + structured Markdown) and work with Claude Code, GitHub Copilot, Codex CLI, Cursor, Gemini CLI, and 20+ other platforms. Licensed Apache 2.0.
Install with npx skills add mukul975/Anthropic-Cybersecurity-Skills or clone the repo and point your agent at it.
Why it matters: 754 structured skills across 26 security domains, each mapped to five compliance frameworks. Instead of prompting your agent to “do a security review” and hoping for the best, you can load structured procedures that map directly to MITRE ATT&CK, NIST CSF 2.0, and three others. One skill, five compliance checkboxes. Security teams building AI-assisted workflows can start using this today.
Anthropic’s Mythos Model + Project Glasswing: Finding Zero-Days at Scale
Anthropic published details on Project Glasswing, an initiative using its Claude Mythos Preview model to find and fix zero-day vulnerabilities in critical software. In just a few weeks, Anthropic claims to have identified thousands of previously unknown zero-day vulnerabilities, many critical, in every major operating system and every major web browser.
Mythos Preview isn’t a security-specific model; it’s a general-purpose frontier model that Anthropic discovered has striking cybersecurity capabilities. The UK’s AI Safety Institute independently confirmed the model’s capabilities in multi-step cyber-attack simulations. Anthropic is extending access to 40+ organizations that maintain critical software, including Microsoft and CrowdStrike.
Separately, Anthropic shipped Claude Code Security, a GitHub Action that uses Claude to analyze PR diffs for security vulnerabilities, available as a limited research preview.
Why it matters: Mythos can find bugs that entire security teams missed, but the dual-use implications are serious. The same capabilities that help defenders can enable attackers. Anthropic is trying to thread the needle with Glasswing by prioritizing defender access. For developers, the Claude Code Security Action is a drop-in GitHub Action that adds AI-powered security analysis to your existing PR review workflow.
Ai2 Releases MolmoAct2: Open Robotics Foundation Model
The Allen Institute for AI (Ai2) released MolmoAct2, a fully open-source robotics foundation model that brings faster, stronger 3D action reasoning to real-world robot tasks. The model uses depth-aware perception tokens and visual reasoning traces via waypoints in image space to convert instructions into low-level action commands for robotic hardware.
MolmoAct2 ships alongside the MolmoAct2-BimanualYAM Dataset, a new bimanual manipulation dataset built on Hugging Face’s low-cost SO-100/101 robotic platform. All model weights, training code, datasets, and action reasoning data are released openly.
The project builds on last year’s MolmoAct, which introduced the “action reasoning model” concept: a VLM that grounds scene semantics through depth perception rather than relying on language to carry out physical motions.
Why it matters: Open robotics models are rare compared to the flood of language and vision models. MolmoAct2 reasons in 3D space through visual waypoints instead of trying to describe physical actions in language. The bimanual dataset fills a gap in robotics research: most open datasets focus on single-arm manipulation. If you’re working in robotics or embodied AI, download the weights and the dataset.
The Skills Directory Race: Four Lines That Took GitHub by Storm
The “4 lines every CLAUDE.md needs”, a single-file GitHub repo distilling Andrej Karpathy’s observations about LLM coding pitfalls, continues to rack up stars, now past 60,000. The repo (forrestchang/andrej-karpathy-skills) sits at #2 on GitHub Trending, while Matt Pocock’s mattpocock/skills holds #1 with 55K+ stars.
Three of the five fastest-growing repos on GitHub are now skill directories, and the broader ecosystem has spawned a comparison site (AgentConn) analyzing the different approaches. The key distinction: Pocock’s skills are vendor-locked to Claude but trivially replaceable, while others aim for cross-platform compatibility.
Why it matters: A markdown file with four behavioral guidelines is one of GitHub’s most-starred repos. That says more about the bottleneck in AI-assisted coding than any product announcement. The bottleneck isn’t model quality; it’s how you instruct the model. The skills ecosystem is still early and fragmented. IDEs and agent frameworks will likely formalize skill loading in the next few months.
- andrej-karpathy-skills on GitHub
- Level Up Coding: The 4 Lines analysis
- AgentConn: Skills directory comparison
Quick Hits
-
Anthropic’s run-rate surpasses $30B: Up from ~$9B at end of 2025, driven largely by Claude Code adoption. Anthropic also secured 5 gigawatts of compute capacity through its expanded Google-Broadcom partnership (up from the 3.5GW announced earlier), with capacity coming online in 2027.
-
OpenClaw past 300K GitHub stars: The open-source AI assistant project continues to be the fastest-growing repo in GitHub history, up from 9K stars in January. The ecosystem now includes VoltAgent’s
awesome-openclaw-skillscollection and multiple enterprise deployments. -
Kimi Code K2.6 full release: Moonshot AI’s coding model, which generated the highest engagement of any April model release, is now expected to have full API access and benchmarks in early May.
What to Watch
-
AI-powered security tooling convergence: Between Vercel’s deepsec, Anthropic’s Mythos/Glasswing, and the 754-skill cybersecurity library, the tools for AI-assisted security are maturing fast. More IDE integrations and CI/CD pipeline tooling should follow in the coming weeks.
-
Enterprise AI JV acquisitions: Both Anthropic’s and OpenAI’s Wall Street-backed ventures are actively shopping. The first acquisition targets will signal which types of AI services companies are most valued, whether pure consulting, vertical solutions, or integration specialists.
-
Skills ecosystem consolidation: With three of GitHub’s top five trending repos being skill directories and multiple cross-platform standards emerging (agentskills.io, AGENTS.md), a shakeout toward one or two dominant formats is likely. Watch for IDE vendors to pick sides.
-
Anthropic compute scaling: 5 gigawatts is an enormous compute commitment. How Anthropic allocates that capacity between training, inference, and the growing Claude Code user base will have downstream effects on API pricing, latency, and model availability.
That’s the digest for May 6, 2026. See you tomorrow.