new software benchmark

I signed up for another SaaS


May 28
 

Hey folks,

“SaaS may be dead” - me, on Tuesday

Just signed up for another SaaS tool - me, yesterday

I’m trying really hard to make interactive components for this course/reference manual I’m making, so that you as a user can feel the concepts and understand them better.

I’ve tried so many models, tools and approaches to develop my own component styles that look good and feel right. And I think I finally found it…

I tweeted my frustrations and Pietro, who I met at OpenAI’s Dev Day last year, reminded me to try Magic Path. You can have multiple agents generating design assets, components, animations, whatever, on a big shared canvas.

I gave it a go on a fun experiment first and it generated some pretty awesome mechanical-style components.

Ben Tossell
@bentossell
magicpath slaps (with droid) tested it out h/t @skirano
Ben Tossell @bentossell
design peeps, a lil help? im building interactive, animated components im using shadcn, tailwind, motion, tegaki, rough-notation. i want a bit of a component generator that can help me cycle through variations, themes, layouts, mix n match different components and things like
7:58 PM · May 27, 2026 · 12K Views
2 Replies · 1 Repost · 29 Likes

So now I have an actual workflow and tools to generate all the components I’m after. I can play with different styles and tweak the smaller parts of the components - the buttons, prompt input box, etc.

early experiments for different styles

So I blew through the Magic Path free plan pretty quickly and then promptly signed up for a pro plan 😬.


Ben’s Bites is brought to you by Palabra.ai — Real-Time Voice AI Translator

9.3× cheaper than a human interpreter. Palabra.ai delivers real-time voice translation in 60+ languages for calls, events and streams – or embed into any app via API. Trusted by DHL, UNICEF, Paramount, BCG and Deloitte. Try it free.


Headlines

  • Claude Code now has a security plugin that checks code as Claude writes it and warns when it spots common risky patterns, like unsafe command execution, insecure HTML handling, or dangerous Python code.

  • DeepSWE tests agents on 113 original long-horizon tasks across 91 active repos and five languages. Prompts are shorter than SWE-bench Pro, but the fixes are much bigger: 668 lines and seven files on average. Current leaderboard: GPT-5.5 70%, GPT-5.4 56%, Claude Opus 4.7 54%, Claude Sonnet 4.6 32%.

  • From the board to building the Software Factory. It doesn’t happen that often, but I’ve seen it a few times recently - investors leave to join the company that they backed. Madison joining Factory is a big signal and a great addition to the team. If you remember, I am an investor in Factory who joined last year, but I left earlier this year because our army of young children runs my life, leaving less and less time for work work. 3 under 3 is still A LOT of work 😅.



My feed

  • Software after software - this was a great read, highly recommended. Thorsten runs the coding harness Amp and always has great takes on the space.

  • Clanker - A word for the machine.

  • Mainframe - turn the work done by your agents into short recap videos for your team.

  • Granite - long-term document storage for all your files. Drop them in without any tagging/folders, and later search for them in plain English.

  • Surya OCR 2 - 650M parameter OCR/document model.

  • Claude Code trick for non-technical tasks - put a bunch of files in a folder, then tell Claude Code it can write scripts and make HTML.

  • Ramp used 10,000 home-grown security agents to find, validate and patch nearly 100 security issues in six days, with humans reviewing PRs before merge.

  • Slippery Slope - It’s easy to let agents get in between a person and their craft.

  • Supermemory - Building blocks for adding context to your agents.

  • Cursor trained Composer 2.5 by doing RL inside the actual Cursor harness.

  • Polar - fine-tune a model with your agent harness as the training environment with no code changes.

  • Parse 2.0 - the most accurate document parsing API in the world.

  • OpenAI are slurping up a ton of talented builders, most recently Eric, who built RepoPrompt. Great get for OAI, and congrats to Eric 😊

  • Auto-review skill for your agents, from the powerhouse shipper Peter Steinberger.

  • howtoeval - the no-bullshit guide to eval’ing AI agents.


Afters

Sam Altman
@sama
AI should dramatically increase quality of life and individual freedoms for people around the world. The OpenAI Foundation is making an initial $250M commitment to measurement, transition support, and new approaches to broadly shared prosperity. openaifoundation.org/news/economic-…
4:44 PM · May 27, 2026 · 349K Views
1.06K Replies · 327 Reposts · 3.81K Likes
ben hylak
@benhylak
introducing howtoeval dot com. the no-bullshit guide to eval'ing AI agents. from personal experience, and from working with the best companies in the world. there's even a quiz. link below.
5:09 PM · May 27, 2026 · 58K Views
34 Replies · 67 Reposts · 907 Likes
Theo - t3.gg
@theo
Codex, Claude Code, and Cursor are all great tools. They're also much more different than you think. I did a comparison of the three, but not in the usual way. I went deep on how they differ philosophically.
9:18 PM · May 26, 2026 · 219K Views
61 Replies · 71 Reposts · 1.43K Likes
Gergely Orosz
@GergelyOrosz
Why is the creator of OpenCode pretty skeptical about AI productivity gains, and the hype around AI? A very conversation @thdxr (and lots of truth bombs:) Timestamps: 00:00 Intro 07:03 Dax’s path into tech 09:04 Early startup experience 13:16 Getting involved with open source
5:30 PM · May 27, 2026 · 77.7K Views
31 Replies · 91 Reposts · 1.31K Likes
Cursor
@cursor_ai
We're hosting an event on June 16th in San Francisco. Compile is a one-day event that brings together engineers, researchers, designers, and builders of all kinds to discuss the future of software.
cursor.com
Cursor · Compile
4:31 PM · May 27, 2026 · 232K Views
86 Replies · 89 Reposts · 1.08K Likes



  • Find me on X, LinkedIn, or YouTube

  • Read about me and Ben’s Bites

  • 📷 thumbnail by @keshavatearth

* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter?
Email us at shanice@bensbites.com or k@bensbites.com


© 2026 Ben's Bites Ltd
548 Market Street PMB 72296, San Francisco, CA 94104