|
|
|
So, GPT-5.6 is rumored to launch this Thursday, which means we are legally required to say: please do not tattoo these supposed leaks on your forearm just yet.Ā Ā |
That said, the ārumorsā are unusually specific. The chatter points to a possible June 25 launchĀ with a 2M-token context window, cheaper pricing, better agentic coding, stronger image-to-code replication, cleaner frontend generation, and Playwright-style browser testing inside ChatGPT.Ā |
TBH, vision model capabilities are the missing piece of AI development right now. IMO, solve machine vision, you solve engineering. Then itāll just be about how good the human prompterās idea is. We think Claude still has some room to grow here, so if GPT 5.6 is better at vision, that could be a huge deal.Ā |
Anyway, Corey broke down everything we think we know about GPT-5.6, because the interesting part is where OpenAI seems to be aiming: models that can use tools better, check their own work, fix mistakes, and ship something closer to finished because they can see it better. Tiny request from the humans: please let the models play their own games to test them. TY TY :DĀ |
One more to kick things off: A viral r/aivideo Seedance trailer imagined an 80s Avengers movie and turned AI video into an 80s dream-casting debate. |
|
Yāknow, in a couple years, if some of these actors license their likenesses and get paid fairly to do so, you could actually make an alt marvel cinematic universe with these ppl in it⦠just saying⦠|
Hereās what happened in AI today: |
š¼ Sakana Fugu turned a team of AI models into one API. š° OpenAI expanded Daybreak from finding software bugs to helping land fixes. š° Five Eyes agencies warned frontier cyber models may be months from major real-world risk. šŖ Stripe Directory gave agents a way to discover businesses across Stripe. š§Ā Tuesday Tool Tip:Ā compare GLM-5.2 before you commit a workflow to one provider.
|
ā¦and a whole lot more that you can read about here. |
Hey: Want to reach 700,000+ AI-hungry readers? Advertise with us!Ā |
P.S: Love robots? Weāre starting a new robotics newsletter! Sign up early here. |
|
š¼Ā Sakana Fugu Turns a Team of AI Models Into One API |
Most AI apps still make you pick a model like youāre choosing a SaaS plan: GPT 5-whatever for this, Claude Poetic whatever for that, Gemini 3 point whatever (or Nano Banana for funsies) when the first two start acting weird. |
Sakana AI wants to hide that kinda choice behind one button. Its new Sakana Fugu works like a project manager for other models: one request goes in; Fugu decides which AI agents should plan, execute, verify, or synthesize the answer. |
Here's what happened: |
Fugu is a multi-agent system (a group of specialized AI workers) delivered through one OpenAI-compatible API. It comes in two versions: Fugu for faster everyday work, and Fugu Ultra for harder multi-step tasks like AI research, cybersecurity analysis, code review, and patent searches. In one test, Fugu Ultra ran 123 AI training experiments over ~14 hours, beating three frontier-model baselines on final performance. In another, Fugu Ultra made 50 weeks of sequential buy / hold / sell decisions on anonymized stock data and grew a $10K portfolio by 19.43% on average. Tiny billboard disclaimer: past performance does not predict live-market results. Sakana also tested it on blindfold chess, where Fugu Ultra had to remember the board without seeing it and ended four games in checkmate.
|
How to try it: |
Go to the Sakana AI Console. Pick Fugu or Fugu Ultra. Use the OpenAI-compatible API in your workflow. Check regional availability first; EU / EEA access is still pending GDPR compliance.
|
Why this matters: The default AI strategy has been āpick the smartest model.ā Fugu makes a different bet: the best answer may come from coordination, where one system chooses the right models, asks them the right questions, and checks their work. |
That fits Sakanaās broader June product push. Marlin (featured in treats to try as well) applies similar long-horizon thinking to business research, running for hours to produce strategy reports and slides. Fugu turns that philosophy into infrastructure other products can call. |
Our take: This is the agent market growing up. Companies usually hand-build messy chains of model calls. Fugu packages the chain as the product. The meme is that everyone wants to build an orchestrator now, and that teams of models working together is the next hot product. Perplexity has been doing this for a while, and Claude and OpenAI do versions of this with their sub-agent teams.Ā |
The concern is visibility. If Fugu chooses which models touch your data, route your work, and judge the final answer, customers will want logs, controls, and boring enterprise paperwork documenting the trails and traces (who saw my data, on what API, when?). So while the future of AI may be a bunch of models in a trench coat⦠procurement still wants to know whoās inside the coat. |
|
|
|
|
Your bank has the data. But to actually use it ā check cash position, reconcile transactions, understand burn ā you have to export it, paste it somewhere else, and do the work outside your account.Ā |
Mercury Command closes that loop. It's AI built directly into Mercury that surfaces insights and takes action from your live account data. Ask what you need to know, then act on it instantly ā follow up on an outstanding invoice, set a limit on a card, categorize a transaction ā all in the same conversation. You approve every step. Command executes. |
ā Try Mercury Command |
*Mercury is a fintech company, not an FDIC-insured bank. Banking services provided through Choice Financial Group and Column N.A., Members FDIC. |
|
|
|
Hot take: you donāt actually have to worry that much about perfect prompt formatting. |
Try voice-dump prompting instead: hold down the dictation tool (most AI tools have them), ramble for a few minutes, and give the model every fragment, caveat, example, constraint, and āvibeā in your head. As Guinness Chen put it, LLMs are literally built to reconstruct intent from language. |
Then ask it to reconstruct your latent intent, meaning the goal hiding underneath the messy language. |
Copy this before or after your brain dump: |
Iām going to give you a messy brain dump. Do not answer yet.
First:
1. Summarize what Iām trying to do.
2. Identify my implied goal, audience, constraints, tone, and examples.
3. Ask whatās unclear.
4. Rewrite this into a clean prompt I can reuse.
Hereās where you can find the dictate tool on ChatGPT and Claude:
|
Hereās where you can find the dictate tool on ChatGPT and Claude:Ā |
|
Total AI beginner? Start here (goes with this video). |
Have a specific skill you want to learn?Ā Request it here.Ā |
|
|
|
|
š° Around the Horn |
|
OpenAI expanded Daybreak with Codex Security, GPT-5.5-Cyber, a Cyber Partner Program, and Patch the Planet for open-source maintainers. Googleās Intrinsic unveiled a modular AI robot workcell for electronics assembly, with a custom version expected to pilot in Foxconn facilities later this year VibeThinker-3B drew attention last week as a dense 3B model that nearly matched Claude Opus 4.5 on some verifiable reasoning benchmarks. Chevron signed a 20-year Microsoft data-center power deal for gas-power, and Reflection (open AI company) reportedly lined up up to $6.3B in Colossus compute from SpaceX. Five Eyes agencies warned that frontier cyber models capable of major attacks on governments and businesses may be months away, as White House talks shifted toward shared AI security benchmarks. Getty Images struck a multi-year deal to display licensed Getty content inside ChatGPT search and discovery experiences. Samsung Electronics started rolling out ChatGPT Enterprise and Codex to all employees in Korea and Device eXperience workers worldwide. Google DeepMind and A24 announced a research partnership to develop AI-assisted creative workflows for artists. This is funny: Googleās AI apparently recommended DuckDuckGo to users trying to avoid AI-heavy search results.
|
NEW FROM THE NEURON: Diffusion LLMs got their first serious transparency test, and Google DeepMindās DiffusionGemma suggests researchers may be able to inspect how these faster models revise answers mid-generation. |
Click here for absolutely EVERYTHING that happened in AI this week. |
|
|
|
|
AI disruption is inevitable, but the future belongs to Humans+AI. WethosAI makes you irreplaceable. Engage AI to prevent workforce extinction. Join CEO Stuart McClure this Thursday, June 25, to watch live how System 3 Thinking and Cognitive Twins will upskill your career and future-proof your business. |
Register free. |
|
š§ Tuesday Tool Tip: |
So, yesterday we talked all about GLM 5.2. Today, weāre gonna share how to use it. The easy door is OpenRouter. Think of OpenRouter like a universal remote for AI models: instead of setting up GLM 5.2 yourself, you pick it from OpenRouter's model list and send your prompt there. It is the best first test if you just want to compare GLM 5.2 against Claude, GPT, or Gemini on a few real tasks. If you want a more private, secure instance of GLM 5.2, you can use something like Baseten or Fireworks. |
The serious version of this is using Unsloth's GLM-5.2-GGUF. That version is built for running the model on your own machine or private server (although this one is a beast so donāt think you can run it on something with less than 200GB of VRAM). Use this path when you care about privacy, lower repeat costs, or customizing the setup. The easiest local version is Unsloth Studio: install it, open the browser app it gives you, search for unsloth/GLM-5.2-GGUF, and start chatting without building the whole stack yourself. |
If you have a technical teammate, here is the handoff: on OpenRouter, use the model name z-ai/glm-5.2 and the OpenRouter API address. For local testing, use Unsloth's GGUF version through Ollama or llama.cpp. |
Quick API test through OpenRouter: model = "z-ai/glm-5.2" base_url = "https://openrouter.ai/api/v1"
Simple local test through Ollama: ollama run hf.co/unsloth/GLM-5.2-GGUF:UD-Q4_K_M
Local server route through llama.cpp: llama-server -hf unsloth/GLM-5.2-GGUF:UD-Q4_K_M |
Our rule: try OpenRouter first, because it is the fastest way to learn whether GLM 5.2 is actually good enough for your work. See if you even like it first. Move to Unsloth/local only after you know the model is useful and you want more control over cost, privacy, or setup. If you want something that actually can run on a smaller graphics card, try Gemma 4 12B.Ā |
|
|
|
|
| Thatās all for now. | | What'd you think of today's email? | |
|
|
P.S: Before you go⦠have you subscribed to our YouTube Channel? If not, can you? |
P.P.S: Love the newsletter, but only want to get it once per week? Donāt unsubscribeāupdate your preferences here. |