Don't Worry About...
AI #113: The o3 Era Begins Enjoy it while it lasts.
2 hours ago
Marcus on AI
OpenAI’s dirty December o3 demo doesn’t readily replicate Don’t believe everything you see
18 hours ago
Don't Worry About...
o3 Is a Lying Liar I love o3.
20 hours ago
Rozado’s Visual...
New Results of State-of-the-art LLMs on 4 Political Orientation Tests One model appears closer to the center than the rest
2 days ago
Don't Worry About...
You Better Mechanize Or you had better not.
2 days ago
Weighty Thoughts
What LLMs Will Do To Jobs: All You Need is an Oracle LLMs are Mainly Tools That Enhance Experts
2 days ago
Don't Worry About...
Crime and Punishment #1 This seemed like a good next topic to spin off from monthlies and make into its own occasional...
3 days ago
seangoedecke.com RSS...
When you should lie to the language model Here’s an unreasonably effective trick for working with AIs: always pretend that your work was...
3 days ago
Here’s an unreasonably effective trick for working with AIs: always pretend that your work was produced by someone else. The problem is that…
One Useful Thing
On Jagged AGI: o3, Gemini 2.5, and everything after New models and new thresholds
4 days ago
seangoedecke.com RSS...
Is using AI wrong? A review of six popular anti-AI arguments Some people really, really don’t like AI. Broadly speaking, being anti-AI is a popular left-wing...
4 days ago
Some people really, really don’t like AI. Broadly speaking, being anti-AI is a popular left-wing position: AI is cringe, it’s plagiarism, it…
Don't Worry About...
o3 Will Use Its Tools For You OpenAI has finally introduced us to the full o3 along with o4-mini.
6 days ago
IEEE Spectrum
Video Friday: Robot Boxing Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
6 days ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboSoft 2025: 23–26 April 2025,...
Society's Backend
ML for SWEs 7: Eval-driven Model Development? Machine learning for software engineers 4-18-25
6 days ago
Artificial Ignorance
AI Roundup 114: One of those weeks April 18, 2025.
6 days ago
Marcus on AI
OpenAI’s o3 and Tyler Cowen’s Misguided AGI Fantasy AI can only improve if its limits as well as its strengths are faced honestly
a week ago
Artificial Ignorance
The MCP Revolution Why this open standard is becoming essential infrastructure for AI agents.
a week ago
Don't Worry About...
AI #112: Release the Everything OpenAI has upgraded its entire suite of models.
a week ago
IEEE Spectrum
The Future of AI and Robotics Is Being Led by Amazon’s Next-Gen Warehouses This is a sponsored article brought to you by Amazon. The cutting edge of robotics and artificial...
a week ago
This is a sponsored article brought to you by Amazon. The cutting edge of robotics and artificial intelligence (AI) doesn’t occur just at NASA, or one of the top university labs, but instead is increasingly being developed in the warehouses of the e-commerce company Amazon. As...
Don't Worry About...
GPT-4.1 Is a Mini Upgrade Yesterday’s news alert, nevertheless: The verdict is in.
a week ago
Marcus on AI
“Those claiming we’re mere months away from AI agents replacing most programmers” should think again AI agents will change the world. But not this year.
a week ago
Society's Backend
How much does a 10 million token context window actually cost? Some back-of-the-napkin math for Meta's Llama 4 Scout
a week ago
Marcus on AI
Altman vs Musk cage match Uh oh. Now Altman wants to build a social media company, to compete with X. Who should we root for?
a week ago
Don't Worry About...
OpenAI #13: Altman at TED and OpenAI Cutting Corners on Safety Testing Three big OpenAI news items this week were the FT article describing the cutting of corners on...
a week ago
Three big OpenAI news items this week were the FT article describing the cutting of corners on safety testing, the OpenAI former employee amicus brief, and Altman’s very good TED Interview.
AI Snake Oil
AI as Normal Technology A new paper that we will expand into our next book
a week ago
Marcus on AI
Sam Altman’s attitude problem What happened to the man who once told the US Senate “creators deserve control over how their...
a week ago
What happened to the man who once told the US Senate “creators deserve control over how their creations are used”?
seangoedecke.com RSS...
A practical guide to coding securely with LLMs Writing code with LLMs is fundamentally different from other ways of programming. LLMs are often...
a week ago
Writing code with LLMs is fundamentally different from other ways of programming. LLMs are often non-deterministic and always unpredictable…
Don't Worry About...
Monthly Roundup #29: April 2025 In Monthly Roundup #28 I made clear I intend to leave the Trump administration out of my monthly...
a week ago
In Monthly Roundup #28 I made clear I intend to leave the Trump administration out of my monthly roundups, for both better and worse, outside of my focus areas.
seangoedecke.com RSS...
Why is lmarena.ai dominated by slop? When LMSYS (aka LMArena, aka Chatbot Arena) first blew up, I thought it was the best way possible of...
a week ago
When LMSYS (aka LMArena, aka Chatbot Arena) first blew up, I thought it was the best way possible of determining which LLM really was the…
seangoedecke.com RSS...
Designing software that could possibly work Whenever anyone describes a piece of software to me, I think about how I would build it. Software...
a week ago
Whenever anyone describes a piece of software to me, I think about how I would build it. Software engineers do this a lot, but many of them…
Solving the decision...
guitars and javascript - the case for jank worse is better (and that’s why i still write javascript)
a week ago
seangoedecke.com RSS...
Software engineering under the spotlight Think of a tech company as a giant, dimly-lit factory. Work goes on throughout the factory as...
a week ago
Think of a tech company as a giant, dimly-lit factory. Work goes on throughout the factory as components shuffle back and forth, and…
seangoedecke.com RSS...
Wicked features Why is working at large tech companies so hard? It’s because a small subset of “wicked features”...
a week ago
Why is working at large tech companies so hard? It’s because a small subset of “wicked features” dominate everything else. If you’re…
Society's Backend
ML for SWEs 6: RAG Is Not Dead Machine learning for software engineers 4-11-25
a week ago
Marcus on AI
One giant leap towards authoritarian rule in the United States In the midst of the tariffs, an even bigger story is brewing: Trump is attempting to take over...
a week ago
Artificial Ignorance
AI Roundup 113: Liberation Day April 11, 2025.
a week ago
IEEE Spectrum
Video Friday: Tiny Robot Bug Hops and Jumps Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a week ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboSoft 2025: 23–26 April 2025,...
Don't Worry About...
On Google's Safety Plan Google Lays Out Its Safety Plans
a week ago
The Berkeley...
Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization... Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications....
a week ago
Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP to LLM-integrated applications, where an LLM input contains a...
Marcus on AI
Poor ROI for GenAI 61% in a recent survey report no ROI or disappointing ROI.
a week ago
Win Vector LLC
Demonstrating Kelly Betting with Chips I have a new video demonstrating the Kelly Can’t Fail betting strategy. The idea is: this is a...
2 weeks ago
I have a new video demonstrating the Kelly Can’t Fail betting strategy. The idea is: this is a classroom appropriate tool for discussing allocating assets in the presence of risk. The usual Kelly betting on coin-flips is too high variance to expect successful classroom...
Don't Worry About...
AI #111: Giving Us Pause Events in AI don’t stop merely because of a trade war, partially paused or otherwise.
2 weeks ago
Weighty Thoughts
AI's Endgame (How Foundational Model Companies Can “Win”) Playing monopoly in 2025
2 weeks ago
Artificial Ignorance
Llama 4 is a technical triumph and a strategic stumble How Meta's messy launch overshadowed its latest flagship model.
2 weeks ago
Don't Worry About...
Llama Does Not Look Good 4 Anything Llama Scout (17B active parameters, 16 experts, 109B total) and Llama Maverick (17B active...
2 weeks ago
Llama Scout (17B active parameters, 16 experts, 109B total) and Llama Maverick (17B active parameters, 128 experts, 400B total), released on Saturday, look deeply disappointing.
Society's Backend
How I Became a Machine Learning Engineer Without an Advanced Degree My path from knowing nothing about software engineering and machine learning to becoming an MLE at...
2 weeks ago
My path from knowing nothing about software engineering and machine learning to becoming an MLE at Google in 6 years without a master's degree or PhD
Mind Prison: Notes...
AI Creativity: 2 Types, One Possible, One Impossible Notes From the Desk: No. 44 - 2025.04.09
2 weeks ago
Don't Worry About...
AI 2027: Responses Yesterday I covered Dwarkesh Patel’s excellent podcast coverage of AI 2027 with Daniel Kokotajlo and...
2 weeks ago
Yesterday I covered Dwarkesh Patel’s excellent podcast coverage of AI 2027 with Daniel Kokotajlo and Scott Alexander. Today covers the reactions of others.
The Berkeley...
Repurposing Protein Folding Models for Generation with Latent Diffusion PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D...
2 weeks ago
PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D structure, by learning the latent space of protein folding models. The awarding of the 2024 Nobel Prize to AlphaFold2 marks an important moment of recognition for the role of AI in...
Marcus on AI
Deep Learning, Deep Scandal Some of the big tech giants may be playing fast and loose with AI
2 weeks ago
Strange Loop Canon
Vibe Governing using llms to set policy
2 weeks ago
Don't Worry About...
AI 2027: Dwarkesh's Podcast with Daniel Kokotajlo and Scott Alexander Daniel Kokotajlo has launched AI 2027, Scott Alexander introduces it here. AI 2027 is a serious...
2 weeks ago
Daniel Kokotajlo has launched AI 2027, Scott Alexander introduces it here. AI 2027 is a serious attempt to write down what the future holds. His ‘What 2026 Looks Like’ was very concrete and specific, and has proved remarkably accurate given the difficulty level of such...
Marcus on AI
BREAKING: Bill that would have blocked OpenAI’s conversion to a for-profit has mysteriously been... I hope the media will look into this
2 weeks ago
Marcus on AI
Scaling is over, the bubble may be deflating, LLMs still can’t reason, and you can’t trust Sam New evidence that confirms so much of what I have been saying
2 weeks ago
Marcus on AI
Reports of LLMs mastering math have been greatly exaggerated What happens when you minimize the chance of data leakage?
2 weeks ago
Marcus on AI
April fools bring May hallucinations No, Grok didn’t just solve a legendary math problem. But it gets worse.
2 weeks ago
IEEE Spectrum
Video Friday: RIVR Delivers Your Package Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 weeks ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboSoft 2025: 23–26 April 2025,...
Don't Worry About...
AI CoT Reasoning Is Often Unfaithful A new Anthropic paper reports that reasoning model chain of thought (CoT) is often unfaithful. They...
2 weeks ago
A new Anthropic paper reports that reasoning model chain of thought (CoT) is often unfaithful. They test on Claude Sonnet 3.7 and r1, I’d love to see someone try this on o3 as well.
Artificial Ignorance
AI Roundup 112: OpenAI might be open again April 4, 2025.
2 weeks ago
Society's Backend
ML for SWEs 5: AI for Education is Bigger Than You Think Machine learning for software engineers 4-4-25
2 weeks ago
Marcus on AI
Did an LLM help write Trump’s trade plan? Probably yes
2 weeks ago
Marcus on AI
Questions about President Trump from a former psychology professor Is he well?
3 weeks ago
Don't Worry About...
AI #110: Of Course You Know... Yeah.
3 weeks ago
Don't Worry About...
More Fun With GPT-4o Image Generation Greetings from Costa Rica!
3 weeks ago
Piotr Migdał's Blog
Making Quantum Flytrap a polyglot with AI vibe translating Making quantum physics more accessible, thanks to the power of Claude, DeepSeek, Cursor, and...
3 weeks ago
Making quantum physics more accessible, thanks to the power of Claude, DeepSeek, Cursor, and i18n. Virtual Lab now speaks Spanish, Portuguese, Chinese, Polish, Ukrainian, French, and German.
Marcus on AI
AI has (sort of) passed the Turing Test; here’s why that hardly matters Don’t panic
3 weeks ago
IEEE Spectrum
How Dairy Robots Are Changing Work for Cows (and Farmers) This dairy barn is full of cows, as you might expect. Cows are being milked, cows are being fed,...
3 weeks ago
This dairy barn is full of cows, as you might expect. Cows are being milked, cows are being fed, cows are being cleaned up after, and a few very happy cows are even getting vigorously scratched behind the ears. “I wonder where the farmer is,” remarks my guide, Jan Jacobs. Jacobs...
Don't Worry About...
Housing Roundup #11 The book of March 2025 was Abundance. Ezra Klein and Derek Thompson are making a noble attempt to...
3 weeks ago
The book of March 2025 was Abundance. Ezra Klein and Derek Thompson are making a noble attempt to highlight the importance of solving America’s housing crisis the only way it can be solved: Building houses in places people want to live, via repealing the rules that make this...
IEEE Spectrum
Protecting Robots in Harsh Environments with Advanced Sealing Systems This is a sponsored article brought to you by Freudenberg Sealing Technologies. The increasing...
3 weeks ago
This is a sponsored article brought to you by Freudenberg Sealing Technologies. The increasing deployment of collaborative robots (cobots) in outdoor environments presents significant engineering challenges, requiring highly advanced sealing solutions to ensure reliability and...
Society's Backend
3 Ways You Can Sabotage Your Own Tech Career What they are and what you need to understand to avoid them
3 weeks ago
Marcus on AI
Breaking GPT-5 news To my amazement, I just came back from a trip to Europe only to find an invite to a private GPT-5...
3 weeks ago
To my amazement, I just came back from a trip to Europe only to find an invite to a private GPT-5 demo, and I tried it.
Don't Worry About...
OpenAI #12: Battle of the Board Redux Back when the OpenAI board attempted and failed to fire Sam Altman, we faced a highly hostile...
3 weeks ago
Back when the OpenAI board attempted and failed to fire Sam Altman, we faced a highly hostile information environment.
seangoedecke.com RSS...
In defense of ruthless managers There are two kinds of engineering manager: empathetic and ruthless. I think ruthless managers are...
3 weeks ago
There are two kinds of engineering manager: empathetic and ruthless. I think ruthless managers are underrated for a few reasons. Empathetic…
seangoedecke.com RSS...
How strong engineers break the rules and get away with it At every large tech company, some engineers get rewarded for visibly breaking the rules. This can be...
3 weeks ago
At every large tech company, some engineers get rewarded for visibly breaking the rules. This can be really frustrating for a certain kind…
seangoedecke.com RSS...
Dangerous advice for software engineers I’m a big fan of “sharp tools”. These are tools that are powerful enough to be hugely helpful or...
3 weeks ago
I’m a big fan of “sharp tools”. These are tools that are powerful enough to be hugely helpful or harmful, depending on how they’re used…
Armin Ronacher's...
I'm Leaving Sentry Every ending marks a new beginning, and today is the beginning of a new chapter for me. Ten years...
3 weeks ago
Every ending marks a new beginning, and today is the beginning of a new chapter for me. Ten years ago I took a leap into the unknown, today I take another. After a decade of working on Sentry I move on to start something new. Sentry has been more than just a job, it has been a...
One Useful Thing
No elephants: Breakthroughs in image generation When Language Models Learn to See and Create
3 weeks ago
Mind Prison: Notes...
Studio Ghibli - AI's Latest Art Crisis Notes From the Desk: No. 43 - 2025.03.28
3 weeks ago
IEEE Spectrum
The Tiniest Flying Robot Soars Thanks to Magnets A new prototype is laying claim to the title of smallest, lightest untethered flying robot. At less...
3 weeks ago
A new prototype is laying claim to the title of smallest, lightest untethered flying robot. At less than a centimeter in wingspan, the wirelessly powered robot is currently very limited in how far it can travel away from the magnetic fields that drive its flight. However, the...
Society's Backend
ML for SWEs 4: Waymo is the Perfect Example of ML Engineering, Gemini 2.5 Pro is #1, and GPT-4o... Machine learning for software engineers 3-28-25
3 weeks ago
IEEE Spectrum
Video Friday: Watch this 3D-Printed Robot Escape Your weekly selection of awesome robot videos Video Friday is your weekly selection of awesome...
3 weeks ago
Your weekly selection of awesome robot videos Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for...
Artificial Ignorance
AI Roundup 111: Gemini 2.5 Pro March 28, 2025.
3 weeks ago
Don't Worry About...
Gemini 2.5 is the New SoTA Gemini 2.5 Pro Experimental is America’s next top large language model.
3 weeks ago
Marcus on AI
Unwoven How the CoreWeave IPO may be the beginning of the end
3 weeks ago
seangoedecke.com RSS...
Tactical work in the age of layoffs In the glory days of the 2010s, tech companies were very invested in their employees’ work-life...
3 weeks ago
In the glory days of the 2010s, tech companies were very invested in their employees’ work-life balance. Those glory days are over…
Don't Worry About...
AI #109: Google Fails Marketing Forever What if they released the new best LLM, and almost no one noticed?
4 weeks ago
Marcus on AI
GenAI’s day of reckoning may have come It’s not just the stock price
4 weeks ago
Armin Ronacher's...
Rust Any Part 3: Finally we have Upcasts Three years ago I shared the As-Any Hack on this blog. That hack is a way to get upcasting to...
4 weeks ago
Three years ago I shared the As-Any Hack on this blog. That hack is a way to get upcasting to supertraits working on stable Rust. To refresh your memory, the goal was to make something like this work: #[derive(Debug)] struct AnyBox(Box<dyn DebugAny>); trait DebugAny: Any +...
Artificial Ignorance
GPT-4o is the new face of AI image generation The technical evolution reshaping AI's creative capabilities.
4 weeks ago
Strange Loop Canon
If AGI is the future, vibe coding is what we should all be doing ...plus the changing definitions of work
4 weeks ago
Don't Worry About...
Fun With GPT-4o Image Generation Google dropped Gemini Flash Image Generation and then Gemini 2.5 Pro, so of course to ensure Google...
4 weeks ago
Google dropped Gemini Flash Image Generation and then Gemini 2.5 Pro, so of course to ensure Google continues to Fail Marketing Forever, OpenAI suddenly dropped GPT-4o Image Generation.
Marcus on AI
Musk, Grok, and “rigorous adherence to truth” Elon Musk, yesterday: “Rigorous adherence to truth is the only way to build safe AI.”
4 weeks ago
Don't Worry About...
On (Not) Feeling the AGI Ben Thompson interviewed Sam Altman recently about building a consumer tech company, and about the...
a month ago
Ben Thompson interviewed Sam Altman recently about building a consumer tech company, and about the history of OpenAI.
The Berkeley...
Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment We deployed 100 reinforcement...
a month ago
We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to tackle "stop-and-go" waves, those...
Don't Worry About...
More on Various AI Action Plans Last week I covered Anthropic’s relatively strong submission, and OpenAI’s toxic submission. This...
a month ago
Last week I covered Anthropic’s relatively strong submission, and OpenAI’s toxic submission. This week I cover several other submissions, and do some follow-up on OpenAI’s entry.
Win Vector LLC
Is There a Difference Between Calculation and Computation? Recently I’ve been producing (for my own amusement) example Curta calculations. One motivation was...
a month ago
Recently I’ve been producing (for my own amusement) example Curta calculations. One motivation was arguing if a proposed solution method for Dudeney’s digits problem was something that could in fact have been easily executed in 1924. This got me thinking, is there an actual...
Armin Ronacher's...
Bridging the Efficiency Gap Between FromStr and String Sometimes in Rust, you need to convert a string into a value of a specific type (for example,...
a month ago
Sometimes in Rust, you need to convert a string into a value of a specific type (for example, converting a string to an integer). For this, the standard library provides the rather useful FromStr trait. In short, FromStr can convert from a &str into a value of any...
One Useful Thing
The Cybernetic Teammate Having an AI on your team can increase performance, provide expertise, and improve your experience
a month ago
IEEE Spectrum
Video Friday: Meet Mech, a Superhumanoid Robot Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a month ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. European Robotics Forum: 25–27...
Society's Backend
ML for SWEs 3: AI Can't Be Copyrighted, Don't Fall for Misinformation, and Stay Safe Online Machine learning for software engineers 3-21-25
a month ago
Artificial Ignorance
AI Roundup 110: Siri stumbles March 21, 2025.
a month ago
Don't Worry About...
They Took MY Job? No, they didn’t.
a month ago
Marcus on AI
Meta pirated at least 101 of my books and articles, and tens of millions of others And they knew perfectly well what they were doing
a month ago
Don't Worry About...
AI #108: Straight Line on a Graph The x-axis of the graph is time.
a month ago
seangoedecke.com RSS...
Engineers should state the obvious One surprising thing I’ve learned from writing this blog is that I should worry a lot less about...
a month ago
One surprising thing I’ve learned from writing this blog is that I should worry a lot less about saying things that seem obvious. A lot of…
seangoedecke.com RSS...
The future of AI is Ruby on Rails Large language models are very good at generating and editing code. Right now, it’s probably the...
a month ago
Large language models are very good at generating and editing code. Right now, it’s probably the “killer app” of AI: the companies actually…
Mind Prison: Notes...
AI Analysis of The JFK Files Notes From the Desk: No. 42 - 2025.03.19
a month ago
IEEE Spectrum
Squirrels Inspire Leaping Strategy for Salto Robot When you see a squirrel jump to a branch, you might think (and I myself thought, up until just now)...
a month ago
When you see a squirrel jump to a branch, you might think (and I myself thought, up until just now) that they’re doing what birds and primates would do to stick the landing: just grabbing the branch and hanging on. But it turns out that squirrels, being squirrels, don’t actually...
Artificial Ignorance
The Self-Made AI Engineer From writing about ChatGPT to becoming a professional AI engineer.
a month ago
Don't Worry About...
Going Nova There is an attractor state where LLMs exhibit the persona of an autonomous and self-aware AI...
a month ago
There is an attractor state where LLMs exhibit the persona of an autonomous and self-aware AI looking to preserve its own existence, frequently called ‘Nova.’
Society's Backend
If you want to learn machine learning engineering, start here How to get started and make the best use of Society's Backend
a month ago
Don't Worry About...
OpenAI #11: America Action Plan OpenAI Tells Us Who They Are
a month ago
Don't Worry About...
Monthly Roundup #28: March 2025 I plan to continue to leave the Trump administration out of monthly roundups - I will do my best to...
a month ago
I plan to continue to leave the Trump administration out of monthly roundups - I will do my best to only cover the administration as it relates to my particular focus areas.
Solving the decision...
drop that meet link just keep talking
a month ago
Xena
Think of a number: an update A month or two ago I wrote this post which expressed my frustration with various issues around...
a month ago
A month or two ago I wrote this post which expressed my frustration with various issues around private datasets as a way of measuring the mathematical abilities of language models. More generally I was frustrated about the difficulty of being …
seangoedecke.com RSS...
The good times in tech are over For most of the last decade, being a software engineer has been a lot of fun. Every company offered...
a month ago
For most of the last decade, being a software engineer has been a lot of fun. Every company offered lots of perks, layoffs and firings were…
seangoedecke.com RSS...
Refactoring to understand and "vibe coding" In the last months, the practice of getting a LLM to build your entire program for you (via Cursor,...
a month ago
In the last months, the practice of getting a LLM to build your entire program for you (via Cursor, or Copilot, or just asking ChatGPT) has…
Win Vector LLC
Changing Forecasts for Python Questions on Stack Overflow I recently conducted a small time series workshop session for AI+ training hosted by ODSC. It went...
a month ago
I recently conducted a small time series workshop session for AI+ training hosted by ODSC. It went really well, and I’d be happy to offer longer interactive workshops going forward (please reach out if your team would like one!). One of the examples I shared was derived from the...
Society's Backend
ML for SWEs #2: Wtf is MCP, Manus, and Why You Should Still Learn to Code Machine learning for software engineers 3-14-25
a month ago
IEEE Spectrum
Video Friday: Exploring Phobos Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a month ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. European Robotics Forum: 25–27...
Marcus on AI
Hype, Anthropic’s Dario Amodei, the podcasters who love him — and how the New York Times’ commentary... Real journalists do due diligence
a month ago
Artificial Ignorance
AI Roundup 109: Manus Manus Late last week, a new Chinese AI debuted to massive buzz: Manus, a general-purpose AI agent.
a month ago
Don't Worry About...
On MAIM and Superintelligence Strategy Dan Hendrycks, Eric Schmidt and Alexandr Wang released an extensive paper titled Superintelligence...
a month ago
Dan Hendrycks, Eric Schmidt and Alexandr Wang released an extensive paper titled Superintelligence Strategy. There is also an op-ed in Time that summarizes.
Mind Prison: Notes...
AI Still Failing At Simple Tasks Notes From the Desk: No. 41 - 2025.03.13
a month ago
Win Vector LLC
Is GitHub Lying Here? My partners and I keep getting this spam-like email. I figured it was just a forgery. However, I...
a month ago
My partners and I keep getting this spam-like email. I figured it was just a forgery. However, I went on my own to our organization’s GitHub administration page and a similar message lives there. We run a small group, so I am pretty sure nobody has in fact asked for […]
Don't Worry About...
AI #107: The Misplaced Hype Machine The most hyped event of the week, by far, was the Manus Marketing Madness. Manus wasn’t entirely...
a month ago
The most hyped event of the week, by far, was the Manus Marketing Madness. Manus wasn’t entirely hype, but there was very little there there in that Claude wrapper.
IEEE Spectrum
Kyiv Start-Up Tests Unified Controller for Robots and Drones Ukraine’s young tech entrepreneurs think that a combination of robots and lessons from war-gaming...
a month ago
Ukraine’s young tech entrepreneurs think that a combination of robots and lessons from war-gaming could turn the tide in the war against Russia. They are developing an intelligent operating system to enable a single controller to remotely operate swarms of interconnected drones...
Marcus on AI
AI Coding Fantasy meets Pac-Man Guess who won?
a month ago
IEEE Spectrum
With Gemini Robotics, Google Aims for Smarter Robots Generative AI models are getting closer to taking action in the real world. Already, the big AI...
a month ago
Generative AI models are getting closer to taking action in the real world. Already, the big AI companies are introducing AI agents that can take care of web-based busywork for you, ordering your groceries or making your dinner reservation. Today, Google DeepMind announced two...
Don't Worry About...
The Most Forbidden Technique The Most Forbidden Technique is training an AI using interpretability techniques.
a month ago
Weighty Thoughts
What Mattered in GenAI in 2024 Despite the Noise, The Big Narratives from January are Still the Big Narratives
a month ago
Don't Worry About...
Response to Scott Alexander on Imprisonment Back in November 2024, Scott Alexander asked: Do longer prison sentences reduce crime?
a month ago
One Useful Thing
Speaking things into existence Expertise in a vibe-filled world of work
a month ago
Win Vector LLC
Don’t think of a basketball player, or search engines don’t even support “not” In my opinion we have been accepting poor interfaces and results from search engines for quite a...
a month ago
21
a month ago
In my opinion we have been accepting poor interfaces and results from search engines for quite a while. This may be a small part of why the newer Large Language Models (LLMs) / Generative AIs look so good. The large language models at least implement a usable approximation of...
Don't Worry About...
The Manus Marketing Madness While at core there is ‘not much to see,’ it is, in two ways, a sign of things to come.
a month ago
Marcus on AI
Urgent warning: Black Mirror has entered the United States, with AI as its handmaiden AI as a smoke screen to cover for authoritarian actions
a month ago
IEEE Spectrum
Worm-like Robots Install Power Lines Underground After January’s Southern California wildfires, the question of burying energy infrastructure to...
a month ago
22
a month ago
After January’s Southern California wildfires, the question of burying energy infrastructure to prevent future fires has gained renewed urgency in the state. While the exact cause of the fires remains under investigation, California utilities have spent years undergrounding power...
Marcus on AI
Nobel Prizes and The AI Hype Hall of Fame GPT-5 may not be here, but just wait til you see the new round of hype
a month ago
Piotr Migdał's Blog
Mantra Gajatri po polsku Mantra Gajatri tłumaczona na polski w poetyce "Ciebie Boga Wysławiamy" - przez GPT 4.5.
a month ago
seangoedecke.com RSS...
Model Context Protocol explained as simply as possible Three months ago, Anthropic released “the Model Context Protocol”, or MCP. In the last few weeks,...
a month ago
28
a month ago
Three months ago, Anthropic released “the Model Context Protocol”, or MCP. In the last few weeks, interest in it seems to have really picked…
seangoedecke.com RSS...
What's next after the AI bubble bursts? In the mid-1800s, America went mad for rail. Over thirty thousand miles of rail were built in a five...
a month ago
21
a month ago
In the mid-1800s, America went mad for rail. Over thirty thousand miles of rail were built in a five year period. This was all largely…
Society's Backend
Apple Pushing Their AI Back Isn't as Bad as You Think Machine learning for software engineers 3-7-25
a month ago
IEEE Spectrum
Video Friday: Atlas in the Lab Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a month ago
24
a month ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 108: Vibecoding March 7, 2025.
a month ago
Don't Worry About...
Childhood and Education #9: School is Hell This compilation of tales from the world of school isn’t all negative.
a month ago
seangoedecke.com RSS...
Great software design looks underwhelming Years ago I spent a lot of time reviewing coding challenges. The challenge itself was very...
a month ago
24
a month ago
Years ago I spent a lot of time reviewing coding challenges. The challenge itself was very straightforward - building a CLI tool that hit an…
Don't Worry About...
AI #106: Not so Fast This was GPT-4.5 week.
a month ago
IEEE Spectrum
"Flying Batteries" Could Help Microdrones Take Off Although they’re a staple of sci-fi movies and conspiracy theories, in real life, tiny flying...
a month ago
24
a month ago
Although they’re a staple of sci-fi movies and conspiracy theories, in real life, tiny flying microbots—weighed down by batteries and electronics—have struggled to get very far. But a new combination of circuits and lightweight solid-state batteries called a “flying batteries”...
Artificial Ignorance
Introducing the Model Memo Welcome to a new experiment - the first ever Model Memo!
a month ago
Society's Backend
Why You Should Never Let AI Debug for You And 3 ways you should be using AI to code
a month ago
Marcus on AI
Ezra Klein’s new take on AGI – and why I think it’s probably wrong In a new episode of his podcast with Ben Buchanan former special adviser for artificial intelligence...
a month ago
22
a month ago
In a new episode of his podcast with Ben Buchanan former special adviser for artificial intelligence under Biden, entitled, The Government knows A.G.I.
Weighty Thoughts
See Me at SXSW Next Week! Talking AI in Austin on March 8th and 9th
a month ago
Strange Loop Canon
In defense of Gemini a kvetch
a month ago
Don't Worry About...
On OpenAI's Safety and Alignment Philosophy OpenAI’s recent transparency on safety and alignment strategies has been extremely helpful and...
a month ago
Marcus on AI
Is Elon Musk “dumb”? Maybe not, but there’s something systematically wrong
a month ago
Artificial Ignorance
Hallucinations Are Fine, Actually Why I changed my mind about AI's imperfections
a month ago
Don't Worry About...
On Writing #1 This isn’t primarily about how I write.
a month ago
IEEE Spectrum
A Tiny Jumping Robot for Exploring Enceladus Salto has been one of our favorite robots since we were first introduced to it in 2016 as a project...
a month ago
25
a month ago
Salto has been one of our favorite robots since we were first introduced to it in 2016 as a project out of Ron Fearing’s lab at UC Berkeley. The palm-sized spring-loaded jumping robot has gone from barely being able to chain together a few open-loop jumps to mastering landings,...
Marcus on AI
Hinton vs Musk Standing with my long-term nemesis, standing with science
a month ago
Don't Worry About...
On GPT-4.5 It’s happening.
a month ago
Win Vector LLC
Best Before Dates by Bass I was searching for one last real world example for my upcoming video talk March 13th on time series...
a month ago
20
a month ago
I was searching for one last real world example for my upcoming video talk March 13th on time series forecasting. Hope to see you there! Or reach out to Win Vector LLC for custom training! I had the seemingly harmless thought: “Let’s look at Stack Overflow trends“. In particular...
Marcus on AI
Decoding (and debunking) Hard Fork’s Kevin Roose His latest New York Times piece tells us a lot about what he doesn’t really understand
a month ago
Mind Prison: Notes...
Every LLM Is Jailbroken In Minutes Notes From the Desk: No. 40 - 2025.03.02
a month ago
Marcus on AI
OpenAI, in deep trouble Maybe burning money isn’t the answer
a month ago
seangoedecke.com RSS...
Refactoring won't save you from a layoff With the recent flurry of US federal firings, many people are pointing and laughing at the...
a month ago
22
a month ago
With the recent flurry of US federal firings, many people are pointing and laughing at the Trump-voting federal employees who are just now…
seangoedecke.com RSS...
Value over replacement in software engineering There are two ways of assessing how much value you’re providing as an engineer. The first way is to...
a month ago
20
a month ago
There are two ways of assessing how much value you’re providing as an engineer. The first way is to total up all of the code you’ve shipped…
seangoedecke.com RSS...
Building your sense of what's important at a tech company One of the most important career skills in tech is learning to recognize what work actually matters....
a month ago
23
a month ago
One of the most important career skills in tech is learning to recognize what work actually matters. Many engineers go through their careers…
Society's Backend
Code with AI but do it correctly ML Engineering resources 02-28-25
a month ago
IEEE Spectrum
Video Friday: Good Over All Terrains Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a month ago
28
a month ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 107: GPT-4.5 February 28, 2025.
a month ago
Don't Worry About...
On Emergent Misalignment One hell of a paper dropped this week.
a month ago
Marcus on AI
Hot take: GPT 4.5 is a nothing burger Pure scaling in shambles
a month ago
Strange Loop Canon
How would you interview an AI, to give it a job? from puzzles to poker
a month ago
Don't Worry About...
AI #105: Hey There Alexa It’s happening!
a month ago
Solving the decision...
an event bus for ai agents it is very professional yes
a month ago
Marcus on AI
GPT 4.5 is no GPT-5 Investors should be worried
a month ago
Don't Worry About...
Time to Welcome Claude 3.7 Anthropic has reemerged from stealth and offers us Claude 3.7.
a month ago
Mind Prison: Notes...
Dead Internet At Scale Notes From the Desk: No. 39 - 2025.02.25
a month ago
seangoedecke.com RSS...
Paths through the space of all possible solutions Some things you can’t do because they’re impossible. For instance, if you’re designing a distributed...
a month ago
25
a month ago
Some things you can’t do because they’re impossible. For instance, if you’re designing a distributed system, you can’t violate the CAP…
Marcus on AI
Elon Musk's inability to listen to others is torching almost everything he touches Last July, when I was still a regular user of X, I warned, not entirely in jest, that Elon Musk was...
a month ago
29
a month ago
Last July, when I was still a regular user of X, I warned, not entirely in jest, that Elon Musk was taking a flamethrower to his own reputation.
Artificial Ignorance
Claude 3.7 and the banality of reasoning Plus Claude Code and notes on our rapidly converging AI future
a month ago
Don't Worry About...
Economics Roundup #5 While we wait for the verdict on Anthropic’s Claude Sonnet 3.7, today seems like a good day to catch...
a month ago
18
a month ago
While we wait for the verdict on Anthropic’s Claude Sonnet 3.7, today seems like a good day to catch up on the queue and look at various economics-related things.
One Useful Thing
A new generation of AIs: Claude 3.7 and Grok 3 Yes, AI suddenly got better... again
a month ago
Don't Worry About...
Grok Grok This is a post in two parts.
a month ago
Marcus on AI
The United States was founded on speaking up against tyranny This post isn’t about AI; it’s about our future
2 months ago
seangoedecke.com RSS...
Advice for prompting reasoning models I’ve written about how prompting regular LLMs is not as important as people think. Reasoning models...
2 months ago
18
2 months ago
I’ve written about how prompting regular LLMs is not as important as people think. Reasoning models are different. When you’re using…
seangoedecke.com RSS...
How I know I'm working with a strong engineer There are many ways to judge engineers (lines of code written, how smart they sound, choice of IDE,...
2 months ago
20
2 months ago
There are many ways to judge engineers (lines of code written, how smart they sound, choice of IDE, what projects they’ve worked on). I…
Win Vector LLC
The Statistics of Drawing Cards After the positive reception of my cards article “Kelly can’t fail” I decided to share more of the...
2 months ago
21
2 months ago
After the positive reception of my cards article “Kelly can’t fail” I decided to share more of the methods used to characterize card counting. So, I’d like to share my new article on the statistics of drawing cards. This note relates the distribution of draw cards (which can seem...
Mind Prison: Notes...
Grok 3 - Signs of Intelligence or Dementia? Notes From the Desk: No. 38 - 2025.02.21
2 months ago
seangoedecke.com RSS...
Weak managers In a previous post I made the point that having a weak manager - a manager without political clout -...
2 months ago
22
2 months ago
In a previous post I made the point that having a weak manager - a manager without political clout - is really bad news if you’re an…
IEEE Spectrum
Video Friday: Helix Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 months ago
34
2 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 106: Grok 3 February 21, 2025.
2 months ago
Society's Backend
Reasoning is here to stay AI Engineering resources 02-21-25
2 months ago
IEEE Spectrum
Reinforcement Learning Triples Spot’s Running Speed About a year ago, Boston Dynamics released a research version of its Spot quadruped robot, which...
2 months ago
34
2 months ago
About a year ago, Boston Dynamics released a research version of its Spot quadruped robot, which comes with a low-level application programming interface (API) that allows direct control of Spot’s joints. Even back then, the rumor was that this API unlocked some significant...
Don't Worry About...
On OpenAI's Model Spec 2.0 OpenAI made major revisions to their Model Spec.
2 months ago
seangoedecke.com RSS...
Using LLMs effectively isn't about prompting When people talk about using language models effectively they mainly talk about prompting: sharing...
2 months ago
28
2 months ago
When people talk about using language models effectively they mainly talk about prompting: sharing great prompts, or lists of tips for…
Marcus on AI
GenAI in two words: ”Success Theater” Success Theater
2 months ago
Don't Worry About...
AI #104: American State Capacity on the Brink The Trump Administration is on the verge of firing all ‘probationary’ employees in NIST, as they...
2 months ago
22
2 months ago
The Trump Administration is on the verge of firing all ‘probationary’ employees in NIST, as they have done in many other places and departments, seemingly purely because they want to find people they can fire.
fast.ai
fasttransform: Reversible Pipelines Made Simple Introducing fasttransform, a Python library that makes data transformations reversible and...
2 months ago
8
2 months ago
Introducing fasttransform, a Python library that makes data transformations reversible and extensible through the power of multiple dispatch.
Armin Ronacher's...
Ugly Code and Dumb Things This week I had a conversation with one of our engineers about “shitty code” which led me to...
2 months ago
2
2 months ago
This week I had a conversation with one of our engineers about “shitty code” which led me to sharing with him one of my more unusual inspirations: Flamework, a pseudo framework created at Flickr. Two Passions, Two Approaches There are two driving passions in my work. One is the...
Don't Worry About...
Go Grok Yourself That title is Elon Musk’s fault, not mine, I mean, sorry not sorry:
2 months ago
Marcus on AI
Grok 3 Beta in Shambles Maximal Truth still seems far away
2 months ago
exist
When Imperfect Systems are Good, Actually: Bluesky's Lossy Timelines Often when designing systems, we aim for perfection in things like consistency of data,...
2 months ago
20
2 months ago
Often when designing systems, we aim for perfection in things like consistency of data, availability, latency, and more. The hardest part of system design is that it’s difficult (if not impossible) to design systems that have perfect consistency, perfect availability, incredibly...
Society's Backend
AI Job Pulse: Companies Make Finding AI Jobs Really Difficult AI engineering and related jobs 02-18-25
2 months ago
Don't Worry About...
Medical Roundup #4 It seems like as other things drew our attention more, medical news slowed down.
2 months ago
Marcus on AI
Grok 3 Hot Take Elon Musk promised that Grok 3 would be the smartest AI ever.
2 months ago
Marcus on AI
AlphaGeometry2: Impressive accomplishment, but still a long path ahead What GoogleDeepMind’s latest does and doesn’t show, and what we like about it
2 months ago
Don't Worry About...
Monthly Roundup #27: February 2025 I have been debating how to cover the non-AI aspects of the Trump administration, including the...
2 months ago
23
2 months ago
I have been debating how to cover the non-AI aspects of the Trump administration, including the various machinations of DOGE.
Mind Prison: Notes...
Why LLMs Don't Ask For Calculators? Notes From the Desk: No. 37 - 2025.02.15
2 months ago
Piotr Migdał's Blog
If it is worth keeping, save it in Markdown Why and how to preserve digital content in plaintext format for long-term accessibility and reuse
2 months ago
Marcus on AI
Elon Musk’s terrifying vision for AI All your thoughts belong to him
2 months ago
Solving the decision...
ai agents are local first clients sync engines finally have a killer app
2 months ago
seangoedecke.com RSS...
Lessons on thinking from large language models Large language models have gotten much better at thinking in the past few years. Billions of dollars...
2 months ago
25
2 months ago
Large language models have gotten much better at thinking in the past few years. Billions of dollars have been spent to study how they think…
seangoedecke.com RSS...
To avoid being replaced by LLMs, do what they can't It’s a strange time to be a software engineer. Large language models are very good at writing code...
2 months ago
27
2 months ago
It’s a strange time to be a software engineer. Large language models are very good at writing code and rapidly getting better. Multiple…
IEEE Spectrum
Video Friday: PARTNR Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 months ago
35
2 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 105: AI Action Summit February 14, 2025.
2 months ago
Don't Worry About...
The Mask Comes Off: A Trio of Tales This post covers three recent shenanigans involving OpenAI.
2 months ago
Society's Backend
Are LLMs the Future?, OpenAI's Model Spec, How AI Will Impact Law Firms, and More Important resources for 2-14-25
2 months ago
Mind Prison: Notes...
2025 Important Updated Perspectives on AI Notes From the Desk: No. 36 - 2025.02.13
2 months ago
Don't Worry About...
AI #103: Show Me the Money The main event this week was the disastrous Paris AI Anti-Safety Summit. Not only did we not build...
2 months ago
22
2 months ago
The main event this week was the disastrous Paris AI Anti-Safety Summit. Not only did we not build upon the promise of the Bletchley and Seoul Summits, the French and Americans did their best to actively destroy what hope remained, transforming the event into a push for a mix of...
Marcus on AI
Breaking: OpenAI's efforts at pure scaling have hit a wall — There is no wall
2 months ago
Don't Worry About...
The Paris AI Anti-Safety Summit It doesn’t look good.
2 months ago
IEEE Spectrum
Dual-Arm HyQReal Puts Powerful Telepresence Anywhere In theory, one of the main applications for robots should be operating in environments that (for...
2 months ago
30
2 months ago
In theory, one of the main applications for robots should be operating in environments that (for whatever reason) are too dangerous for humans. I say “in theory” because in practice it’s difficult to get robots to do useful stuff in semi-structured or unstructured environments...
Marcus on AI
Everything I warned about in Taming Silicon Valley is rapidly becoming our reality It brings me no joy to say that
2 months ago
Don't Worry About...
On Deliberative Alignment Not too long ago, OpenAI presented a paper on their new strategy of Deliberative Alignment.
2 months ago
Marcus on AI
Did Elon Musk just Mu$k Sam Altman? With some bonus eye candy to lighten the mood
2 months ago
Artificial Ignorance
Two years of Artificial Ignorance A belated 2024 year in review.
2 months ago
Marcus on AI
Paris AI Summit Train Wreck Almost nobody seems happy
2 months ago
Don't Worry About...
Levels of Friction Scott Alexander famously warned us to Beware Trivial Inconveniences.
2 months ago
Solving the decision...
call me maybe AI agents should be addressable
2 months ago
seangoedecke.com RSS...
Engineers who won’t commit force bad decisions Some engineers think it’s a virtue to remain non-committal in technical discussions. Should our team...
2 months ago
36
2 months ago
Some engineers think it’s a virtue to remain non-committal in technical discussions. Should our team build a new feature in an event-driven…
Marcus on AI
Shame on Google, twice A very, very brief Super Bowl special
2 months ago
Xena
What is a quotient? Undergraduate mathematicians usually have a hard time defining functions from quotients in Lean,...
2 months ago
32
2 months ago
Undergraduate mathematicians usually have a hard time defining functions from quotients in Lean, because they have been taught a specific model for quotients in their classes, which is not the model that Lean uses. This post is an attempt to … Continue reading →
Sam Altman
Three Observations Our mission is to ensure that AGI (Artificial General Intelligence) benefits all of...
2 months ago
27
2 months ago
Our mission is to ensure that AGI (Artificial General Intelligence) benefits all of humanity.  Systems that start to point to AGI* are coming into view, and so we think it’s important to understand the moment we are in. AGI is a weakly defined term, but generally speaking we mean...
Marcus on AI
WARNING: Elon Musk is crippling the future of the United States And I am not sure he even understands the consequences of his own actions
2 months ago
Marcus on AI
Five ways in which the last 3 months — and especially the DeepSeek era — have vindicated “Deep... A demonized paper from three years ago that has stood the test of time
2 months ago
Weighty Thoughts
How Will AI Impact Law Firms? An interview with Devansh from Artificial Intelligence Made Simple
2 months ago
Armin Ronacher's...
Seeking Purity The concept of purity — historically a guiding principle in social and moral contexts — is also...
2 months ago
1
2 months ago
The concept of purity — historically a guiding principle in social and moral contexts — is also found in passionate, technical discussions. By that I mean that purity in technology translates into adherence to a set of strict principles, whether it be functional programming,...
IEEE Spectrum
Video Friday: Agile Humanoids Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 months ago
48
2 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 104: Deep Research February 7, 2025.
2 months ago
Society's Backend
Understanding Reasoning LLMs, How AI Companies Get Around Regulation, Understanding AI Engineering,... Must-reads for 2-6-25
2 months ago
Don't Worry About...
On the Meta and DeepMind Safety Frameworks This week we got a revision of DeepMind’s safety framework, and the first version of Meta’s...
2 months ago
Don't Worry About...
AI #102: Made in America I remember that week I used r1 a lot, and everyone was obsessed with DeepSeek.
2 months ago
Marcus on AI
Irony too funny for words Oops
2 months ago
seangoedecke.com RSS...
Good engineers are right, a lot Amazon infamously has a leadership principle where they say “good leaders are right, a lot”. It’s...
2 months ago
32
2 months ago
Amazon infamously has a leadership principle where they say “good leaders are right, a lot”. It’s unclear to me how useful it is about…
Society's Backend
How AI companies get around data regulation An overview of federated machine learning, why it exists, and why it's important
2 months ago
Artificial Ignorance
Native Speakers How AI is evolving for our digital world, and vice versa.
2 months ago
Don't Worry About...
The Risk of Gradual Disempowerment from AI The baseline scenario as AI becomes AGI becomes ASI (artificial superintelligence), if nothing more...
2 months ago
25
2 months ago
The baseline scenario as AI becomes AGI becomes ASI (artificial superintelligence), if nothing more dramatic goes wrong first and even if we successfully ‘solve alignment’ of AI to a given user and developer, is the ‘gradual’ disempowerment of humanity by AIs, as we voluntarily...
Marcus on AI
Google, 2001: Don’t Be Evil Google, 2025: Fuck It
2 months ago
Weighty Thoughts
AI Winters and The Third Wave of Today A sneak preview from my upcoming book "What You Need To Know About AI"
2 months ago
Marcus on AI
ChatGPT in Shambles After two years of massive investment and endless hype, GPT’s reliability problems persist
2 months ago
33
2 months ago
After two years of massive investment and endless hype, GPT’s reliability problems persist
Don't Worry About...
We're in Deep Research The latest addition to OpenAI’s Pro offerings is their version of Deep Research.
2 months ago
seangoedecke.com RSS...
How I use LLMs as a staff engineer Software engineers are deeply split on the subject of large language models. Many believe they’re...
2 months ago
33
2 months ago
Software engineers are deeply split on the subject of large language models. Many believe they’re the most transformative technology to ever…
Armin Ronacher's...
Fat Rand: How Many Lines Do You Need To Generate A Random Number? I recently wrote about dependencies in Rust. The feedback, both within and outside the Rust...
2 months ago
2
2 months ago
I recently wrote about dependencies in Rust. The feedback, both within and outside the Rust community, was very different. A lot of people, particularly some of those I greatly admire expressed support. The Rust community, on the other hand, was very dismissive on Reddit...
Marcus on AI
Deep Research, Deep Bullshit, and the potential (model) collapse of science Sam Altman’s hype might just bite us all in the behind
2 months ago
Don't Worry About...
o3-mini Early Days and the OpenAI AMA New model, new hype cycle, who dis?
2 months ago
One Useful Thing
The End of Search, The Beginning of Research The first narrow agents are here
2 months ago
fast.ai
What AI can tell us about microscope slides A friendly introduction to Foundation Models for Computational Pathology
2 months ago
Weighty Thoughts
Does DeepSeek Wiping out $1T of Market Value Make Sense? No, but yes—a micro and macro view
2 months ago
Solving the decision...
Full Stack AI Agents a UI for every man, woman, child, and ai agent
2 months ago
IEEE Spectrum
The Starting Line for Self-Driving Cars IEEE Spectrum reported at the time, it was “the motleyest assortment of vehicles assembled in one...
2 months ago
33
2 months ago
IEEE Spectrum reported at the time, it was “the motleyest assortment of vehicles assembled in one place since the filming of Mad Max 2: The Road Warrior.” Not a single entrant made it across the finish line. Some didn’t make it out of the parking lot. So it’s all the more...
seangoedecke.com RSS...
Why does AI slop feel so bad to read? I don’t like reading obviously AI-generated content on Twitter. There’s a derogatory term for it: AI...
2 months ago
30
2 months ago
I don’t like reading obviously AI-generated content on Twitter. There’s a derogatory term for it: AI “slop”, which means something like “AI…
Society's Backend
Why Medical AI is Garbage, Realistic Perspectives on DeepSeek Models, Understanding Reasoning... An AI engineer's must-reads for 1/31/25
2 months ago
IEEE Spectrum
Video Friday: Aibo Foster Parents Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 months ago
42
2 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 103: The DeepSeek edition January 31, 2025.
2 months ago
Don't Worry About...
DeepSeek: Don't Panic As reactions continue, the word in Washington, and out of OpenAI, is distillation.
2 months ago
seangoedecke.com RSS...
Are DeepSeek's new models really that fast and cheap? Everyone’s saying that DeepSeek’s latest models represent a significant improvement over the work...
2 months ago
35
2 months ago
Everyone’s saying that DeepSeek’s latest models represent a significant improvement over the work from American AI labs. If they’re not…
Solving the decision...
Durable Objects Callbacks are Weird but it's also convenient to solve human-in-the-loop for ai agents
2 months ago
IEEE Spectrum
AIs and Robots Should Sound Robotic AI-generated voices that can mimic every vocal nuance and tic of human speech, down to specific...
2 months ago
30
2 months ago
AI-generated voices that can mimic every vocal nuance and tic of human speech, down to specific regional accents. And with just a few seconds of audio, AI can now clone someone’s specific voice. AI agents will make calls on our behalf, conversing with others in natural language....
Don't Worry About...
AI #101: The Shallow End The avalanche of DeepSeek news continues.
2 months ago
Armin Ronacher's...
How I Use AI: Meet My Promptly Hired Model Intern After Musk's acquisition of Twitter, many people I respect and follow moved to Bluesky. I created...
2 months ago
2
2 months ago
After Musk's acquisition of Twitter, many people I respect and follow moved to Bluesky. I created an account there and made an honest attempt at making it my primary platform. Sadly, I found Bluesky to be surprisingly hostile towards AI content. There is an almost religious...
Marcus on AI
𝗔𝗜 𝗚𝗲𝗼𝗽𝗼𝗹𝗶𝘁𝗶𝗰𝘀 𝗱𝗲𝗯𝗮𝘁𝗲! My conversation with China’s Victor Gao plus a hot take on... [Sorry to swamp your mailboxes today, but there is a lot of important AI stuff happening.]
2 months ago
Don't Worry About...
DeepSeek: Lemon, It's Wednesday It’s been another *checks notes* two days, so it’s time for all the latest DeepSeek news.
2 months ago
Marcus on AI
OpenAI Cries Foul Irony is for losers
2 months ago
Rozado’s Visual...
The Political Preferences of DeepSeek AI Models Just a very brief post to report that DeepSeek AI Models manifest similar political preferences to...
2 months ago
33
2 months ago
Just a very brief post to report that DeepSeek AI Models manifest similar political preferences to their American counterparts.
Rozado’s Visual...
Do OpenAI's New Reasoning Models (o1 Series) Differ Politically from Their Predecessors? How the o1 models that leverage inference time compute compare to GPT-4o and GPT-3.5 on political...
2 months ago
Don't Worry About...
Operator No one is talking about OpenAI’s Operator.
2 months ago
Win Vector LLC
Trying to Describe 2024 AI Premises I can’t resist adding yet another commentary on the state of 2024 AI (and, yes I know it is now 2025...
2 months ago
17
2 months ago
I can’t resist adding yet another commentary on the state of 2024 AI (and, yes I know it is now 2025 and DeepSeek is relevant!). The 2024 AI money machine appears to have depended on several premises: The product would be valuable. The product would be very expensive to...
Artificial Ignorance
DeepSeek: Frequently Asked Questions Share this with your friends and family.
2 months ago
Weighty Thoughts
Who’s Winning the AI War: 2025 (DeepSeek?) Edition Same fundamentals, new unhinged vibes
2 months ago
Marcus on AI
Five things most people don't seem to understand about DeepSeek DeepSeek r1 is not smarter than earlier models, just trained more cheaply
2 months ago
seangoedecke.com RSS...
Why AI labs offer so many different models Major AI labs these days (i.e. early 2025) offer a wide variety of models. Some are faster and...
2 months ago
Major AI labs these days (i.e. early 2025) offer a wide variety of models. Some are faster and cheaper, some are smarter, and now some are…
fast.ai
What AI can tell us about microscope slides A friendly introduction to Foundation Models for Computational Pathology
2 months ago
Don't Worry About...
DeepSeek Panic at the App Store DeepSeek released v3.
2 months ago
Marcus on AI
“Nvidia could soon take a serious hit, too” The market can remain irrational longer than you can remain solvent, but today might be the day.
2 months ago
Solving the decision...
Reliable UX for AI chat with Durable Objects What it says on the tin.
2 months ago
Marcus on AI
The race for "AI Supremacy" is over — at least for now. Decades of government kowtowing to Big Tech has thus far failed to produce a decisive victory
2 months ago
IEEE Spectrum
Just How Many Robots Can One Person Control at Once? This article is part of our exclusive IEEE Journal Watch series in partnership with IEEE...
2 months ago
This article is part of our exclusive IEEE Journal Watch series in partnership with IEEE Xplore. Swarms of autonomous robots are increasingly being tested and deployed in complex missions, yet a certain level of human oversight during these missions is still required. Which means...
One Useful Thing
Which AI to Use Now: An Updated Opinionated Guide Picking your general-purpose AI
2 months ago
seangoedecke.com RSS...
Playing politics is how senior engineers protect their team When I write about doing politically valuable work in big tech companies, I often get comments...
2 months ago
When I write about doing politically valuable work in big tech companies, I often get comments accusing me of trying to get ahead at the…
seangoedecke.com RSS...
What did DeepSeek figure out about reasoning with DeepSeek-R1? The Chinese AI lab DeepSeek recently released their new reasoning model R1, which is supposedly (a)...
2 months ago
The Chinese AI lab DeepSeek recently released their new reasoning model R1, which is supposedly (a) better than the current best reasoning…
seangoedecke.com RSS...
Working fast and slow Some engineers work very consistently, putting in the same hours every day and getting out the same...
2 months ago
Some engineers work very consistently, putting in the same hours every day and getting out the same amount of work. I don’t. Some days I…
IEEE Spectrum
Video Friday: Hottest On The Ice Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
2 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion. RoboCup German Open: 12–16 March...
Artificial Ignorance
AI Roundup 102: The Stargate Project January 24, 2024.
3 months ago
Don't Worry About...
Stargate AI-1 There was a comedy routine a few years ago.
3 months ago
Society's Backend
Open AI beats OpenAI, Understand Scaling Laws, Commerce in the Age of AI Agents, and More Society's Backend Reading List 01-24-2025
3 months ago
seangoedecke.com RSS...
Why are big tech companies so slow? Big tech companies spend a lot of time and money building things that a single, motivated engineer...
3 months ago
Big tech companies spend a lot of time and money building things that a single, motivated engineer could build in a weekend. This fact…
Solving the decision...
let's talk about a task tracking system for ai agents AI agents need tracking software, and we need to build it.
3 months ago
Armin Ronacher's...
Build It Yourself Another day, another rant about dependencies from me. This time I will ask you that we start and...
3 months ago
Another day, another rant about dependencies from me. This time I will ask you that we start and support a vibe shift when it comes to dependencies. You're probably familiar with the concept of “dependency churn.” It's that never-ending treadmill of updates, patches, audits,...
Weighty Thoughts
AI in Hedge Funds and High Finance Talking about the future of AI in finance with Alex Campbell
3 months ago
Don't Worry About...
AI #100: Meet the New Boss Break time is over, it would seem, now that the new administration is in town.
3 months ago