Daniel Miessler
My Mom Died on Saturday
My mom died on Saturday. My biological mother became unable to function when I was around five due...
over a year ago
My mom died on Saturday. My biological mother became unable to function when I was around five due to mental illness, which left my dad and me on our own. Sometime after we were blessed with a strong, beautiful soul named Rhonda. My dad and I were like rescue dogs, and she saved...
Strange Loop Canon
Seeing Like A Network
Dark Forests, Dense Networks
6 months ago
One Useful Thing
On giving AI eyes and ears
AI can listen and see, with bigger implications than we might realize.
a year ago
Society's Backend
OpenAI’s Blunder is a Loss for the ML Community
The timeline and why OpenAI’s actions are a big deal
7 months ago
One Useful Thing
Detecting the Secret Cyborgs
The AI Trap for Organizations
a year ago
Artificial Ignorance
The State of AI Engineering (2024)
Notes from the AI Engineer World's Fair.
5 months ago
Artificial Ignorance
GPTs won't make you rich
But they'll make you more productive.
11 months ago
Made by Ollin
NVIDIA Internship (2017)
Notes on my internship at NVIDIA Redmond.
over a year ago
Artificial Ignorance
AI Roundup 082: System prompts
August 30, 2024.
4 months ago
One Useful Thing
When you give a Claude a mouse
Some quick impressions of an actual agent
2 months ago
Matt Mazur
It’s Time to Build
It’s been a few months so I wanted to say hey to the 7 of you who follow this blog and share a few...
8 months ago
It’s been a few months so I wanted to say hey to the 7 of you who follow this blog and share a few updates about what I’ve been up to. Quick recap At the start of 2023 I quit consulting to go full time on Preceden, my SaaS timeline maker, after growing it on …
The Berkeley...
Modeling Extremely Large Images with $x$T
As computer vision researchers, we believe that every pixel can tell a story. However, there seems...
9 months ago
As computer vision researchers, we believe that every pixel can tell a story. However, there seems to be a writer’s block settling into the field when it comes to dealing with large images. Large images are no longer rare—the cameras we carry in our pockets and those orbiting our...
Marcus on AI
Don’t Ride This Bike! Generative AI’s persistent trouble with compositionality and parts
When the text-to-image AI generation system DALL-E2 was released in April 2022, the two of us,...
3 weeks ago
When the text-to-image AI generation system DALL-E2 was released in April 2022, the two of us, together with Scott Aaronson, ran some informal experiments to probe its abilities.
One Useful Thing
On speaking to AI
Voice changes a lot of things
5 months ago
One Useful Thing
The shape of the shadow of The Thing
We can start to see, dimly, what the near future of AI looks like.
a year ago
Artificial Ignorance
How a $2000/hour escort uses AI to automate sex work
Listen now | A conversation with Adelyn Moore, an independent escort and adult content creator.
5 months ago
fast.ai
I was an AI researcher. Now, I am an immunology student.
Last year, I became captivated by a new topic in a way that I hadn’t felt since I first discovered...
a year ago
Last year, I became captivated by a new topic in a way that I hadn’t felt since I first discovered machine learning
One Useful Thing
Magic for English Majors
Programming in prose in an AI-haunted world
a year ago
Artificial Ignorance
AI Roundup 057: Claude 3
March 8, 2024.
10 months ago
Artificial Ignorance
AI Roundup 046: AI has a CSAM problem
December 22, 2023.
a year ago
Society's Backend
You Are What You Eat: Digital Edition
Why who you follow is more important than who follows you
a year ago
Marcus on AI
An AI rumor you won’t want to miss
Converging evidence that the core hypothesis driving generative AI may be wrong
a month ago
Society's Backend
Weekly Backend #5: 55 Resources
The inner workings of transformers, Machine Learning Q and AI, machine learning classifies harmful...
8 months ago
The inner workings of transformers, Machine Learning Q and AI, machine learning classifies harmful viruses, and more
One Useful Thing
The Homework Apocalypse
Fall is going to be very different this year. Educators need to be ready.
a year ago
Rozado’s Visual...
The Great Awokening as a Global Phenomenon
The striking synchronicity with which Great Awokening terminology increased in news media worldwide
a year ago
Rozado’s Visual...
Define Wokeness! Or how you shall know a word by the company it keeps
Visualizing what words often appear in the vicinity of woke/wokeness in news media content...
a year ago
Visualizing what words often appear in the vicinity of woke/wokeness in news media content illustrates why communication is almost impossible between red and blue America
The Berkeley...
Goal Representations for Instruction Following
a year ago
A longstanding goal of the field of robot learning has been to create generalist agents that can perform tasks for humans. Natural language has...
IEEE Spectrum
This Inventor Is Molding Tomorrow’s Inventors
This article is part of our special report, “Reinventing Invention: Stories from Innovation’s...
2 months ago
This article is part of our special report, “Reinventing Invention: Stories from Innovation’s Edge.”
Marina Umaschi Bers has long been at the forefront of technological innovation for kids. In the 2010s, while teaching at Tufts University, in Massachusetts, she codeveloped the...
Daniel Miessler
NO. 355 | NEWS & ANALYSIS SERIES
SECURITY NEWS ⛔️ There is likely to be a critical TLS vulnerability released this week. Consider...
over a year ago
SECURITY NEWS ⛔️ There is likely to be a critical TLS vulnerability released this week. Consider getting your teams ready by looking for your instances before it drops. ZDNET | GLOBALSIGN | REDDIT DISCUSSION The US accused 13 Chinese nationals of committing espionage-related...
Artificial Ignorance
The AI research tool that saves me hours every week
And why it might revolutionize the search industry.
a year ago
Artificial Ignorance
Tutorial: How to build an AI agent (Part 1)
Part 1: What is an agent, and how do they work?
a year ago
Society's Backend
Taking the AI to Consumers
The Google Pixel 8 Pro is a much bigger deal than you think
a year ago
fast.ai
AI Safety and the Age of Dislightenment
Model licensing & surveillance will likely be counterproductive by concentrating power in...
a year ago
Model licensing & surveillance will likely be counterproductive by concentrating power in unsustainable ways
Artificial Ignorance
10 AI stories that shaped 2024
Agents, deepfakes, Strawberries, and more.
a week ago
Artificial Ignorance
AI Roundup 094: Pixtral et Le Chat
November 22, 2024.
a month ago
Weighty Thoughts
What is Defensibility?
Back to basics for AI startups and others
9 months ago
The Gradient
Why Doesn’t My Model Work?
Have you ever trained a model you thought was good, but then it failed miserably when applied to...
10 months ago
Have you ever trained a model you thought was good, but then it failed miserably when applied to real world data? If so, you’re in good company.
One Useful Thing
Not much is changing, a lot is changing
OpenAI, Microsoft, and the OpenOffspring
a year ago
Artificial Ignorance
AI Roundup 050: Synthetic Geometry
January 19, 2024.
11 months ago
Society's Backend
Meet Devansh: Lessons Learned While Building a Community of 193,000 Subscribers
Content creation and networking lessons from a wildly successful freelance tech writer
a year ago
The Gradient
What Do LLMs Know About Linguistics? It Depends on How You Ask
On the phenomenon of LLM sensitivity to prompting choices through two core linguistic tasks and...
a year ago
On the phenomenon of LLM sensitivity to prompting choices through two core linguistic tasks and categorize how specific prompting choices can affect the model's behavior.
Artificial Ignorance
Getting to the top of the GPT Store (and building an AI-native search engine, too)
Listen now | A conversation with Christian Salem, founder and CPO of Consensus.
8 months ago
AI Snake Oil
GPT-4 and professional benchmarks: the wrong answer to the wrong question
OpenAI may have tested on the training data. Besides, human benchmarks are meaningless for bots.
a year ago
AI Snake Oil
Will AI transform law?
The hype is not supported by current evidence
11 months ago
Artificial Ignorance
Case Study: Scaling customer intelligence
Analyzing 10,000 sales calls with Claude
a month ago
Daniel Miessler
NO. 358 | NEWS, ANALYSIS & DISCOVERY
🦃 We're doing our second-ever discount on UL Membership starting the day after Thanksgiving. But...
over a year ago
🦃 We're doing our second-ever discount on UL Membership starting the day after Thanksgiving. But that's a Friday, so I'm going to enable the discount link earlier. How early, and how much of a discount? You'll have to find out. If the link works before the date, then it's...
Frank’s Ramblings
Vision Transformers are Overrated
Vision transformers (ViTs) have seen an incredible rise in the past four years. They have an obvious...
a year ago
Vision transformers (ViTs) have seen an incredible rise in the past four years. They have an obvious upside: in a visual recognition setting, the receptive field of a pure ViT is effectively the entire image 1. In particular, vanilla ViTs maintain the quadratic time complexity...
Weighty Thoughts
Scaling is a Choice
Progress in tech is rarely “inevitable.” Looking at semiconductors and AI.
2 weeks ago
PromptArmor Blog
Announcing LASEC: LLM Application Security Executive Certification
Including a never before seen exploit from PromptArmor's cutting edge threat intelligence team.
9 months ago
The Berkeley...
Interactive Fleet Learning
Figure 1: “Interactive Fleet Learning” (IFL) refers to robot fleets in industry and academia that...
a year ago
Figure 1: “Interactive Fleet Learning” (IFL) refers to robot fleets in industry and academia that fall back on human teleoperators when necessary and continually learn from them over time.
In the last few years we have seen an exciting development in robotics and artificial...
Sam Altman
Idea Generation
The most common question prospective startup founders ask is how to get ideas for startups. The...
over a year ago
The most common question prospective startup founders ask is how to get ideas for startups. The second most common question is if you have any ideas for their startup.
But giving founders an idea almost always doesn’t work. Having ideas is among the most important qualities for...
One Useful Thing
Scaling: The State of Play in AI
A brief intergenerational pause...
3 months ago
One Useful Thing
Gradually, then Suddenly: Upon the Threshold
Small improvements can lead to big changes
6 months ago
Daniel Miessler
Summary: Andrej Karpathy on Lex Fridman’s Podcast (Late 2022)
a year ago
Artificial Ignorance
States are racing ahead of Congress to regulate deepfakes
Several states just banned deepfakes in political ads and porn.
9 months ago
Artificial Ignorance
Blade Runner 2024
Reddit's AI Hunters and the quest for authentic digital experiences.
4 months ago
Strange Loop Canon
Power, money and human nature
Thoughts on OpenAI, a tragedy
a year ago
Artificial Ignorance
A stroll through Google's Model Garden
What generative AI capabilities does Google offer to developers?
a year ago
Strange Loop Canon
AI bill vetoed; what's next?
Make AI regulations evidence based
3 months ago
Strange Loop Canon
Why AI hasn’t shown up in the GDP statistics yet
AI is meant to bring us closer to utopia, according to its builders.
5 months ago
Daniel Miessler
News, Analysis, and Discovery | NO. 356
SECURITY NEWS TikTok has now admitted, after denying last week, that Chinese staff can in fact read...
over a year ago
SECURITY NEWS TikTok has now admitted, after denying last week, that Chinese staff can in fact read European TikTok data. Pressure is increasing across the US government to outright ban the app, but it's quickly becoming national infrastructure for so many young people. MORE |...
Daniel Miessler
Sponsored Interview: Erkang Zheng of JupiterOne
In this standalone episode we’re doing a sponsored interview with Erkang Zheng of JupiterOne. So...
over a year ago
In this standalone episode we’re doing a sponsored interview with Erkang Zheng of JupiterOne. So Jupiter One is a special company to me. I just built a vuln management program at Robinhood based around them, and I believe so much in their vision that I’m looking to actually...
One Useful Thing
Superhuman: What can AI do in 30 minutes?
AI multiplies your efforts. I found out by how much...
a year ago
PromptArmor Blog
Coming soon
This is PromptArmor Blog.
a year ago
Society's Backend
Founder Mode, How AI Impacts Education, Diffusion Models As Real-Time Game Engines, and More
Machine learning resources and updates 2024-09-03
4 months ago
One Useful Thing
The Best Available Human Standard
What are the imperatives of the upside?
a year ago
Weighty Thoughts
The AI Executive Order
AKA: “We’ll do something about it later”
a year ago
One Useful Thing
Which AI should I use? Superpowers and the State of Play
And then there were three...
9 months ago
Artificial Ignorance
AI Roundup 026: AudioCraft
August 4, 2023.
a year ago
Artificial Ignorance
AI Roundup 056: Data deals
March 1, 2024.
10 months ago
AI Snake Oil
The bait and switch behind AI risk prediction tools
Toronto recently used an AI tool to predict when a public beach will be safe. It went horribly awry....
over a year ago
Toronto recently used an AI tool to predict when a public beach will be safe. It went horribly awry. The developer claimed the tool achieved over 90% accuracy in predicting when beaches would be safe to swim in. But the tool did much worse: on a majority of the days when the...
Society's Backend
LLM-as-a-Judge, Instruction Pretraining, Solving Benchmarks Instead of Real-World ML Problems, and...
Weekly updates and resources 7/22/24
5 months ago
Artificial Ignorance
AI Zoology: Why are there so many animal models?
Inside the emerging model menagerie.
a year ago
AI Snake Oil
I set up a ChatGPT voice interface for my 3-year old. Here’s how it went.
Chatbots are likely to revive familiar debates about kids and apps
a year ago
Rozado’s Visual...
DepolarizingGPT
A Political Chatbot that Gives 3 Politically Diverse Answers to Every Prompt
a year ago
One Useful Thing
I hope you weren't getting too comfortable.
I just got access to the new Bing AI. My initial thoughts are that our assumptions about the limits...
a year ago
I just got access to the new Bing AI. My initial thoughts are that our assumptions about the limits of AI were wrong.
Matt Mazur
My Indie SaaS Revenue has Grown 37% per Year for 13 Years
Unlike many indie founders, I’ve never shared revenue numbers for Preceden, my SaaS timeline maker...
11 months ago
Unlike many indie founders, I’ve never shared revenue numbers for Preceden, my SaaS timeline maker tool. Even if they were remarkable – which they are not really – I just don’t think there are many good reasons to publicly share revenue numbers, and there are lots of downsides....
Matt Mazur
Experimenting with GPT-4 Turbo’s JSON mode
One of the many new features announced at yesterday’s OpenAI dev day is better support for...
a year ago
One of the many new features announced at yesterday’s OpenAI dev day is better support for generating valid JSON output. From the JSON mode docs: A common way to use Chat Completions is to instruct the model to always return JSON in some format that makes sense for your use case,...
Artificial Ignorance
AI Roundup 075: Levels of AGI
July 12, 2024.
5 months ago
Strange Loop Canon
AI embraces its product arc
fuzzy processors are entering mass production
7 months ago
Matt Mazur
Going Full Time on My SaaS After 13 Years
In January 2010 I soft-launched Preceden, a web-based timeline maker tool, followed a few...
over a year ago
In January 2010 I soft-launched Preceden, a web-based timeline maker tool, followed a few weeks later by a larger launch on HackerNews: Today – almost 13 years to the day since the initial launch – I’m going full time on it and I couldn’t be more excited. A brief history...
Artificial Ignorance
The subscriptionization of AI
Navigating the paid AI landscape.
9 months ago
Society's Backend
Bridging the Gap from Simple Algebra to Machine Learning
You probably know more about machine learning math than you think
11 months ago
Sam Altman
PG and Jessica
A lot of people want to replicate YC in some other industry or some other place or with some other...
over a year ago
A lot of people want to replicate YC in some other industry or some other place or with some other strategy. In general, people seem to assume that: 1) although there was some degree of mystery or luck about how YC got going, it can’t be that hard, and 2) if you can get it off...
Artificial Ignorance
How Intercom is transforming customer support with AI
A conversation with Fergal Reid, VP of AI at Intercom.
8 months ago
Society's Backend
New Evals for Better Models, AI Research Papers Made Easier to Understand, Train Your Own Flux LoRA,...
Machine learning highlights and resources 9-23-24
3 months ago
One Useful Thing
Signs and Portents
Some hints about what the next year of AI looks like
12 months ago
IEEE Spectrum
Forums, Competitions, Challenges: Inspiring Creativity in Robotics
This is a sponsored article brought to you by Khalifa University of Science and Technology.
A total...
3 months ago
A total of eight intense competitions to inspire creativity and innovation along with 13 forums dedicated to diverse segments of robotics and artificial intelligence will be part of the...
Sam Altman
Researchers and Founders
I spent many years working with founders and now I work with researchers.
Although there are always...
over a year ago
Although there are always individual exceptions, on average it’s surprising to me how different the best people in these groups are (including in some qualities that I had assumed were present in great...
Made by Ollin
Maple Diffusion
I ported Stable Diffusion to my phone
over a year ago
The Berkeley...
Asymmetric Certified Robustness via Feature-Convex Neural Networks
TLDR: We propose the asymmetric...
a year ago
TLDR: We propose the asymmetric certified robustness problem, which requires certified robustness for only one class and reflects real-world adversarial scenarios. This focused setting allows us to introduce...
Artificial Ignorance
AI Roundup 073: Music make you lose control
June 28, 2024.
6 months ago
The Berkeley...
Why do Policy Gradient Methods work so well in Cooperative MARL? Evidence from Policy Representation
In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy...
over a year ago
In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy gradient (PG) methods are typically believed to be less sample efficient than value decomposition (VD) methods, which are off-policy. However, some recent empirical studies demonstrate...
AI Snake Oil
AI scaling myths
Scaling will run out. The question is when.
6 months ago
Artificial Ignorance
AI Roundup 067: GPT-4o and Google I/O
May 17, 2024.
7 months ago
Rozado’s Visual...
Excess mortality during COVID pandemic, % of population vaccinated and Stringency Index of...
Please be mindful of potential confounds: population density, population behavior, virus virulence...
over a year ago
Please be mindful of potential confounds: population density, population behavior, virus virulence drift, herd immunity, etc.
One Useful Thing
15 Times to use AI, and 5 Not to
Notes on the Practical Wisdom of AI Use
3 weeks ago
Artificial Ignorance
Tutorial: How to make and share custom GPTs
They're not going to disrupt everything (yet), but they're a ton of fun.
a year ago
One Useful Thing
How to Get an AI to Lie to You in Three Simple Steps
I keep getting fooled by AI, and it seems like others are, too.
a year ago
Rozado’s Visual...
The Prevalence of Prejudice-Denoting Terms in Spanish Newspapers
Published paper Twitter summary Introduction Previous scholarly literature here and here has...
over a year ago
Published paper Twitter summary Introduction Previous scholarly literature here and here has documented a pronounced increase in the prevalence of prejudice-denoting terms in American news media content. Some have referred to this shift in journalistic discourse and related...
One Useful Thing
Freeing the chatbot
Intelligence, of a sort, is going to be all around us
8 months ago
Society's Backend
Why everyone loves Spider-Man
Lessons anyone can learn from our friendly neighborhood web-crawler
a year ago
Artificial Ignorance
AI Roundup 072: The new new Claude
June 21, 2024.
6 months ago
Society's Backend
I Beat Newsletter Fatigue With AI
And why direct forms of communication will always be super valuable
11 months ago
Daniel Miessler
Why Apple Keeps Winning
People are blown away that Apple keeps winning while its competitors are floundering. It’s a simple...
over a year ago
People are blown away that Apple keeps winning while its competitors are floundering. It’s a simple formula. Make consistently super-high-quality products that work together as part of an ecosystem. Google and Microsoft have 20X Apple’s losses in the last year. A staggering $3...
Andrej Karpathy blog
Biohacking Lite
Throughout my life I never paid too much attention to health, exercise, diet or nutrition. I knew...
over a year ago
Throughout my life I never paid too much attention to health, exercise, diet or nutrition. I knew that you’re supposed to get some exercise and eat vegetables or something, but it stopped at that (“mom said”-) level of abstraction. I also knew that I can probably get away with...
One Useful Thing
Thinking Like an AI
A little intuition can help
2 months ago
AI Snake Oil
Licensing is neither feasible nor effective for addressing AI risks
Non-proliferation only benefits incumbents
a year ago
Artificial Ignorance
The hidden side of Apple Intelligence
More than just another keynote recap.
6 months ago
fast.ai
AI Harms are Societal, Not Just Individual
In the west, our ideas of harm are largely anchored to an individual being harmed by a particular...
over a year ago
In the west, our ideas of harm are largely anchored to an individual being harmed by a particular action at a discrete moment in time. Yet the harms caused by algorithmic systems are often collective and communal.
Artificial Ignorance
AI Roundup 078: Voice mode
August 2, 2024.
5 months ago
Artificial Ignorance
GPT-4o and the illusion of AGI
Why speed and multimodality is becoming the name of the game.
7 months ago
Rozado’s Visual...
The Political Biases of GPT-4
Things are not always what they seem
a year ago
Marcus on AI
Where will AI be at the end of 2027? A bet
We, Gary Marcus, author, scientist, and noted AI skeptic, and Miles Brundage, an independent AI...
6 days ago
We, Gary Marcus, author, scientist, and noted AI skeptic, and Miles Brundage, an independent AI policy researcher who recently left OpenAI and is bullish on AI progress, have agreed to the following bet, at 10:1 odds, with criteria drawn from two earlier Substack essays by Gary...
One Useful Thing
On-boarding your AI Intern
There's a somewhat weird alien who wants to work for free for you. You should probably get started.
a year ago
Society's Backend
AI and Software Reading List 4: State of the Job Market, Apple's Private Cloud Compute Released for...
Society's Backend Reading List 10-28-2024
2 months ago
Artificial Ignorance
Why Claude 3 is a big upgrade
On Monday, Anthropic released Claude 3, a family of models that hit new highs on various benchmarks,...
10 months ago
On Monday, Anthropic released Claude 3, a family of models that hit new highs on various benchmarks, add multi-modal capabilities, and are a real competitor to GPT-4.
Marcus on AI
Humanity’s “Oh shit!” AI moment?
Not yet, but it could come sooner than you think. Not because we are close to AGI, but because we...
3 weeks ago
Not yet, but it could come sooner than you think. Not because we are close to AGI, but because we already have machines that can say one thing and do something else altogether.
Society's Backend
AI Reading List 2: Mathematical Limitations of LLMs, A SQL Roadmap for Data Science, and...
Society's Backend Reading List 10-14-2024
2 months ago
One Useful Thing
Innovation through prompting
Democratizing educational technology... and more
8 months ago
Society's Backend
Weekly Backend #4: 62 Total Resources
Apple’s LLM OpenELM, GPT-4-Turbo, Phi-3, Grok-1.5 Vision, and more
8 months ago
Daniel Miessler
Summary: Andrej Karpathy on Lex Fridman’s Podcast (Late 2022)
9/10 This is a summary of Andrej Karpathy’s appearance on Lex Fridman’s podcast in late 2022. My...
a year ago
9/10 This is a summary of Andrej Karpathy’s appearance on Lex Fridman’s podcast in late 2022. My One-Sentence Summary/Highlight The future of programming is not humans writing code, but neural nets creating weights. Capture Neural networks are mathematical expressions with many...
Rozado’s Visual...
Is climate change 100 times more threatening than gain-of-function research?
Mentions of climate change in news media vastly outnumber other existential threats
a year ago
Weighty Thoughts
Apple Wins
Apple Intelligence and aggregation puts Apple in a dominant AI position
6 months ago
Weighty Thoughts
Is AI a Winner-Take-All Market?
A critical question for investors in AI
4 months ago
Artificial Ignorance
AI Roundup 093: Diminishing returns
November 15, 2024.
a month ago
Daniel Miessler
NO. 362 | Dependency Scanner, Citrix Attacks, AI Analysis…
SECURITY NEWS Google released an open-source scanner for vulnerabilities in project dependencies....
over a year ago
SECURITY NEWS Google released an open-source scanner for vulnerabilities in project dependencies. It's a front-end to the OSV database that links a dependency list to its vulnerabilities. MORE The latest updates for Apple software fixed a new zero-day that could be used to hack...
IEEE Spectrum
Zen and the Art of Aibo Engineering
Sony’s team made that happen. And since Aibo’s debut, the company has sold more than 170,000 of...
3 weeks ago
Sony’s team made that happen. And since Aibo’s debut, the company has sold more than 170,000 of the cute little quadrupeds, a huge number considering their price of several thousand dollars each. From the start, Aibo could express a range of simulated emotions and learn through...
Andrej Karpathy blog
What a Deep Neural Network thinks about your #selfie
Convolutional Neural Networks are great: they recognize things, places and people in your personal...
over a year ago
Convolutional Neural Networks are great: they recognize things, places and people in your personal photos, signs, people and lights in self-driving cars, crops, forests and traffic in aerial imagery, various anomalies in medical images and all kinds of other useful things. But...
Weighty Thoughts
From 6 Weeks to 600 Seconds (or Less)
The revolutionary potential of utilizing AI for electronic hardware design
6 days ago
Made by Ollin
Acapella Extraction with ConvNets
A working prototype
over a year ago
Society's Backend
The Method Google Used to Reduce LLM Size by 66%
A brief overview of knowledge distillation and its capabilities
6 months ago
A brief overview of knowledge distillation and its capabilities
Daniel Miessler
News & Analysis | NO. 352
over a year ago
Matt Mazur
Running Mistral 8x7Bs Mixture of Experts on a Macbook
Below are the steps I used to get Mistral 8x7Bs Mixture of Experts (MOE) model running locally on my...
a year ago
Below are the steps I used to get Mistral 8x7Bs Mixture of Experts (MOE) model running locally on my Macbook (with its Apple M2 chip and 24 GB of memory). Here’s a great overview of the model for anyone interested in learning more. Short version: The Mistral “Mixtral” 8x7B 32k...
Society's Backend
Artificial Intelligence: A New Paradigm Emerges
A realistic perspective of the advantages and pitfalls of artificial intelligence
a year ago
A realistic perspective of the advantages and pitfalls of artificial intelligence
One Useful Thing
AI in organizations: Some tactics
Meet the Lab and the Crowd
3 months ago
Meet the Lab and the Crowd
One Useful Thing
Blinded by Analogies
What is this AI thing? The wrong model can lead us astray
a year ago
What is this AI thing? The wrong model can lead us astray
Artificial Ignorance
AI Roundup 059: In̶f̶l̶e̶c̶t̶i̶o̶n̶ Microsoft AI
March 22, 2024.
9 months ago
AI Snake Oil
Three Ideas for Regulating Generative AI
Policy input to the federal government from a Stanford-Princeton team
a year ago
Policy input to the federal government from a Stanford-Princeton team
AI Snake Oil
AI existential risk probabilities are too unreliable to inform policy
How speculation gets laundered through pseudo-quantification
5 months ago
How speculation gets laundered through pseudo-quantification
Artificial Ignorance
Tutorial: How to chat with your documents
A step-by-step guide to doing Q&A with your data, using LlamaIndex and OpenAI.
a year ago
A step-by-step guide to doing Q&A with your data, using LlamaIndex and OpenAI.
fast.ai
In defense of screen time
Pundits say my husband and I are parenting wrong.
2 months ago
Pundits say my husband and I are parenting wrong.
Andrej Karpathy blog
Self-driving as a case study for AGI
Sparked by progress in Large Language Models (LLMs), there’s a lot of chatter recently about AGI,...
11 months ago
Sparked by progress in Large Language Models (LLMs), there’s a lot of chatter recently about AGI, its timelines, and what it might look like. Some of it is hopeful and optimistic, but a lot of it is fearful and doomy, to put it mildly. Unfortunately, a lot of it is also very...
Rozado’s Visual...
Mentions of Political Extremism in English Wikipedia
A data-driven exploration uncovers disparities. Are they shaped by editorial choices or broader...
3 weeks ago
A data-driven exploration uncovers disparities. Are they shaped by editorial choices or broader societal/historical dynamics?
Artificial Ignorance
The intern and the coach
A recent post from Wharton professor Ethan Mollick shared an impressive new study. In a nutshell, it...
a year ago
A recent post from Wharton professor Ethan Mollick shared an impressive new study. In a nutshell, it found that BCG consultants who used GPT-4 were up to 43% more effective at tasks vs. employees who didn’t.
Artificial Ignorance
AI Roundup 076: Grand theft audio
July 19, 2024.
5 months ago
Artificial Ignorance
AI and the workplace
How employees and CEOs alike can plan for the future.
8 months ago
How employees and CEOs alike can plan for the future.
Sam Altman
Keep the Internet Open
The FCC has announced plans to roll back policies on net neutrality, and its new head has indicated...
over a year ago
The FCC has announced plans to roll back policies on net neutrality, and its new head has indicated he has no plan to stop soon.
The internet is a public good, and I believe access should be a basic right. We've seen such great innovation in software because the internet has...
Society's Backend
What Apple Intelligence Means for You
"We think you're gonna LOVE it"
6 months ago
"We think you're gonna LOVE it"
fast.ai
A new old kind of R&D lab
Answer.AI is a new kind of AI R&D lab which creates practical end-user products based on...
a year ago
Answer.AI is a new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughs.
Society's Backend
The Unfortunate Truth Regarding AI Regulation
And the impact it'll have for decades to come
7 months ago
And the impact it'll have for decades to come
Marcus on AI
Cognitive scientist Gary Marcus says AI must be regulated. He has a plan.
Fantastic writeup of my views today at The Wall Street Journal:
a month ago
Fantastic writeup of my views today at The Wall Street Journal:
Sam Altman
Tech Workers' Values
For good and bad, technology has become a central force in all our lives.
As members of the...
over a year ago
For good and bad, technology has become a central force in all our lives.
As members of the community, we're interested in ways in which tech companies can use their collective power to protect privacy, rule of law, freedom of expression, and other fundamental American rights....
Rozado’s Visual...
The Increasing Negativity and Emotionality of News Media Headlines
Published article Introduction I have recently published a paper where we describe a chronological...
over a year ago
Published article Introduction I have recently published a paper where we describe a chronological (2000–2019) analysis of sentiment and emotion in 23 million headlines from 47 news media outlets popular in the United States. We used Transformer language models fine-tuned for...
One Useful Thing
How to... use AI to unstick yourself
We often lose momentum because of something small. AI can help.
a year ago
We often lose momentum because of something small. AI can help.
Sam Altman
The Virus
Although I still hope things will go differently, the experts I’ve spoken to think we are likely to...
over a year ago
Although I still hope things will go differently, the experts I’ve spoken to think we are likely to face a global tragedy—hundreds of thousands of deaths from Covid-19.
I hope that society views this as a warning for the future. Covid-19 is bad, but only a warm-up. I think it’s...
One Useful Thing
What Can be Done in 59 Seconds: An Opportunity (and a Crisis)
Five analytical tasks in under a minute
11 months ago
Five analytical tasks in under a minute
AI Snake Oil
ChatGPT is a bullshit generator. But it can still be amazingly useful
The philosopher Harry Frankfurt defined bullshit as speech that is intended to persuade without...
over a year ago
The philosopher Harry Frankfurt defined bullshit as speech that is intended to persuade without regard for the truth. By this measure, OpenAI’s new chatbot ChatGPT is the greatest bullshitter ever. Large Language Models (LLMs) are trained to produce
Artificial Ignorance
AI Roundup 065: The gpt2-chatbot mystery
May 3, 2024.
8 months ago
Society's Backend
Devin Has Exposed a Major Issue with Software Engineering
And it isn't that we're all going to lose our jobs
9 months ago
And it isn't that we're all going to lose our jobs
Weighty Thoughts
Writing, Originality, and Why Does Anyone Care What I Write About?
Musings, Insecurities, and Thoughts on Writing a Book (or Substack)
a week ago
Musings, Insecurities, and Thoughts on Writing a Book (or Substack)
Artificial Ignorance
Has YC hit peak AI? (F24)
The latest batch is up to 86% AI startups.
a month ago
The latest batch is up to 86% AI startups.
One Useful Thing
Secret Cyborgs: The Present Disruption in Three Papers
The future is already here, we just need to figure out a few details.
a year ago
The future is already here, we just need to figure out a few details.
Frank’s Ramblings
My Experience Living and Working in China, Part I
In this four-part article, I’ll go over some of the lessons I learned living and doing business in...
over a year ago
In this four-part article, I’ll go over some of the lessons I learned living and doing business in China’s tech industry. During my time in China, I’ve led a team of 10+ engineers to develop a location-based IoT and sensing platform, co-founded an open-source project called...
Daniel Miessler
NO. 367 | Hive Ransom, Anti-Google, Software 2.0…
🎙️If you’re not subscribed to the podcast version of the newsletter, please add it using your...
a year ago
🎙️If you’re not subscribed to the podcast version of the newsletter, please add it using your favorite client! APPLE | SPOTIFY | OTHER SECURITY NEWS The FBI infiltrated the HIVE ransomware group, stopping over $130 million in ransomware attacks. HIVE is known for going...
Rozado’s Visual...
The decreasing/increasing prevalence of the terms "global warming" and "climate change" in news...
I will drop this here as a simple curiosity without much further comment: The decreasing/increasing...
over a year ago
I will drop this here as a simple curiosity without much further comment: The decreasing/increasing prevalence of the terms global warming and climate change in news media discourse. For some reason, peak usage of the term global warming happened in 2007 and it has been dropping...
Daniel Miessler
Frontview Mirror: 2023 Edition
This is member content. Thank you for being a subscriber. This is UL Member Content Subscribe...
over a year ago
This is member content. Thank you for being a subscriber. This is UL Member Content Subscribe Already a member? Login
Stories by Andrej...
A Peek at Trends in Machine Learning
over a year ago
AI Snake Oil
Are open foundation models actually more risky than closed ones?
A policy brief on open foundation models
a year ago
A policy brief on open foundation models
Society's Backend
Machine Learning Infrastructure: The Bridge Between Software Engineering and AI
What makes machine learning infra so important and why I find it so interesting
a year ago
What makes machine learning infra so important and why I find it so interesting
IEEE Spectrum
How Amazon Is Changing the Future of Robotics and Logistics
This is a sponsored article brought to you by Amazon.
“Innovation doesn’t just happen because you...
2 weeks ago
This is a sponsored article brought to you by Amazon.
“Innovation doesn’t just happen because you have a good idea,” said Valerie Samzun, a leader in Amazon’s Fulfillment Technologies and Robotics (FTR) division. “It happens because you have the right team, the right...
The Gradient
The Artificiality of Alignment
This essay first appeared in Reboot.
Credulous, breathless coverage of “AI existential risk”...
a year ago
This essay first appeared in Reboot.
Credulous, breathless coverage of “AI existential risk” (abbreviated “x-risk”) has reached the mainstream. Who could have foreseen that the smallcaps onomatopoeia “ꜰᴏᴏᴍ” — both evocative of and directly derived from children’s cartoons —
Society's Backend
What you need to understand about LLM creativity
A simple overview of temperature and its effect on LLM output
8 months ago
A simple overview of temperature and its effect on LLM output
The Berkeley...
The Berkeley Crossword Solver
We recently published the Berkeley Crossword Solver (BCS), the current state of the art for solving...
over a year ago
We recently published the Berkeley Crossword Solver (BCS), the current state of the art for solving American-style crossword puzzles. The BCS combines neural question answering and probabilistic inference to achieve near-perfect performance on most American-style crossword...
The Gradient
What's Missing From LLM Chatbots: A Sense of Purpose
LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly...
3 months ago
LLM-based chatbots’ capabilities have been advancing every month. These improvements are mostly measured by benchmarks like MMLU, HumanEval, and MATH (e.g. sonnet 3.5, gpt-4o). However, as these measures get more and more saturated, is user experience increasing in proportion to...
One Useful Thing
Superhuman?
What does it mean for AI to be better than a human? And how can we tell?
7 months ago
What does it mean for AI to be better than a human? And how can we tell?
Artificial Ignorance
AI Roundup 045: Google's just getting started
December 15, 2023.
a year ago
Weighty Thoughts
Today’s AI critics don’t understand the history of technology
But is AI different than other technologies?
10 months ago
But is AI different than other technologies?
AI Snake Oil
Evaluating LLMs is a minefield
Annotated slides from a recent talk
a year ago
Annotated slides from a recent talk
Matt Mazur
When LTD Purchasers Meet an Inactive User Policy
Last year I participated in a Lifetime Deal (LTD) promotion to offer Preceden to the AppSumo...
a year ago
Last year I participated in a Lifetime Deal (LTD) promotion to offer Preceden to the AppSumo community. Maybe I’ll dive into my experience there in another post, but I wanted to share an interesting thing that’s happening now, a year after the deal ended. AppSumo has a policy...
Daniel Miessler
NO. 366 | T-Breach, Siri++, Conception Ages…
🎙️If you're not subscribed to the podcast version of the newsletter, please add it using your...
a year ago
🎙️If you're not subscribed to the podcast version of the newsletter, please add it using your favorite client! APPLE | SPOTIFY | OTHER SECURITY NEWS Another T-Mobile Breach T-Mobile has had another security breach, this one affecting at least 37 million accounts. They...
IEEE Spectrum
In 2025, People Will Try Living in This Underwater Habitat
The future of human habitation in the sea is taking shape in an abandoned quarry on the border of...
5 days ago
The future of human habitation in the sea is taking shape in an abandoned quarry on the border of Wales and England. There, the ocean-exploration organization Deep has embarked on a multiyear quest to enable scientists to live on the seafloor at depths up to 200 meters for weeks,...
Matt Mazur
Introducing Preceden’s new AI-Powered Timeline Generator
For the past few months I’ve been heads down building an AI-powered timeline generator tool for...
a year ago
For the past few months I’ve been heads down building an AI-powered timeline generator tool for Preceden, my SaaS timeline maker software: The tool – which is free to use and available on Preceden’s homepage – lets you type in a topic or detailed description of a timeline and it...
The Berkeley...
How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark
When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could...
4 months ago
When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could jailbreak frontier LLMs simply by translating forbidden prompts into obscure languages. Excited by this result, we attempted to reproduce it and found something unexpected.
The...
IEEE Spectrum
SwitchBot S10 Review: “This Is the Future of Home Robots”
I’ve been reviewing robot vacuums for more than a decade, and robot mops for just as long. It’s been...
3 months ago
I’ve been reviewing robot vacuums for more than a decade, and robot mops for just as long. It’s been astonishing how the technology has evolved, from the original iRobot Roomba bouncing off of walls and furniture to robots that use lidar and vision to map your entire house and...
IEEE Spectrum
Video Friday: Multiple MagicBots
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
4 weeks ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion.
Humanoids Summit: 11–12 December...
AI Snake Oil
Is Avoiding Extinction from AI Really an Urgent Priority?
The history of technology suggests that the greatest risks come not from the tech, but from the...
a year ago
The history of technology suggests that the greatest risks come not from the tech, but from the people who control it
Rozado’s Visual...
Sentiment Associations of Politically Loaded Terms in News Media
Summary of manuscript “Using Word Embeddings to Probe Sentiment Associations of Politically Loaded...
over a year ago
Summary of manuscript “Using Word Embeddings to Probe Sentiment Associations of Politically Loaded Terms in News and Opinion Articles from News Media Outlets”
Society's Backend
3 Key Principles for AI at Scale [Part 2]
The key to how large AI companies outcompete
2 days ago
The key to how large AI companies outcompete
Sam Altman
The United Slate
I would like to find and support a slate of candidates for the 2018 California elections, and also...
over a year ago
I would like to find and support a slate of candidates for the 2018 California elections, and also to find someone to run a ballot initiative focused on affordable housing in the state. A team of aligned people has a chance to make a real change.
I believe in creating prosperity...
Weighty Thoughts
When the AI Bubble Bursts
It’s when, not if, for these kinds of new technologies
6 months ago
It’s when, not if, for these kinds of new technologies
fast.ai
Can LLMs learn from a single example?
We’ve noticed an unusual training pattern in fine-tuning LLMs. At first we thought it’s a bug, but...
a year ago
We’ve noticed an unusual training pattern in fine-tuning LLMs. At first we thought it’s a bug, but now we think it shows LLMs can learn effectively from a single example.
Matt Mazur
AOL Underground Podcast Interview about AOL-Files.com
Back in 1998 when I was 13 years old I got heavily involved in the AOL hacking scene, originally...
over a year ago
Back in 1998 when I was 13 years old I got heavily involved in the AOL hacking scene, originally building add-on software called progs (Revolution, Meridian), publishing code libraries called bas files (Alpha32), and later co-founding AOL-Files.com (where I went by the hacker...
AI Snake Oil
OpenAI’s policies hinder reproducible research on language models
LLMs have become privately-controlled research infrastructure
a year ago
LLMs have become privately-controlled research infrastructure
Society's Backend
Meta's New Segmentation Model, A New Open-Source Image Generation Model, Apple Intelligence Model...
Machine learning resources and updates 8/5/2024
5 months ago
Machine learning resources and updates 8/5/2024
IEEE Spectrum
It's Surprisingly Easy to Jailbreak LLM-Driven Robots
Large language models (LLMs) have exploded in popularity, leading a number of companies to explore...
a month ago
Large language models (LLMs) have exploded in popularity, leading a number of companies to explore LLM-driven robots. However, a new study now reveals an automated way to hack into such machines with 100 percent success. By circumventing safety guardrails, researchers could...
One Useful Thing
An Opinionated Guide to Which AI to Use: ChatGPT Anniversary Edition
A simple answer, and then a less simple one.
a year ago
A simple answer, and then a less simple one.
Society's Backend
5 Highlights From Society's Backend in 2024
The top 5 articles and resources
4 days ago
The top 5 articles and resources
Marcus on AI
Could 2025 see the largest cyberattack in history?
In a just-published series of very brief essays called “The Incredible, World-Altering ‘Black Swan’...
2 days ago
In a just-published series of very brief essays called “The Incredible, World-Altering ‘Black Swan’ Events That Could Upend Life in 2025”, Politico asked “15 futurists, foreign policy analysts and other prognosticators”, including me, “to provide some explosive potential...
Artificial Ignorance
OpenAI's o1 is a misunderstood model
Are the latest "reasoning" breakthroughs all they're hyped up to be?
3 months ago
Are the latest "reasoning" breakthroughs all they're hyped up to be?
One Useful Thing
Four Singularities for Research
The rise of AI is creating both crisis and opportunity
7 months ago
The rise of AI is creating both crisis and opportunity
IEEE Spectrum
This Mobile 3D Printer Can Print Directly on Your Floor
Waiting for each part of a 3D-printed project to finish, taking it out of the printer, and then...
a month ago
Waiting for each part of a 3D-printed project to finish, taking it out of the printer, and then installing it on location can be tedious for multi-part projects. What if there was a way for your printer to print its creation exactly where you needed it? That’s the promise of...
Rozado’s Visual...
The Increasing Prominence of Prejudice and Social Justice Rhetoric in UK News Media
I have recently published a report with Matthew Goodwin about the increasing prominence of prejudice...
over a year ago
I have recently published a report with Matthew Goodwin about the increasing prominence of prejudice and social justice rhetoric in UK news media. Recent years have seen considerable debate about the rise of political polarization in British society. Specifically, over the last...
Artificial Ignorance
AI Roundup 068: The ScarJo thing
May 24, 2024.
7 months ago
Artificial Ignorance
AI Roundup 097: Model Mayhem
December 13, 2024.
3 weeks ago
Weighty Thoughts
Deep Tech Startups Are Not Software Startups
Or why AI and “Software/SaaS” are apples and oranges
11 months ago
Or why AI and “Software/SaaS” are apples and oranges
AI Snake Oil
A misleading open letter about sci-fi AI dangers ignores the real risks
Misinformation, labor impact, and safety are all risks. But not in the way the letter implies.
a year ago
Misinformation, labor impact, and safety are all risks. But not in the way the letter implies.
Daniel Miessler
Podcast Audio Quality: AI-based Post-processing vs. Hardware
I’ve been podcasting since 2015 and got really into audio when the plague started. Like…too much....
over a year ago
I’ve been podcasting since 2015 and got really into audio when the plague started. Like…too much. Anyway. I’ve been obsessed with podcast audio quality for years, and have been through so…many…iterations of my setup. I started with a Yeti (still a great mic). Did the...
The Berkeley...
Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination
Sample language model responses to different varieties of English and native speaker...
3 months ago
Sample language model responses to different varieties of English and native speaker reactions.
ChatGPT does amazingly well at communicating with people in English. But whose English?
Only 15% of ChatGPT users are from the US, where Standard American English is the default. But...
Artificial Ignorance
How to talk to your family about AI this Thanksgiving
A handy guide for your uncle's burning questions.
a year ago
A handy guide for your uncle's burning questions.
Artificial Ignorance
Lies, damned lies, and benchmarks
While benchmarks (and leaderboards) are useful tools, they are but a small facet when it comes to...
11 months ago
While benchmarks (and leaderboards) are useful tools, they are but a small facet when it comes to evaluating large language models. Often, they're not the best indicators of real-world utility - and I want to dig into why (and what other approaches exist).
fast.ai
The Jupyter+git problem is now solved
Previously, using git with Jupyter could create conflicts and break notebooks. With nbdev2, the...
over a year ago
Previously, using git with Jupyter could create conflicts and break notebooks. With nbdev2, the problem has been totally solved.
Artificial Ignorance
Distributing the future
A reminder that things take time.
7 months ago
A reminder that things take time.
Weighty Thoughts
One Definite Sign of a Bad Startup Idea
If you can’t tell anyone about your startup idea, it's not a good idea
over a year ago
If you can’t tell anyone about your startup idea, it's not a good idea
AI Snake Oil
A safe harbor for AI evaluation and red teaming
An argument for legal and technical safe harbors for AI safety and trustworthiness research
10 months ago
An argument for legal and technical safe harbors for AI safety and trustworthiness research
Matt Mazur
LearnGPT is for sale. Contact me if you’re interested.
On Friday I announced that I intended to shut down LearnGPT to focus on Preceden, my main business....
a year ago
On Friday I announced that I intended to shut down LearnGPT to focus on Preceden, my main business. I didn’t plan to sell LearnGPT because I didn’t think a month-old, pre-revenue project like this would be able to sell for enough to warrant going through a sale. It’s been three...
AI Snake Oil
Eighteen pitfalls to beware of in AI journalism
A checklist for avoiding hype
over a year ago
A checklist for avoiding hype
Marcus on AI
Sora still appears to have trouble with physics
Exactly as I warned in February
3 weeks ago
Exactly as I warned in February
The Berkeley...
Koala: A Dialogue Model for Academic Research
In this post, we introduce Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data...
a year ago
In this post, we introduce Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data gathered from the web. We describe the dataset curation and training process of our model, and also present the results of a user study that compares our model to ChatGPT and...
Marcus on AI
ChatGPT, at Age Two
The bullshit just keeps on coming.
a month ago
The bullshit just keeps on coming.
Society's Backend
Allen AI and DeepSeek are Taking Off, Professional Advice for Working in AI, Agentic Web Design, and...
Society's Backend Reading List 12-2-2024
a month ago
Society's Backend Reading List 12-2-2024
Society's Backend
Welcome to 2024: The Year Where AI is No Longer an Option
Why everyone should learn about machine learning
a year ago
Why everyone should learn about machine learning
IEEE Spectrum
Video Friday: Extreme Off-Road
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
a month ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion.
Humanoids 2024: 22–24 November...
Artificial Ignorance
Looking a gift llama in the mouth
How Llama 3.1 uniquely leverages Meta's business model (and why we should be a little bit cynical...
5 months ago
How Llama 3.1 uniquely leverages Meta's business model (and why we should be a little bit cynical about it)
Rozado’s Visual...
The political preferences of Grok
Elon Musk's response to ChatGPT
a year ago
Elon Musk's response to ChatGPT
The Berkeley...
Rethinking the Role of PPO in RLHF
Rethinking the Role of PPO in RLHF
TL;DR: In RLHF, there’s tension between the reward learning...
a year ago
Rethinking the Role of PPO in RLHF
TL;DR: In RLHF, there’s tension between the reward learning phase, which uses human preference in the form of comparisons, and the RL fine-tuning phase, which optimizes a single, non-comparative reward. What if we performed RL in a comparative...
fast.ai
A New Chapter for fast.ai: How To Solve It With Code
fast.ai is joining Answer.AI, and we’re announcing a new kind of educational experience, ‘How To...
a month ago
fast.ai is joining Answer.AI, and we’re announcing a new kind of educational experience, ‘How To Solve It With Code’
Society's Backend
If You Understand Bananas, You Can Understand Machine Learning
A simplified high-level overview of primary machine learning algorithms for anyone to understand
11 months ago
A simplified high-level overview of primary machine learning algorithms for anyone to understand
One Useful Thing
What just happened
A transformative month rewrites the capabilities of AI
2 weeks ago
A transformative month rewrites the capabilities of AI
Artificial Ignorance
How Shopify is making AI Magic
A case study on turning competitive advantages into useful AI.
a year ago
A case study on turning competitive advantages into useful AI.
IEEE Spectrum
Remote Sub Sustains Science Kilometers Underwater
The water column is hazy as an unusual remotely operated vehicle glides over the seafloor in search...
2 months ago
The water column is hazy as an unusual remotely operated vehicle glides over the seafloor in search of a delicate tilt meter deployed three years ago off the west side of Vancouver Island. The sensor measures shaking and shifting in continental plates that will eventually unleash...
Matt Mazur
“Monthly Billed Annually” is Cursed Copy
There was a great discussion on Twitter recently that began with Daniel Vassallo calling out a SaaS...
a year ago
There was a great discussion on Twitter recently that began with Daniel Vassallo calling out a SaaS for not refunding an accidental annual payment he made on their service. He intended to purchase the monthly plan, but due to an unclear UI and poor copy, he unintentionally...
One Useful Thing
AI is not good software. It is pretty good people.
A pragmatic approach to thinking about AI
a year ago
A pragmatic approach to thinking about AI
Society's Backend
Backend Biweekly #1: Important AI Developments and ML Learning Resources
Updates on Apple, Mistral, and Microsoft and resources to build the GPT tokenizer, use MLX to train...
10 months ago
Updates on Apple, Mistral, and Microsoft and resources to build the GPT tokenizer, use MLX to train models, and benchmark LLMs
Rozado’s Visual...
Is The Great Awokening Really Winding Down? Part II: Evidence from News Media
Others (here, here and here) have argued previously that The Great Awokening might be winding down....
a year ago
Others (here, here and here) have argued previously that The Great Awokening might be winding down. I have shown before some preliminary evidence from Twitter content about how Great Awokening terminology with negative connotations are indeed down but those with positive...
Society's Backend
Always Be Networking
The things you should know about effective networking
a year ago
The things you should know about effective networking
Society's Backend
Updates to Society's Backend
New benefits for paid subscribers, support Society's Backend for just $1/mo, a referral program, and...
10 months ago
New benefits for paid subscribers, support Society's Backend for just $1/mo, a referral program, and more
Artificial Ignorance
AI's massive cash needs are Big Tech's chance to own the future
Over the past year, AI startups have raised some impressive amounts of money. OpenAI raised $10...
11 months ago
Over the past year, AI startups have raised some impressive amounts of money. OpenAI raised $10 billion, Anthropic did $6 billion, Inflection AI raised $1.3 billion, and dozens of companies closed rounds in the hundreds of millions.
Frank’s Ramblings
A Gentle Introduction to Vector Databases
Update: An earlier version of this post was cross-published to the Zilliz learning center, Medium,...
over a year ago
Update: An earlier version of this post was cross-published to the Zilliz learning center, Medium, and DZone.
If you have any feedback, feel free to connect with me on Twitter or Linkedin. If you enjoyed this post and want to learn a bit more about vector databases and embeddings...
Artificial Ignorance
The SearchGPT Paradigm
What people (still) fundamentally misunderstand about AI search.
5 months ago
What people (still) fundamentally misunderstand about AI search.
Artificial Ignorance
AI Roundup 052: AI, EO, DPA
February 2, 2024.
11 months ago
Society's Backend
Ask Stupid Questions
I’m convinced that after having a fourth or fifth child exhaustion kicks in and a person’s long-term...
a year ago
I’m convinced that after having a fourth or fifth child exhaustion kicks in and a person’s long-term memory ceases to function properly. I’ve become victim to this and I’ve started taking very detailed notes during meetings. If my brain won’t store the information, something else...
Marcus on AI
Satya Nadella and the three stages of scientific truth
A textbook case in how good ideas are often initially throttled
a month ago
A textbook case in how good ideas are often initially throttled
AI Snake Oil
How Transparent Are Foundation Model Developers?
Introducing the Foundation Model Transparency Index
a year ago
Introducing the Foundation Model Transparency Index
fast.ai
My family’s unlikely homeschooling journey
Prior to 2020, we never expected to homeschool, and now we have committed to it long-term.
over a year ago
Prior to 2020, we never expected to homeschool, and now we have committed to it long-term.
Matt Mazur
Indie Hacking Week 1 Recap: Starting TimelineGPT, Ending LearnGPT
Today marks the end of my first week of full-time indie hacking. I feel like I’m getting in a good...
a year ago
Today marks the end of my first week of full-time indie hacking. I feel like I’m getting in a good groove as far as my daily routine, but I don’t think it’s quite sunk in yet how much flexibility I have in terms of my daily schedule. For example, I’m still waking up early to […]
Daniel Miessler
Scott Kuffer of Nucleus Security | SPONSORED INTERVIEW SERIES
In this standalone episode we’re doing a sponsored interview with Scott Kuffer, co-founder and COO...
over a year ago
In this standalone episode we’re doing a sponsored interview with Scott Kuffer, co-founder and COO of Nucleus Security. I was already excited by this vendor just based on the research I did to allow them to be a sponsor, but the conversation with them really made me think they’re...
AI Snake Oil
Model alignment protects against accidental harms, not intentional ones
The hand wringing about failures of model alignment is misguided
a year ago
The hand wringing about failures of model alignment is misguided
Daniel Miessler
NO. 357 | NEWS, ANALYSIS, & DISCOVERY SERIES
SECURITY NEWS Attackers have dumped nearly 8 million Australian health records on the dark web after...
over a year ago
SECURITY NEWS Attackers have dumped nearly 8 million Australian health records on the dark web after breaching a health insurance company with almost 10 million customers. MORE NSA has released guidance asking companies to switch to memory-safe languages like Rust, C#, Go, and...
Society's Backend
A Fundamental Overview of Machine Learning Experimentation [Part 1]
And how it differs from the software development you're familiar with
a month ago
And how it differs from the software development you're familiar with
Society's Backend
Know Your Benchmarks
How the Chatbot Arena leaderboard for LLMs works and why it’s important to understand
8 months ago
How the Chatbot Arena leaderboard for LLMs works and why it’s important to understand
AI Snake Oil
Artists can now opt out of generative AI. It’s not enough.
Opting out is the latest example of generative AI developers externalizing costs.
a year ago
Opting out is the latest example of generative AI developers externalizing costs.
Society's Backend
AI Reading List 1: Understand Transformers, Reflection-70B Update, and LLMs Still Cannot Reason
Society's Backend Reading List 10-07-2024
2 months ago
Society's Backend Reading List 10-07-2024
Made by Ollin
Priors for Autonomous Vehicle Development
over a year ago
Strange Loop Canon
Mind the Gap
from empire to umpire
a year ago
fast.ai
Practical Deep Learning for Coders 2022
A complete from-scratch rewrite of fast.ai’s most popular course, that’s been 2 years in the making.
over a year ago
A complete from-scratch rewrite of fast.ai’s most popular course, that’s been 2 years in the making.
Society's Backend
Backend Biweekly #3: 97 Updates and Resources
Huge Nvidia Updates, Gemini Hackathon for Money, and more
9 months ago
Huge Nvidia Updates, Gemini Hackathon for Money, and more
Andrej Karpathy blog
A from-scratch tour of Bitcoin in Python
over a year ago
Society's Backend
Stop Obsessing Over the Product and Start Thinking About the Bigger Picture
The actual takeaways from the iPhone 15 event
a year ago
The actual takeaways from the iPhone 15 event
The Gradient
An Introduction to the Problems of AI Consciousness
Once considered a forbidden topic in the AI community, discussions around the concept of AI...
a year ago
Once considered a forbidden topic in the AI community, discussions around the concept of AI consciousness are now taking center stage, marking a significant shift since the current AI resurgence began over a decade ago.
Rozado’s Visual...
An Analysis of AI Political Preferences from a European Perspective
Introduction
2 months ago
Weighty Thoughts
Who’s Winning the AI War?
All of us, except the AI startups and VCs—unless a real war breaks out
11 months ago
All of us, except the AI startups and VCs—unless a real war breaks out
One Useful Thing
What just happened, what is happening next
The tasks AI can do well are expanding rapidly
9 months ago
The tasks AI can do well are expanding rapidly
IEEE Spectrum
Boston Dynamics and Toyota Research Team Up on Robots
Today, Boston Dynamics and the Toyota Research Institute (TRI) announced a new partnership “to...
2 months ago
Today, Boston Dynamics and the Toyota Research Institute (TRI) announced a new partnership “to accelerate the development of general-purpose humanoid robots utilizing TRI’s Large Behavior Models and Boston Dynamics’ Atlas robot.” Committing to working towards a general purpose...
Daniel Miessler
Unsupervised Learning NO. 365 | China’s Decline, MicrosoftAI, Creativity Ratio…
🎙️ If you're not subscribed to the podcast version of the newsletter, please add it using your...
a year ago
🎙️ If you're not subscribed to the podcast version of the newsletter, please add it using your favorite client! APPLE | SPOTIFY | OTHER SECURITY NEWS NYC Surveillance Amnesty International has revealed new research showing that the NYPD has over 15,000 cameras that can do...
Society's Backend
All Machine Learning Resources and Updates 7/8/24
Everything I've been reading
6 months ago
Everything I've been reading
The Berkeley...
Training Diffusion Models with Reinforcement Learning
a year ago
The Berkeley...
Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!
Humans excel at processing vast arrays of visual information, a skill that is crucial for achieving...
5 months ago
Humans excel at processing vast arrays of visual information, a skill that is crucial for achieving artificial general intelligence (AGI). Over the decades, AI researchers have developed Visual Question Answering (VQA) systems to interpret scenes within single images and answer...
One Useful Thing
Everyone is above average
Is AI a Leveler, King Maker, or Escalator?
a year ago
Is AI a Leveler, King Maker, or Escalator?
Matt Mazur
Preceden’s Spam Problem
Around a year ago, I started noticing some spammy timelines being created on Preceden, my SaaS...
a year ago
Around a year ago, I started noticing some spammy timelines being created on Preceden, my SaaS timeline maker tool. I’m honestly surprised it took spammers so long: Preceden is a freemium product (meaning people can sign up and try it for free), the product makes it very easy to...
Weighty Thoughts
The Real Risks of AI
Humans are really the ones to be scared of
8 months ago
Humans are really the ones to be scared of
Weighty Thoughts
Most AI startups are doomed
Just because it matters doesn’t mean it’s defensible or profitable
a year ago
Just because it matters doesn’t mean it’s defensible or profitable
Artificial Ignorance
Why now?
What's behind our current AI boom?
a year ago
What's behind our current AI boom?
Society's Backend
OpenAI's Strawberry May Enhance AI Reasoning, Optimization of Vision Language Models, Info on...
Weekly updates and resources 7/15/24
5 months ago
Weekly updates and resources 7/15/24
Rozado’s Visual...
The Political Biases of Google Bard
It is probably only a matter of time until a nation state purposely builds a biased AI system...
a year ago
It is probably only a matter of time until a nation state purposely builds a biased AI system designed to advance government interests
One Useful Thing
On the necessity of a sin
Why treating AI like a person is the future
9 months ago
Why treating AI like a person is the future
Society's Backend
Weekly Backend #7: 39 Resources and Updates
GPT-4o, Google I/O, Fugaku LLM, Prep for Machine Learning Interviews, and more
7 months ago
GPT-4o, Google I/O, Fugaku LLM, Prep for Machine Learning Interviews, and more
IEEE Spectrum
How a Robot Is Grabbing Fuel From a Fukushima Reactor
Thirteen years since a massive earthquake and tsunami struck the Fukushima Dai-ichi nuclear power...
2 months ago
Thirteen years since a massive earthquake and tsunami struck the Fukushima Dai-ichi nuclear power plant in northern Japan, causing a loss of power, meltdowns and a major release of radioactive material, operator Tokyo Electric Power Co. (TEPCO) finally seems to be close to...
Society's Backend
Open Models Catching Up, SearchGPT Release, Deepfake Victims Protected by Regulation, and More
Weekly updates and resources 7/29/24
5 months ago
Weekly updates and resources 7/29/24
Daniel Miessler
AI Art Will Push the Top 1% to Human Artists
One effect I think we’ll see from all this AI-generated art is magnified status for those who insist...
over a year ago
One effect I think we’ll see from all this AI-generated art is magnified status for those who insist on the opposite, i.e., manual, human art. The more manual the better. The more human the better. Ideally there’d only be one of whatever you have, and it’d only be yours. Why is...
Rozado’s Visual...
Northern Awokening: Social-justice and prejudice-signifying language in Canadian news media
I have recently published a report with Aaron Wudrick from the Macdonald-Laurier Institute about...
a year ago
I have recently published a report with Aaron Wudrick from the Macdonald-Laurier Institute about changes in the language that the news media in Canada use. I have documented previously how in American news media mentions of terms that signify distinct forms of prejudice have...
Rozado’s Visual...
Term frequency in Spain's most-read general-interest newspaper
Term frequency in Spain's most-read general-interest newspaper: El País Twitter post...
over a year ago
Term frequency in Spain's most-read general-interest newspaper: El País. Twitter post. Verification of integrity of frequency counts: https://zenodo.org/record/5674590
Strange Loop Canon
Evaluations are all we need
On analysing talent in LLMs
11 months ago
On analysing talent in LLMs
Daniel Miessler
AI is About to Feel Like AGI, and You Need to Get Ready
I just wrote a piece similar to this last week, but this one drives the point home even more....
over a year ago
I just wrote a piece similar to this last week, but this one drives the point home even more. Basically, the current trajectory of AI, with all the art generation, the language models, etc., are about to become a whole lot more instruction and response based. What does that mean?...
IEEE Spectrum
Robot Photographer Takes the Perfect Picture
Finding it hard to get the perfect angle for your shot? PhotoBot can take the picture for you. Tell...
a month ago
Finding it hard to get the perfect angle for your shot? PhotoBot can take the picture for you. Tell it what you want the photo to look like, and your robot photographer will present you with references to mimic. Pick your favorite, and PhotoBot—a robot arm with a camera—will...
Marcus on AI
25 AI Predictions for 2025, from Marcus on AI
With a review of last year’s predictions
4 days ago
With a review of last year’s predictions
One Useful Thing
How to Use AI to Do Stuff: An Opinionated Guide
Covering the state of play as of Summer, 2023
a year ago
Covering the state of play as of Summer, 2023
IEEE Spectrum
Video Friday: ICRA Turns 40
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE...
3 months ago
Video Friday is your weekly selection of awesome robotics videos, collected by your friends at IEEE Spectrum robotics. We also post a weekly calendar of upcoming robotics events for the next few months. Please send us your events for inclusion.
IROS 2024: 14–18 October 2024, ABU...
Weighty Thoughts
Commentary on Technology, Startups, and Investing
Welcome to Weighty Thoughts by me, James Wang. General Partner at Creative Ventures. Ex-Bridgewater...
over a year ago
Welcome to Weighty Thoughts by me, James Wang. General Partner at Creative Ventures. Ex-Bridgewater and Google X. Co-Founder Lioness. Sign up now so you don’t miss the first issue. In the meantime, tell your friends!
Artificial Ignorance
AI Roundup 092: Watermarking the AI wave
November 8, 2024.
a month ago
Sam Altman
The Merge
A popular topic in Silicon Valley is talking about what year humans and machines will merge (or, if...
over a year ago
A popular topic in Silicon Valley is talking about what year humans and machines will merge (or, if not, what year humans will get surpassed by rapidly improving AI or a genetically enhanced species). Most guesses seem to be between 2025 and 2075.
People used to call this the...
Weighty Thoughts
CUDA is Still a Giant Moat for NVIDIA
Despite everyone’s focus on hardware, the software of AI is what protects NVIDIA
9 months ago
Despite everyone’s focus on hardware, the software of AI is what protects NVIDIA
Weighty Thoughts
Let's talk about AI power costs
It's important, but a lot of recent attention has been concern-trolling
5 months ago
It's important, but a lot of recent attention has been concern-trolling
Strange Loop Canon
Disruption starts at the margins, but doesn't stop there
Contra Hoel on AI and its supply paradox
a year ago
Contra Hoel on AI and its supply paradox
Society's Backend
JAX is for More Than Just Machine Learning
What JAX is and its potential applications
7 months ago
What JAX is and its potential applications
The Berkeley...
Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation
Reinforcement learning provides a conceptual framework for autonomous agents to learn from...
a year ago
Reinforcement learning provides a conceptual framework for autonomous agents to learn from experience, analogously to how one might train a pet with treats. But practical applications of reinforcement learning are often far from natural: instead of using RL to learn through trial...