The Berkeley Artificial Intelligence Research Blog

The Berkeley...

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization... Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications....

3 months ago

39

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

3 months ago

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However, as LLMs have improved, so have the attacks against them. Prompt injection attack is listed as the #1 threat by OWASP to LLM-integrated applications, where an LLM input contains a...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Repurposing Protein Folding Models for Generation with Latent Diffusion PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D...

3 months ago

47

Repurposing Protein Folding Models for Generation with Latent Diffusion

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

3 months ago

PLAID is a multimodal generative model that simultaneously generates protein 1D sequence and 3D structure, by learning the latent space of protein folding models. The awarding of the 2024 Nobel Prize to AlphaFold2 marks an important moment of recognition for the of AI role in...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement...

4 months ago

45

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

4 months ago

Training Diffusion Models with Reinforcement Learning We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to tackle "stop-and-go" waves, those...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Virtual Personas for Language Models via an Anthology of Backstories Anthology, a method for conditioning LLMs to representative, consistent, and diverse virtual...

8 months ago

85

Virtual Personas for Language Models via an Anthology of Backstories

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

8 months ago

Anthology, a method for conditioning LLMs to representative, consistent, and diverse virtual personas by generating and utilizing naturalistic backstories with rich details of individual values and experience. --> We introduce Anthology, a method for conditioning LLMs to...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination Sample language model responses to different varieties of English and native speaker...

10 months ago

129

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

10 months ago

Sample language model responses to different varieties of English and native speaker reactions. ChatGPT does amazingly well at communicating with people in English. But whose English? Only 15% of ChatGPT users are from the US, where Standard American English is the default. But...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could...

11 months ago

127

How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

11 months ago

When we began studying jailbreak evaluations, we found a fascinating paper claiming that you could jailbreak frontier LLMs simply by translating forbidden prompts into obscure languages. Excited by this result, we attempted to reproduce it and found something unexpected. The...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark! Humans excel at processing vast arrays of visual information, a skill that is crucial for achieving...

a year ago

126

Are We Ready for Multi-Image Reasoning? Launching VHs: The Visual Haystacks Benchmark!

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

Humans excel at processing vast arrays of visual information, a skill that is crucial for achieving artificial general intelligence (AGI). Over the decades, AI researchers have developed Visual Question Answering (VQA) systems to interpret scenes within single images and answer...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

TinyAgent: Function Calling at the Edge The ability of LLMs to execute commands through plain language (e.g. English) has enabled agentic...

a year ago

131

TinyAgent: Function Calling at the Edge

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

The ability of LLMs to execute commands through plain language (e.g. English) has enabled agentic systems that can complete a user query by orchestrating the right set of tools (e.g. ToolFormer, Gorilla). This, along with the recent multi-modal efforts such as the GPT-4o or...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Modeling Extremely Large Images with $x$T As computer vision researchers, we believe that every pixel can tell a story. However, there seems...

a year ago

149

Modeling Extremely Large Images with $x$T

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

As computer vision researchers, we believe that every pixel can tell a story. However, there seems to be a writer’s block settling into the field when it comes to dealing with large images. Large images are no longer rare—the cameras we carry in our pockets and those orbiting our...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

2024 BAIR Graduate Directory Every year, the Berkeley Artificial Intelligence Research (BAIR) Lab graduates some of the most...

a year ago

121

2024 BAIR Graduate Directory

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

Every year, the Berkeley Artificial Intelligence Research (BAIR) Lab graduates some of the most talented and innovative minds in artificial intelligence and machine learning. Our Ph.D. graduates have each expanded the frontiers of AI research and are now ready to embark on new...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

The Shift from Models to Compound AI Systems AI caught everyone’s attention in 2023 with Large Language Models (LLMs) that can be instructed to...

a year ago

128

The Shift from Models to Compound AI Systems

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

AI caught everyone’s attention in 2023 with Large Language Models (LLMs) that can be instructed to perform general tasks, such as translation or coding, just by prompting. This naturally led to an intense focus on models as the primary ingredient in AI application development,...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Ghostbuster: Detecting Text Ghostwritten by Large Language Models The structure of Ghostbuster, our new state-of-the-art method for detecting AI-generated...

a year ago

122

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

The structure of Ghostbuster, our new state-of-the-art method for detecting AI-generated text. Large language models like ChatGPT write impressively well—so well, in fact, that they’ve become a problem. Students have begun using these models to ghostwrite assignments, leading...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Asymmetric Certified Robustness via Feature-Convex Neural Networks Asymmetric Certified Robustness via Feature-Convex Neural Networks TLDR: We propose the asymmetric...

a year ago

99

Asymmetric Certified Robustness via Feature-Convex Neural Networks

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

Asymmetric Certified Robustness via Feature-Convex Neural Networks TLDR: We propose the asymmetric certified robustness problem, which requires certified robustness for only one class and reflects real-world adversarial scenarios. This focused setting allows us to introduce...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Goal Representations for Instruction Following Goal Representations for Instruction Following Figure title. Figure caption. This image is...

a year ago

126

Goal Representations for Instruction Following

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

Goal Representations for Instruction Following Figure title. Figure caption. This image is centered and set to 50% page width. --> A longstanding goal of the field of robot learning has been to create generalist agents that can perform tasks for humans. Natural language has...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Rethinking the Role of PPO in RLHF Rethinking the Role of PPO in RLHF TL;DR: In RLHF, there’s tension between the reward learning...

a year ago

177

Rethinking the Role of PPO in RLHF

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

a year ago

Rethinking the Role of PPO in RLHF TL;DR: In RLHF, there’s tension between the reward learning phase, which uses human preference in the form of comparisons, and the RL fine-tuning phase, which optimizes a single, non-comparative reward. What if we performed RL in a comparative...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Training Diffusion Models with <br> Reinforcement Learning function reveal() { const replay = document.querySelector('.ddpo-replay'); ...

over a year ago

92

Training Diffusion Models with <br> Reinforcement Learning

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

function reveal() { const replay = document.querySelector('.ddpo-replay'); replay.style.display = 'flex'; } window.onload = () => { const replay = document.querySelector('.ddpo-replay'); replay.addEventListener('click', () => { ...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

On the Stepwise Nature of <br> Self-Supervised Learning Figure 1: stepwise behavior in self-supervised learning. When training common SSL algorithms, we...

over a year ago

107

On the Stepwise Nature of <br> Self-Supervised Learning

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

Figure 1: stepwise behavior in self-supervised learning. When training common SSL algorithms, we find that the loss descends in a stepwise fashion (top left) and the learned embeddings iteratively increase in dimensionality (bottom left). Direct visualization of embeddings...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Generating 3D Molecular Conformers via Equivariant Coarse-Graining and Aggregated Attention --> Figure 1: CoarsenConf architecture. (I) The encoder $q_\phi(z| X, \mathcal{R})$ takes the...

over a year ago

99

Generating 3D Molecular Conformers via Equivariant Coarse-Graining and Aggregated Attention

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

--> Figure 1: CoarsenConf architecture. (I) The encoder $q_\phi(z| X, \mathcal{R})$ takes the fine-grained (FG) ground truth conformer $X$, RDKit approximate conformer $\mathcal{R}$ , and coarse-grained (CG) conformer $\mathcal{C}$ as inputs (derived from $X$ and a predefined...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

GPT-4 + Stable-Diffusion = ?: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with... TL;DR: Text Prompt -> LLM -> Intermediate Representation (such as an image layout) -> Stable...

over a year ago

115

GPT-4 + Stable-Diffusion = ?: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

TL;DR: Text Prompt -> LLM -> Intermediate Representation (such as an image layout) -> Stable Diffusion -> Image. Recent advancements in text-to-image generation with diffusion models have yielded remarkable results synthesizing highly realistic and diverse images. However,...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Interactive Fleet Learning Figure 1: “Interactive Fleet Learning” (IFL) refers to robot fleets in industry and academia that...

over a year ago

89

Interactive Fleet Learning

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

Figure 1: “Interactive Fleet Learning” (IFL) refers to robot fleets in industry and academia that fall back on human teleoperators when necessary and continually learn from them over time. In the last few years we have seen an exciting development in robotics and artificial...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Koala: A Dialogue Model for Academic Research In this post, we introduce Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data...

over a year ago

89

Koala: A Dialogue Model for Academic Research

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

In this post, we introduce Koala, a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data gathered from the web. We describe the dataset curation and training process of our model, and also present the results of a user study that compares our model to ChatGPT and...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation Reinforcement learning provides a conceptual framework for autonomous agents to learn from...

over a year ago

72

Fully Autonomous Real-World Reinforcement Learning with Applications to Mobile Manipulation

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

Reinforcement learning provides a conceptual framework for autonomous agents to learn from experience, analogously to how one might train a pet with treats. But practical applications of reinforcement learning are often far from natural: instead of using RL to learn through trial...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Keeping Learning-Based Control Safe by Regulating Distributional Shift To regulate the distribution shift experience by learning-based controllers, we seek a mechanism for...

over a year ago

75

Keeping Learning-Based Control Safe by Regulating Distributional Shift

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

To regulate the distribution shift experience by learning-based controllers, we seek a mechanism for constraining the agent to regions of high data density throughout its trajectory (left). Here, we present an approach which achieves this goal by combining features of density...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Reverse engineering the NTK: towards first-principles architecture design Deep neural networks have enabled technological wonders ranging from voice recognition to machine...

over a year ago

61

Reverse engineering the NTK: towards first-principles architecture design

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

Deep neural networks have enabled technological wonders ranging from voice recognition to machine transition to protein engineering, but their design and application is nonetheless notoriously unprincipled. The development of tools and methods to guide this process is one of the...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Why do Policy Gradient Methods work so well in Cooperative MARL? Evidence from Policy Representation In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy...

over a year ago

63

Why do Policy Gradient Methods work so well in Cooperative MARL? Evidence from Policy Representation

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

In cooperative multi-agent reinforcement learning (MARL), due to its on-policy nature, policy gradient (PG) methods are typically believed to be less sample efficient than value decomposition (VD) methods, which are off-policy. However, some recent empirical studies demonstrate...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

FIGS: Attaining XGBoost-level performance with the interpretability and speed of CART FIGS (Fast Interpretable Greedy-tree Sums): A method for building interpretable models by...

over a year ago

67

FIGS: Attaining XGBoost-level performance with the interpretability and speed of CART

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

FIGS (Fast Interpretable Greedy-tree Sums): A method for building interpretable models by simultaneously growing an ensemble of decision trees in competition with one another. Recent machine-learning advances have led to increasingly complex predictive models, often at the cost...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

The Berkeley Crossword Solver We recently published the Berkeley Crossword Solver (BCS), the current state of the art for solving...

over a year ago

98

The Berkeley Crossword Solver

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

We recently published the Berkeley Crossword Solver (BCS), the current state of the art for solving American-style crossword puzzles. The BCS combines neural question answering and probabilistic inference to achieve near-perfect performance on most American-style crossword...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Rethinking Human-in-the-Loop for Artificial Augmented Intelligence How do we build and evaluate an AI system for real-world applications? In most AI research, the...

over a year ago

96

Rethinking Human-in-the-Loop for Artificial Augmented Intelligence

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

How do we build and evaluate an AI system for real-world applications? In most AI research, the evaluation of AI methods involves a training-validation-testing process. The experiments usually stop when the models have good testing performance on the reported datasets because...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

The Berkeley...

Designing Societally Beneficial Reinforcement Learning Systems Deep reinforcement learning (DRL) is transitioning from a research field focused on game playing to...

over a year ago

87

Designing Societally Beneficial Reinforcement Learning Systems

from The Berkeley Artificial Intelligence Research Blog [alt+shift+b] in AI

over a year ago

Deep reinforcement learning (DRL) is transitioning from a research field focused on game playing to a technology with real-world applications. Notable examples include DeepMind’s work on controlling a nuclear reactor or on improving Youtube video compression, or Tesla attempting...

Direct Link [→] Remove from reading list Add to reading list [alt+a]

upvote [alt+ctrl+↑] downvote [alt+ctrl+↓] prev [↑] next [↓]

New here?

bored reading