bair.berkeley.edu

The Berkeley Artificial Intelligence Research Blog

Get the latest updates from The Berkeley Artificial Intelligence Research Blog directly as they happen.

Latest posts

Last updated 17 days ago

Adaptive Parallel Reasoning: The Next Paradigm in Efficient Inference Scaling

17 days ago

Overview of adaptive parallel reasoning. What if a reasoning model could decide...

Gradient-based Planning for World Models at Longer Horizons

about 1 month ago

GRASP is a new gradient-based planner for learned dynamics (a “world model”)...

Identifying Interactions at Scale for LLMs

2 months ago

Understanding the behavior of complex machine learning systems, particularly Large Language Models...

Information-Driven Design of Imaging Systems

5 months ago

An encoder (optical system) maps objects to noiseless images, which noise corrupts...

RL without TD learning

7 months ago

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on...

What exactly does word2vec learn?

9 months ago

What exactly does word2vec learn, and how? Answering this question amounts to...

Whole-Body Conditioned Egocentric Video Prediction

11 months ago

Predicting Ego-centric Video from human Actions (PEVA). Given past video frames...

Defending against Prompt Injection with Structured Queries (StruQ) and Preference Optimization (SecAlign)

about 1 year ago

Recent advances in Large Language Models (LLMs) enable exciting LLM-integrated applications. However,...

Repurposing Protein Folding Models for Generation with Latent Diffusion

about 1 year ago

PLAID is a multimodal generative model that simultaneously generates protein 1D sequence...

Scaling Up Reinforcement Learning for Traffic Smoothing: A 100-AV Highway Deployment

about 1 year ago

We deployed 100 reinforcement learning (RL)-controlled...

Virtual Personas for Language Models via an Anthology of Backstories

over 1 year ago

We introduce Anthology, a method for conditioning LLMs to representative, consistent, and...

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

over 1 year ago

Sample language model responses to different varieties of English and native speaker...