Everything you care about in one place

Follow feeds: blogs, news, RSS and more. An effortless way to read and digest content of your choice.

Get Feeder

blog.ml.cmu.edu

Blog | Machine Learning | Carnegie Mellon University

Get the latest updates from Blog | Machine Learning | Carnegie Mellon University directly as they happen.

Follow now 134 followers

Latest posts

Last updated 3 days ago

Validating LLM-as-a-Judge Systems under Rating Indeterminacy

3 days ago

Figure 1: Our framework for validating LLM-as-a-judge systems under rating indeterminacy, where...

Carnegie Mellon at NeurIPS 2025

10 days ago

CMU researchers are presenting 156 papers at the Thirty-Ninth Annual Conference on...

How to Explore to Scale RL Training of LLMs on Hard Problems?

16 days ago

Figure 1. Three regimes of exploration: Current RL model can explore via:...

Carnegie Mellon University at EMNLP 2025

about 1 month ago

CMU researchers are presenting 50 papers at the Thirtieth Conference on Empirical...

Learning from Failure to Tackle Extremely Hard Problems

about 2 months ago

This blog post is based on the work BaNEL: Exploration Posteriors for...

Diffusion Beats Autoregressive in Data-Constrained Settings

3 months ago

TLDR If you are compute-constrained, use autoregressive models; if you are data-constrained,...

Verlog: A Multi-turn RL framework for LLM agents

3 months ago

Verlog is a multi-turn reinforcement learning framework built for long-horizon LLM-agentic tasks...

Carnegie Mellon University at ICML 2025

5 months ago

CMU researchers are presenting 127 papers at the Forty-Second International Conference on...

RLHF 101: A Technical Tutorial on Reinforcement Learning from Human Feedback

6 months ago

Reinforcement Learning from Human Feedback (RLHF) is a popular technique used to...

Unlearning or Obfuscating? Jogging the Memory of Unlearned LLMs via Benign Relearning

7 months ago

Machine unlearning is a promising approach to mitigate undesirable memorization of training...

Carnegie Mellon University at ICLR 2025

8 months ago

CMU researchers are presenting 143 papers at the Thirteenth International Conference on...

Allie: A Human-Aligned Chess Bot

8 months ago

Play against Allie on lichess! Introduction In 1948, Alan Turning designed what...