Follow feeds: blogs, news, RSS and more. An effortless way to read and digest content of your choice.
Get Feederblog.ml.cmu.edu
Get the latest updates from Blog | Machine Learning | Carnegie Mellon University directly as they happen.
Follow now 134 followers
Last updated 3 days ago
3 days ago
Figure 1: Our framework for validating LLM-as-a-judge systems under rating indeterminacy, where...
10 days ago
CMU researchers are presenting 156 papers at the Thirty-Ninth Annual Conference on...
16 days ago
Figure 1. Three regimes of exploration: Current RL model can explore via:...
about 1 month ago
CMU researchers are presenting 50 papers at the Thirtieth Conference on Empirical...
about 2 months ago
This blog post is based on the work BaNEL: Exploration Posteriors for...
3 months ago
TLDR If you are compute-constrained, use autoregressive models; if you are data-constrained,...
3 months ago
Verlog is a multi-turn reinforcement learning framework built for long-horizon LLM-agentic tasks...
5 months ago
CMU researchers are presenting 127 papers at the Forty-Second International Conference on...
6 months ago
Reinforcement Learning from Human Feedback (RLHF) is a popular technique used to...
7 months ago
Machine unlearning is a promising approach to mitigate undesirable memorization of training...
8 months ago
CMU researchers are presenting 143 papers at the Thirteenth International Conference on...
8 months ago
Play against Allie on lichess! Introduction In 1948, Alan Turning designed what...