Follow feeds: blogs, news, RSS and more. An effortless way to read and digest content of your choice.
Get Feederblog.ml.cmu.edu
Get the latest updates from Blog | Machine Learning | Carnegie Mellon University directly as they happen.
Follow now 134 followers
Last updated about 1 month ago
about 1 month ago
People use LLMs to ask for insight on a variety of important...
about 2 months ago
Figure 1: Our framework for validating LLM-as-a-judge systems under rating indeterminacy, where...
about 2 months ago
CMU researchers are presenting 156 papers at the Thirty-Ninth Annual Conference on...
2 months ago
Figure 1. Three regimes of exploration: Current RL model can explore via:...
3 months ago
CMU researchers are presenting 50 papers at the Thirtieth Conference on Empirical...
3 months ago
This blog post is based on the work BaNEL: Exploration Posteriors for...
4 months ago
TLDR If you are compute-constrained, use autoregressive models; if you are data-constrained,...
5 months ago
Verlog is a multi-turn reinforcement learning framework built for long-horizon LLM-agentic tasks...
7 months ago
CMU researchers are presenting 127 papers at the Forty-Second International Conference on...
8 months ago
Reinforcement Learning from Human Feedback (RLHF) is a popular technique used to...
8 months ago
Machine unlearning is a promising approach to mitigate undesirable memorization of training...
9 months ago
CMU researchers are presenting 143 papers at the Thirteenth International Conference on...