lilianweng.github.io
Get the latest updates from Lil'Log directly as they happen.
Last updated 5 months ago
5 months ago
Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or...
10 months ago
Hallucination in large language models usually refers to the model generating unfaithful...
about 1 year ago
Diffusion models have demonstrated strong results on image synthesis in past years...
about 2 years ago
Prompt Engineering, also known as In-Context Prompting, refers to methods for how...
over 2 years ago
Many new Transformer architecture improvements have been proposed since my last post...
over 2 years ago
Large transformer models are mainstream nowadays, creating SoTA results for a variety...
over 2 years ago
Neural networks are well known to be over-parameterized and can often easily...
almost 3 years ago
Processing images to generate text, such as image captioning and visual question-answering...
about 3 years ago
Here comes the Part 3 on learning with not enough data (Previous...
about 3 years ago
This is part 2 of what to do when facing a limited...
over 3 years ago
When facing a limited amount of labeled data for supervised learning tasks...