lilianweng.github.io

Lil'Log

Latest posts

Last updated 3 months ago

Why We Think

3 months ago

Special thanks to John Schulman for a lot of super valuable feedback...

Reward Hacking in Reinforcement Learning

8 months ago

Reward hacking occurs when a reinforcement learning (RL) agent exploits flaws or...

Extrinsic Hallucinations in LLMs

about 1 year ago

Hallucination in large language models usually refers to the model generating unfaithful...

Diffusion Models for Video Generation

over 1 year ago

Diffusion models have demonstrated strong results on image synthesis in past years...

Prompt Engineering

over 2 years ago

Prompt Engineering, also known as In-Context Prompting, refers to methods for how...

The Transformer Family Version 2.0

over 2 years ago

Many new Transformer architecture improvements have been proposed since my last post...

Large Transformer Model Inference Optimization

over 2 years ago

Large transformer models are mainstream nowadays, creating SoTA results for a variety...

Some Math behind Neural Tangent Kernel

almost 3 years ago

Neural networks are well known to be over-parameterized and can often easily...

Generalized Visual Language Models

about 3 years ago

Processing images to generate text, such as image captioning and visual question-answering...

Learning with not Enough Data Part 3: Data Generation

over 3 years ago

Here comes the Part 3 on learning with not enough data (Previous...

FAQ

over 3 years ago

Learning with not Enough Data Part 2: Active Learning

over 3 years ago

This is part 2 of what to do when facing a limited...
