Everything you care about in one place

Follow feeds: blogs, news, RSS and more. An effortless way to read and digest content of your choice.

Get Feeder

magazine.sebastianraschka.com

Ahead of AI

Get the latest updates from Ahead of AI directly as they happen.

Follow now 69 followers

Latest posts

Last updated 20 days ago

My Workflow for Understanding LLM Architectures

20 days ago

Many people asked me over the past months to share my workflow...

Components of A Coding Agent

about 1 month ago

In this article, I want to cover the overall design of coding...

A Visual Guide to Attention Variants in Modern LLMs

about 2 months ago

I had originally planned to write about DeepSeek V4. Since it still...

A Dream of Spring for Open-Weight LLMs: 10 Architectures from Jan-Feb 2026

2 months ago

If you have struggled a bit to keep up with open-weight model...

Categories of Inference-Time Scaling for Improved LLM Reasoning

3 months ago

Inference scaling has become one of the most effective ways to improve...

The State Of LLMs 2025: Progress, Progress, and Predictions

4 months ago

As 2025 comes to a close, I want to look back at...

LLM Research Papers: The 2025 List (July to December)

4 months ago

In June, I shared a bonus article with my curated and bookmarked...

A Technical Tour of the DeepSeek Models from V3 to V3.2

5 months ago

A Technical Tour of the DeepSeek Models from V3 to V3.2Subtitle: Understanding...

Beyond Standard LLMs

6 months ago

From DeepSeek R1 to MiniMax-M2, the largest and most capable open-weight LLMs...

Understanding the 4 Main Approaches to LLM Evaluation (From Scratch)

7 months ago

How do we actually evaluate LLMs?It’s a simple question, but one that...

Understanding and Implementing Qwen3 From Scratch

8 months ago

Previously, I compared the most notable open-weight architectures of 2025 in The...

From GPT-2 to gpt-oss: Analyzing the Architectural Advances

9 months ago

OpenAI just released their new open-weight LLMs this week: gpt-oss-120b and gpt-oss-20b...