DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
An LLM API call, in 4 GIFs

Statelessness and cost-saving tips

An LLM API call, in 4 GIFs

64
Comments 38
4 min read
How Model Distillation Actually Works (and What the 'China Distilled Our Model' Headlines Really Mean)

How Model Distillation Actually Works (and What the 'China Distilled Our Model' Headlines Really Mean)

4
Comments
6 min read
Pydantic AI vs LangChain vs instructor: structured LLM outputs compared

Pydantic AI vs LangChain vs instructor: structured LLM outputs compared

Comments
5 min read
5 Things That Look Terrible as Plain Text (And How OpenUI Fixes Them)

5 Things That Look Terrible as Plain Text (And How OpenUI Fixes Them)

2
Comments
9 min read
The .txt File as the Soul of a Personal AI — FileRAG Memory Architecture

The .txt File as the Soul of a Personal AI — FileRAG Memory Architecture

2
Comments
4 min read
I built a Rust LLM inference engine with custom WGSL GPU kernels, here's what I learned!

I built a Rust LLM inference engine with custom WGSL GPU kernels, here's what I learned!

Comments
1 min read
Tracking Five Upstreams, Fuzzing the Parsers, and a Front Door: What Changed in llm-cli-gateway

Tracking Five Upstreams, Fuzzing the Parsers, and a Front Door: What Changed in llm-cli-gateway

1
Comments
8 min read
When You Swap Your AI Agent's Brain — Everything Breaks

When You Swap Your AI Agent's Brain — Everything Breaks

Comments
6 min read
5 walls I hit shipping an AI reading app from West Africa (and what I'd tell past-me)

5 walls I hit shipping an AI reading app from West Africa (and what I'd tell past-me)

Comments
5 min read
Why I built Ajah after Helicone went into maintenance mode

Why I built Ajah after Helicone went into maintenance mode

Comments 1
1 min read
RAG SOTA: I Built SEQUOIA and Tested 7 Pipelines — Full Results

RAG SOTA: I Built SEQUOIA and Tested 7 Pipelines — Full Results

Comments 1
2 min read
The Most Annoying Part of AI Coding Isn’t Bad Code

The Most Annoying Part of AI Coding Isn’t Bad Code

Comments
1 min read
Extract Plain Text from Medium Posts for RAG and Search Indexes

Extract Plain Text from Medium Posts for RAG and Search Indexes

Comments
2 min read
The Open Source Illusion: Why "Free" AI Models Are Getting Expensive

The Open Source Illusion: Why "Free" AI Models Are Getting Expensive

Comments 1
2 min read
Fine-Tuning Qwen2.5-0.5B to Write SRE Post-Mortem Summaries

Fine-Tuning Qwen2.5-0.5B to Write SRE Post-Mortem Summaries

2
Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.