DEV Community

# pytorch

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?

Tracing torch.cuda.empty_cache() on an RTX 4090 - Where Do the 53 MB Go?

Comments
5 min read
QAT vs PTQ on our edge vision model: 6 months of A/B data

QAT vs PTQ on our edge vision model: 6 months of A/B data

Comments
4 min read
Structured channel pruning got our detector under 12ms on a Jetson

Structured channel pruning got our detector under 12ms on a Jetson

Comments
4 min read
Serving 40 LoRA adapters on one base model: the throughput we got

Serving 40 LoRA adapters on one base model: the throughput we got

Comments
4 min read
torch.compile recompiled our SDXL UNet 38 times in production

torch.compile recompiled our SDXL UNet 38 times in production

Comments
4 min read
LLM-as-judge variance broke our DPO training signal for 3 weeks

LLM-as-judge variance broke our DPO training signal for 3 weeks

Comments
4 min read
The bf16 grad accumulator that killed our SDXL LoRA training

The bf16 grad accumulator that killed our SDXL LoRA training

Comments
4 min read
I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

Comments
2 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)

Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)

Comments
5 min read
VLM-scored calibration sets for INT8 quantisation, routed through Bifrost

VLM-scored calibration sets for INT8 quantisation, routed through Bifrost

Comments
4 min read
Prefix caching in vLLM under multi-tenant agent traffic

Prefix caching in vLLM under multi-tenant agent traffic

Comments 1
4 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)

Why your diffusion model is slow at batch size 1 (and what actually helps)

Comments
4 min read
Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

Comments
3 min read
Distilling SAM 2 into a 6MB student for industrial inspection

Distilling SAM 2 into a 6MB student for industrial inspection

Comments
4 min read
My high-res image-to-video kept OOMing — turns out I was decoding outside no_grad

My high-res image-to-video kept OOMing — turns out I was decoding outside no_grad

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.