DEV Community

# mlops

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Perplexity held flat after INT4. Task accuracy dropped 7 points.

Perplexity held flat after INT4. Task accuracy dropped 7 points.

Comments
4 min read
The seam our tiled upscaler left on every 4K product render

The seam our tiled upscaler left on every 4K product render

Comments
4 min read
Portkey Alternative: I Switched Away from Portkey. Here's the Honest Reason Why.

Portkey Alternative: I Switched Away from Portkey. Here's the Honest Reason Why.

1
Comments
7 min read
Speculative decoding shifted our output distribution and evals missed it

Speculative decoding shifted our output distribution and evals missed it

1
Comments
4 min read
LLMOps in 2026: AI Demo to Production Guide

LLMOps in 2026: AI Demo to Production Guide

Comments
8 min read
Streamlining MLOps: Model Deployment with MLflow

Streamlining MLOps: Model Deployment with MLflow

Comments
2 min read
The latency tax of an LLM gateway: I measured Bifrost's overhead

The latency tax of an LLM gateway: I measured Bifrost's overhead

Comments
4 min read
AI Workloads Are Reshaping Kubernetes in 2026: GPU Scheduling, MLOps, and the Platform Engineering Reckoning

AI Workloads Are Reshaping Kubernetes in 2026: GPU Scheduling, MLOps, and the Platform Engineering Reckoning

Comments
4 min read
What DevOps Taught Me About AI Governance

What DevOps Taught Me About AI Governance

Comments
4 min read
From ML Tooling to Analytical Governance: Recent Updates to KMDS

From ML Tooling to Analytical Governance: Recent Updates to KMDS

1
Comments
3 min read
RLAIF Is Eating RLHF — Here Are the Four Places Human Feedback Still Wins

RLAIF Is Eating RLHF — Here Are the Four Places Human Feedback Still Wins

Comments
6 min read
A 9-point eval gain vanished when we deduped train against test

A 9-point eval gain vanished when we deduped train against test

Comments
4 min read
OpenAI Already Told Us the Kubernetes Scaling Story, Most People Just Did Not Read It Closely

OpenAI Already Told Us the Kubernetes Scaling Story, Most People Just Did Not Read It Closely

Comments
10 min read
I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

Comments
3 min read
Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Comments
7 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.