
Are you running it locally with llama.cpp? If so, is it working without any tweaking of the chat template? The tool calls fail for me when using the default chat template, however it seems to work a whole lot better with this: https://huggingface.co/Qwen/Qwen3.5-35B-A3B/discussions/9#69...
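For reference, llama-server can load a replacement chat template from a file instead of the one embedded in the GGUF. A minimal invocation might look like the sketch below; the model and template filenames are placeholders, and the template file would contain the fixed Jinja template from the linked discussion:

```shell
# Launch llama-server with an external chat template.
# --jinja enables the Jinja template engine for chat formatting;
# --chat-template-file overrides the template baked into the GGUF.
# Paths below are hypothetical placeholders.
llama-server \
  -m ./Qwen3.5-35B-A3B-Q6_K.gguf \
  --jinja \
  --chat-template-file ./fixed-template.jinja
```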


I’ve been running it via llama-server with no issues, using the latest Bartowski 6-bit quant.


Bartowski? Like Chuck Bartowski from the TV show?


Different one. Bartowski is a minor celebrity in the local LLM world, together with Unsloth.


What's the selling point of these quants vs the Unsloth ones?


Sometimes Unsloth has broken quants for a particular model, sometimes none at all, and there are subtle differences in behavior between the two.


Thanks, I'll check his quants.


Have you tried the '--jinja' flag in llama-server?


Yes, it fails too. I’m using the Unsloth Q4_K_M quant. Devstral2 Small fails in a similar way; I fixed that by using a comparable template I found for it. Maybe it’s the quants that are broken; I guess I need to redownload them.



