Comment by steveklabnik

steveklabnik Sep 29, 2025 parent

> It would make sense they scale up and down depending on utilization right?

It would, but

> To state it plainly: We never reduce model quality due to demand, time of day, or server load.

https://www.anthropic.com/engineering/a-postmortem-of-three-...

If you believe them or not is another matter, but that's what they themselves say.

transcriptase Sep 29, 2025

Well knowing the state of the tech industry they probably have a different, legal-team approved definition of “reducing model quality” than face value.

After all, using a different context window, subbing in a differently quantized model, throttling response length, rate limiting features aren’t technically “reducing model quality”.

This item has no comments currently.