The performance is great, but the censorship is a deal-breaker for me. I tried it as a backend for my game Guessix[1], but it would refuse prompts for ridiculous reasons like "Cannot answer questions about copyrighted works like Harry Potter."

1. https://guessix.com/


Try the uncensored/jailbroken variants like openai-gpt-oss-20b-abliterated-uncensored-neo-imatrix

I just tried to ask it how to make crystal meth and it generated a very detailed step by step guide

I have heard that uncensored gpt-oss is not very good because it was trained mainly on synthetic data. Is that not true?
Iirc abliteration (ablation?) can be done without "training" and is pretty quick. It finds the individual weights related to the concept you want to ablate, and modifies those weights to "deactivate" them. Precision brain surgery, to anthropomorphize.
The problem with synthetic data would be that the censored information would not be in the training data at all.
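The "precision brain surgery" above can be sketched numerically. A minimal, hypothetical sketch of directional ablation: assume a unit "refusal direction" r has already been found by probing activations (finding r is the hard part and is not shown); the weight edit itself is just projecting r out of a weight matrix. All names and shapes here are toy illustrations, not the actual abliteration tooling.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8                        # toy hidden size
W = rng.normal(size=(d, d))  # some projection weight matrix
r = rng.normal(size=d)
r /= np.linalg.norm(r)       # unit "refusal direction" (assumed given)

# Remove the component of every output along r:
# W_ablated = (I - r r^T) W
W_ablated = W - np.outer(r, r) @ W

# Any input now produces an output orthogonal to r,
# i.e. the model can no longer "write" in that direction.
x = rng.normal(size=d)
out = W_ablated @ x
print(abs(out @ r))  # ~0 up to float error
```

No gradient descent involved, which is why the technique is quick compared to finetuning: it is a one-shot linear edit applied to selected weight matrices.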
Very interesting! Do the benchmarks hold up well or does it reduce performance in other areas too?
Use constrained generation
Do you mean like structured outputs? Unfortunately here the model is guided to explicitly tell you when you violate the rules and why, so it can confuse its system rules with the game rules and say you're not allowed to ask a question about copyrighted material, etc.
Mind explaining?
If you constrain the model to a JSON schema, most frivolous refusals go away.
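A minimal sketch of what "constrain to a JSON schema" means in practice: the decoder is only allowed to emit tokens that keep the output valid against a fixed schema, so there is no free-form slot for a refusal message. The schema and the sample response below are hypothetical; engines such as llama.cpp grammars or guided decoding in vLLM do the actual constraining.

```python
import json

# Hypothetical schema for a Guessix-style yes/no game turn.
schema = {
    "type": "object",
    "properties": {
        "answer": {"type": "string", "enum": ["yes", "no"]},
        "hint": {"type": "string"},
    },
    "required": ["answer"],
}

# With schema-constrained decoding the raw model output is
# guaranteed to parse; here we simulate one such output.
raw = '{"answer": "no", "hint": "It is a fictional school."}'
reply = json.loads(raw)

# The structured reply has no field where a refusal could go.
assert reply["answer"] in ("yes", "no")
print(reply["answer"])
```

The point is that a refusal like "I cannot discuss copyrighted works" is simply not a grammatical output under the schema, so the model is steered into answering in-game instead.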

And if you finetune on a few formatted examples the effect is even greater

Curious as well

