
"open source" has come to mean "open weight" in model land. It is what it is. Words are used for communication, you are the one misusing the words.

You can update the weights of the model, continue to train, whatever. Nobody is stopping you.


it still doesn't sit right. sure it's different in terms of mutability from, say, compiled software programs, but it still isn't end-to-end reproducible or available for inspection.

these words had meaning long before "model land" became a thing. overloading them is just confusing for everyone.

It's not confusing; no one is really confused except the people upset that the meaning is different in a different context.

On top of that, in many cases a company/group/whoever can't even reproduce the model themselves. There are lots of sources of non-determinism even if folks are doing things in a very buttoned-up manner. And when you are training on trillions of tokens, you are likely training on some awful-sounding stuff - "Facebook trained Llama 4 on Nazi propaganda!" is not what they want to see published.

How about just being thankful?

i disagree. words matter. the whole point of open source is that anyone can look and see exactly how the sausage is made. that is the point. that is why the word "open" is used.

...and sure, compiling gcc is nondeterministic too, but i can still inspect the complete source it comes from because it is open source, which means that all of the source materials are available for inspection.

The point of open source in software is as you say. It's just not the same thing though. Using words and phrases differently in different fields is common.

...and my point is that it should be the same thing.

the practice of science itself would be far stronger if it took more pages from open source software culture.

Weights are meaningless without training data and source.

I get a lot of meaning out of weights and source (without the training data), not sure about you. Calling it meaningless seems like exaggeration.

Can you change the weights to improve?

You can fine-tune without the original training data, which for a large LLM typically means using LoRA: keeping the original weights unchanged and adding separate fine-tuning weights.

it's a bunch of numbers. Of course you can change them.
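
For anyone unfamiliar with the LoRA approach mentioned a couple of comments up, here is a minimal sketch of the idea, assuming PyTorch; the class and parameter names are illustrative, not taken from any particular library. The pretrained weight stays frozen and only a small low-rank adapter is trained:

    # Minimal LoRA sketch: the original weight matrix is frozen and a
    # low-rank update (A @ B) is trained instead of modifying it.
    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        def __init__(self, in_features, out_features, rank=8, alpha=16):
            super().__init__()
            # Pretrained layer: frozen, never updated during fine-tuning.
            self.base = nn.Linear(in_features, out_features)
            self.base.weight.requires_grad_(False)
            self.base.bias.requires_grad_(False)
            # Low-rank adapter: the only trainable parameters.
            self.lora_a = nn.Parameter(torch.randn(in_features, rank) * 0.01)
            self.lora_b = nn.Parameter(torch.zeros(rank, out_features))
            self.scale = alpha / rank

        def forward(self, x):
            # Frozen base output plus the scaled low-rank correction.
            return self.base(x) + (x @ self.lora_a @ self.lora_b) * self.scale

During fine-tuning only lora_a and lora_b receive gradients, so the original checkpoint is left untouched and the adapter can be shipped separately.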
