Preferences

One user reports being able to fit more code into an LLM's context window by indenting with tabs, since a tab takes one token while four spaces can take four.

https://old.reddit.com/r/ChatGPTCoding/comments/15s62yl/save...
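The claim rests on simple arithmetic: if a tokenizer encodes each indentation character (or each small group) separately, replacing every four-space indent with a tab shrinks the input. A minimal sketch of the conversion, with a hypothetical helper name (`tabs_for_spaces`) of my own; actual token savings depend on the tokenizer, and modern BPE tokenizers often merge runs of spaces into single tokens:

```python
def tabs_for_spaces(source: str, width: int = 4) -> str:
    """Replace each run of `width` leading spaces with one tab character."""
    out_lines = []
    for line in source.splitlines():
        stripped = line.lstrip(" ")
        indent = len(line) - len(stripped)
        # Whole indent levels become tabs; any remainder stays as spaces.
        out_lines.append("\t" * (indent // width) + " " * (indent % width) + stripped)
    return "\n".join(out_lines)

code = "def f():\n    if True:\n        return 1"
converted = tabs_for_spaces(code)
# The converted source is strictly shorter in characters, which is the
# premise of the savings -- fewer indentation characters to tokenize.
print(len(code), len(converted))
```

Character count is only a proxy: whether the shorter input also costs fewer tokens is exactly what the comments below dispute.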


zahlman
Did these systems really not learn to recognize a run of four spaces as a single token, despite all the Python source code fed to them?
cap11235
For one tokenizer, a couple years ago.
