
Or will they ever be reliable? Your question is already making an assumption.

vFunct
It's perfectly reliable for the things you know it to be reliable at, such as operations that fit within its context window.

Don't ask LLMs to "Write me Microsoft Excel".

Instead, ask it to "Write a directory tree view for the Open File dialog box in Excel".

Break your projects down into the smallest chunks you can for the LLMs. The more specific you are, the more reliable it's going to be.
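
As a minimal sketch of what "smallest chunks" looks like in code (the ask helper and the model name are assumptions for illustration, not a recommendation of any particular API or model):

    from openai import OpenAI  # assumes the openai Python package

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    def ask(prompt: str) -> str:
        # One narrow, specific task per call.
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model for the sketch
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    # Too broad -- don't do this:
    # ask("Write me Microsoft Excel")

    # Narrow and specific -- do this:
    code = ask("Write a directory tree view widget suitable for "
               "an Open File dialog, in Python with Tkinter")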

The rest of this year is going to be companies figuring out how to break down large tasks into smaller tasks for LLM consumption.

diggan
They're reliable already if you change the way you approach them. These probabilistic token generators will probably never be "reliable" if you expect them to output exactly what you had in mind 100% of the time, without iterating in user-space (the prompts).
kubb OP
I also think they might never become reliable.

There is a bar below which they are reliable.

"Write a Python script that adds three numbers together".

Is that bar going up? I think it probably is, although not as fast/far as some believe. I also think that "unreliable" can still be "useful".
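
For reference, the kind of output that request sits comfortably below the bar for, as a sketch (function and file names are made up for illustration):

    # add_three.py -- what the trivially reliable request above produces
    def add_three(a: float, b: float, c: float) -> float:
        return a + b + c

    if __name__ == "__main__":
        print(add_three(1, 2, 3))  # 6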

diggan
But what does that mean? If you tell the LLM "Say just 'hi' without any extra words or explanations", do you not get "hi" back from it?
TeMPOraL
That's literally the wrong way to use LLMs though.

LLMs think in tokens: the fewer they emit, the dumber they are. So asking them to be concise, or to give the answer before the explanation, is extremely counterproductive.

diggan
I was trying to make a point regarding "reliability", not a point about how to prompt or how to use them for work.
kubb OP
Sometimes I get "Hi!", sometimes "Hey!".
diggan
Which model? Just tried a bunch: ChatGPT, OpenAI's API, Claude, Anthropic's API, and DeepSeek's API (both chat and reasoner); every single one replied with a single "hi".
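
For anyone who wants to reproduce this, a minimal sketch against OpenAI's API (the model name and trial count are assumptions; swap in Anthropic's or DeepSeek's client for the others):

    from collections import Counter
    from openai import OpenAI  # assumes the openai Python package

    client = OpenAI()
    counts = Counter()

    for _ in range(20):  # trial count is arbitrary
        resp = client.chat.completions.create(
            model="gpt-4o-mini",  # assumed model for the sketch
            messages=[{"role": "user", "content":
                       "Say just 'hi' without any extra words or explanations"}],
        )
        counts[resp.choices[0].message.content] += 1

    print(counts)  # e.g. Counter({'hi': 20}) if it's reliable
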
dist-epoch
I remember when people were saying here on HN that AIs would never be able to generate a picture of hands with just 5 fingers because they just "don't have common sense".
