I understand there are things a typical LLM can do and things it can't; I mostly asked because I figured it couldn't do it and wanted to see what would happen. But the average person isn't really given much information about the constraints, and all of these companies are promising the moon with these tools.
Short version: it definitely did not have more common sense or information than a human, and we all know it would have given this person a very confident answer about conditions in the area that was likely not correct. Definitely incorrect if it's based off a photo.
In my experience it's particularly flaky when it has to crawl the internet. The other day I asked who won which awards at The Game Awards. Three different models got it wrong, and all of them omitted at least two categories. You could throw a rock at a search engine and find 80 ready-made lists.
But yeah, I can imagine a multimodal model actually having more information and common sense than a human in a (for them) novel situation.
If only to say "don't be an idiot" or "pick higher ground". Or even just as a rubber duck!