Comment by roselan - Hacker Neue

roselan Apr 2, 2025 parent

I'm surprised there was no human tested for a base reference point. I'm pretty sure some of us would not pass the test held by another human.

Ukv Apr 2, 2025

Human win rate would be 1 minus the model win rate, to my understanding. So 77% against ELIZA, 27% against GPT-4.5 with a human persona.

This item has no comments currently.

It looks like you have JavaScript disabled. This web app requires that JavaScript is enabled. Please enable JavaScript to use this site (or just go read Hacker News).

Preferences

Keyboard Shortcuts

Story Lists

Navigation

Miscellaneous