- 2 points
- 2 points
- 1 point
- it's very similar to the verification engineering problem i wrote about on HN last week. AI is as good as we can prove their work is genuine. and we need humans in the loop to fill in the gaps between autonomous systems and ultimately be held accountable by human laws. it's kind of sad but the reality we are facing
- It's nice to see a wide array of discussions under this! Glad that I didn't give up on this thought and end up writing it down.
I want to stress that the main point of my article is not really about AI coding, it's about letting AI perform any arbitrary tasks reliably. Coding is an interesting one because it seems like it's a place where we can exploit structure and abstraction and approaches (like TDD) to make verification simpler - it's like spot-checking in places with a very low soundness error.
I'm encouraging people to look for tasks other than coding to see if we can find similar patterns. The more we can find these cost asymmetry (easier to verify than doing), the more we can harness AI's real potential.
- im hoping this can introduce a framework to help people visualize the problem and figure out a way to close that gap. image generation is something every one can verify, but code generation is perhaps not. but if we can make verifying code as effortless as verifying images (not saying it's possible), then our productivity can enter the next level...
- 198 points
- 10 points
- 92 points
- 4 points
- 3 points
- oh interesting thoughts! let me digest and get back to you. overall i just defined a read API description that is "read content from this gist file by calling this API, it returns content of a text file, formatted as CSV, with first row defining the columns" and a write API description that is similar.
I develop on a paid plan GPT-4o (it allows me creating custom GPTs), but users can be on free tier GPT-4o to use it (although it limits image uploads to a handful per day). free tier github as always.
- 6 points
- 4 points
- 6 points
- 255 points
- 2 points
- 2 points
- 2 points
But it's really about making vibe products disposable and responsive that I think it will make our experiences with AI app builders a lot better.