Preferences

yuedongze
Joined 339 karma

  1. Ultimately, I'd like to go for the experience of - if I want an app, instead of downloading an existing one, I just go there and vibe one, and I can use it right away, like perhaps a calculator, a world clock, a note pad, or prototyping an app idea. The page gets torn down right after each use, or you can create a link to make it reusable/sharable for a certain duration.

    But it's really about making vibe products disposable and responsive that I think it will make our experiences with AI app builders a lot better.

  2. haha yea i got tired of waiting and wanted to see things coming to life. interestingly there is another HN post right above this one that asks what people do when waiting for LLM response. hopefully we dont need to wait!
  3. it's very similar to the verification engineering problem i wrote about on HN last week. AI is as good as we can prove their work is genuine. and we need humans in the loop to fill in the gaps between autonomous systems and ultimately be held accountable by human laws. it's kind of sad but the reality we are facing
  4. > because it looks about right to me

    this is something one can look in further. it is really probabilistic checkable proofs underneath, and we are naturally looking for places where it needs to look right, and use that as a basis of assuming the work is done right.

  5. It's nice to see a wide array of discussions under this! Glad that I didn't give up on this thought and end up writing it down.

    I want to stress that the main point of my article is not really about AI coding, it's about letting AI perform any arbitrary tasks reliably. Coding is an interesting one because it seems like it's a place where we can exploit structure and abstraction and approaches (like TDD) to make verification simpler - it's like spot-checking in places with a very low soundness error.

    I'm encouraging people to look for tasks other than coding to see if we can find similar patterns. The more we can find these cost asymmetry (easier to verify than doing), the more we can harness AI's real potential.

  6. I think it's definitely an interesting subject for Verification Engineering. the easier to task AI to do work more precisely, the easier we can check their work.
  7. indeed, i see verification debt outweighing tradition tech debt very very soon...
  8. oh i mean the other direction! checking if a generated image is "good" that no one will tell something is off and it look naturally, rather than checking if they are fake.
  9. i've seen a lot of startups that use AI to QA human work. how about the idea of use humans to QA AI work? a lot of interesting things might follow
  10. im hoping this can introduce a framework to help people visualize the problem and figure out a way to close that gap. image generation is something every one can verify, but code generation is perhaps not. but if we can make verifying code as effortless as verifying images (not saying it's possible), then our productivity can enter the next level...
  11. oh interesting thoughts! let me digest and get back to you. overall i just defined a read API description that is "read content from this gist file by calling this API, it returns content of a text file, formatted as CSV, with first row defining the columns" and a write API description that is similar.

    I develop on a paid plan GPT-4o (it allows me creating custom GPTs), but users can be on free tier GPT-4o to use it (although it limits image uploads to a handful per day). free tier github as always.

  12. I’m curious who is willing to use more than 10 clouds
  13. The HTTP/S proxy network of NL is interesting. But I think QUIC unreliable transport + MASQUE is the future here

This user hasn’t submitted anything.

Keyboard Shortcuts

Story Lists

j
Next story
k
Previous story
Shift+j
Last story
Shift+k
First story
o Enter
Go to story URL
c
Go to comments
u
Go to author

Navigation

Shift+t
Go to top stories
Shift+n
Go to new stories
Shift+b
Go to best stories
Shift+a
Go to Ask HN
Shift+s
Go to Show HN

Miscellaneous

?
Show this modal