eachro
Joined 884 karma
- eachro parentDo you think this would be appropriate for a command line tool that hits various apis as the function calls? Ex: "what's the weather in SF tomorrow?" Or "daily price change of apple, Tesla stock for past week"? (Let's assume I have documented the apis thoroughly somewhere that the model has access to or fine tuned it on this data)
- "And 50% of the time they work 50% of the time."
I think this is still an incredible outcome given how many dice rolls you can take in parallel with multiple claude/o3/gemini attempts at a problem with slightly different prompts. Granted, each rollout does not come for free given the babysitting you need to do but the cost is much lower than going down the path yourself/having junior colleagues make the attempt.
- During the openai gym era of RL, one of the great selling pts was that RL was very approachable for a new comer as the gym environments were small and tractable that a hobbyist could learn a little bit of RL, try it out on cartpole and see how it'd perform. Are there similarly tractable RL tasks/learning environments with LLMs? From the outside, my impression is that you need some insane GPU access to even start to mess around with these models. Is there something one can do on a normal MacBook air for instance in this LLM x RL domain?
- This is cool that simd primitives exist in the std lib of rust. I've wanted wanted to mess around a bit more with simd in python but I don't think that native support exists. Or your have to go down to C/C++ bindings to actually mess around with it (last I checked at least, please correct me if I'm wrong).
- I wonder how he regards his success as a productivity guru vs professor. All credit to him for achieving a level of success, notoriety that most will never get close to in their life. At the same time, I suspect productivity guru is not quite what he wanted to become. Or maybe it is. I don't know.
- When people learn about distributed systems outside of work, how do they actually get hands on experience with it (assuming they don't go spinning up a bunch of machines on aws/gcp/azure/etc)? I find it easiest to learn by doing, writing simple proof of concepts but that seems a bit harder to do in this area than others? What is the hello world/mnist of messing around with distributed systems?
- Every year I see tons of CS students captivated by the beauty of algorithms in theoretical CS. I lose track of all the bright eyed undergraduates saying they love thinking about algorithms and would ideally like to spend their summer doing research on these sorts of problems. More often than not, I end up telling them they should focus their efforts on the systems side of things and chat with an OS or DB professor rather than a prof in TCS, but very few of them actually take this advice.
- Super fun game. I really wish there were a larger online community that plays it. Jane Street made a great website for playing it online: https://figgie.com/. But alas, there's pretty much never anyone online to play against.