
sabareesh
Joined 286 karma
CTO @ guidedchoice.com, 3nickels.com. Making finance easy for everyone

  1. I am looking for an open source terminal for iPhone. I have code-server running, so I can just use the terminal from VS Code in Safari.
  2. Sorry to disappoint, but purely Codex and Claude Code.
  3. I have switched to terminal
  4. TL;DR is that they didn't clean the repo (.git/ folder), so the model just reward hacked its way to looking up future commits with fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589

    (given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)

    https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...

  5. Non-starter for us; we can't ship proprietary data to third-party servers.
  6. This has one of the worst scores on AA-Omniscience Hallucination Rate.
  7. Nope, lower is better. Compared to recent OpenAI models, this is bad. I am looking at AA-Omniscience Hallucination Rate.
  8. So is a daily dose of 10,000 IU OK?
  9. Technically you kind of get this in Nevada when using Tesla Insurance if you drive 100% on FSD. If you drive manually, you are pretty much docked for random Forward Collision Warnings, which are super sensitive.
  10. It might be that our current tokenization is inefficient compared to how well the image pipeline does. Language already does a lot of compression, but there might be an even better way to represent it in latent space.
  11. Similar feeling. It seems good at certain things, but if something doesn't work it wants to do things simply, which in turn becomes something you didn't ask for and sometimes the opposite of what you wanted. On the other hand, with Codex you sometimes feel the AGI, but that is like 2 out of 10 sessions. This may primarily be due to how complete the prompt is and how well you define the problem.
  12. "Simple, Boring Tech Stack:" Good advice but a bad example, because it depends on what the engineers are familiar and comfortable with, and the technology itself should be mature enough. You don't want to spend time building an orchestrator when k8s solves it for you. Most cloud providers offer k8s as a service, which is miles better than using shell scripts if you are already familiar with k8s.
  13. This is great, but most of the work is in curating the dataset and the objective functions for RL.
  14. Looks very similar to DGX Spark.
  15. Here is my rig, running GLM 4.5 Air. Very impressed by this model

    https://sabareesh.com/posts/llm-rig/

    https://huggingface.co/zai-org/GLM-4.5

  16. That was the confusing part of this video. Maybe there are some limitations in the tools he uses to tune.
  17. Where can I find the specs? I am actively working on a project with a robot arm and found the following appealing, even though it doesn't include servos, cameras, or controllers: https://www.aliexpress.us/item/3256808789646447.html?spm=a2g...
  18. Good move. Not sure whether they are exposing other modalities as well?
  19. Interesting. One thing that is more satisfying about robotics is that you can see your creations in the real world.
  20. Learning, prototyping, and then scaling it into the cloud. It can also be used as an inference engine to train another model if you are using a model as a judge for RL.
  21. This is what I have: https://sabareesh.com/posts/llm-rig/ (All You Need is 4x 4090 GPUs to Train Your Own Model)
  22. Raw models may not be good enough. I wonder how thinking models do on these
  23. Hate to say it, but it seems Safari might be the alternative. The only missing piece is uBlock Origin.
  24. I have seen several videos on this but seeing an actual demo on a homemade setup really helps understanding how it is actually done
  25. Seems they lost the ship; it was supposed to be v2 and had several changes.

This user hasn’t submitted anything.
