sabareesh
Joined 286 karma
CTO @ guidedchoice.com, 3nickels.com. Making finance easy for everyone
- I am looking for an open-source terminal for iPhone. I have code-server running, so I can just use the VS Code terminal in Safari.
- TL;DR is that they didn't clean the repo (.git/ folder), so the model just reward-hacked its way to looking up future commits containing the fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589
(given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)
https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...
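For anyone setting up a similar eval, here is a minimal sketch of the kind of cleanup that seems to have been skipped. It assumes the task workspace is a plain directory on disk and that deleting the version-control metadata outright is acceptable. The function name `strip_git_metadata` is made up for illustration and is not taken from any benchmark harness.

```python
import shutil
from pathlib import Path


def strip_git_metadata(workspace: str) -> list[str]:
    """Delete every `.git` entry under `workspace` and return the removed paths.

    Hypothetical helper: the idea is that an agent working in the directory
    cannot read history (including later commits that contain the fix).
    """
    removed = []
    # sorted() collects all matches up front, before anything is deleted.
    for entry in sorted(Path(workspace).rglob(".git")):
        if not entry.exists() and not entry.is_symlink():
            continue  # already gone because a parent directory was removed
        if entry.is_dir() and not entry.is_symlink():
            shutil.rmtree(entry)  # regular repository metadata directory
        else:
            entry.unlink()  # submodules and worktrees use a `.git` file
        removed.append(str(entry))
    return removed
```

Calling `strip_git_metadata("/path/to/task_repo")` before handing the directory to the agent would return the list of removed paths, which is handy to log. Exporting a clean snapshot of the base commit instead of deleting in place would be another reasonable choice.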
- Watch out, these models hallucinate a lot more: https://artificialanalysis.ai/evaluations/omniscience?omnisc...
- Similar feeling. It seems to be good at certain things, but when something doesn't work it tries to simplify, which turns into something you didn't ask for and sometimes the opposite of what you wanted. On the other hand, with Codex you sometimes feel the AGI, but that is maybe 2 out of 10 sessions. This probably comes down mainly to how complete the prompt is and how well you define the problem.
- "Simple, Boring Tech Stack:" Good advice but a bad example, because it depends on what the engineers are familiar and comfortable with, and the technology itself should be mature enough. You don't want to spend time building an orchestrator when k8s solves it for you. Most cloud providers offer k8s as a service, which is miles better than using shell scripts if you are already familiar with k8s.
- Here is my rig, running GLM 4.5 Air. Very impressed by this model
- Where can I find the specs? I am actively working on a project with a robot arm and found the following appealing, even though it doesn't include servos, cameras, or controllers: https://www.aliexpress.us/item/3256808789646447.html?spm=a2g...
- This is what I have: "All You Need is 4x 4090 GPUs to Train Your Own Model" https://sabareesh.com/posts/llm-rig/