Profile: sabareesh - Hacker Neue

sabareesh

Joined Apr 12, 2017 286 karma

CTO @ guidedchoice.com , 3nickels.com . Making finance easy for everyone

sabareesh 4 days ago parent

I am looking for some open source terminal for iphone .I have code server running which i can just use terminal from vs code on safari
sabareesh 5 days ago parent

Sorry to disappoint. But purely codex and claude code
sabareesh 5 days ago parent

I have switched to terminal
sabareesh Jan 3, 2026 parent

TL;DR is that they didn't clean the repo (.git/ folder), model just reward hacked its way to look up future commits with fixes. Credit goes to everyone in this thread for solving this: https://xcancel.com/xeophon/status/2006969664346501589
(given that IQuestLab published their SWE-Bench Verified trajectory data, I want to be charitable and assume genuine oversight rather than "benchmaxxing", probably an easy to miss thing if you are new to benchmarking)
https://www.reddit.com/r/LocalLLaMA/comments/1q1ura1/iquestl...
sabareesh Dec 30, 2025 parent

Non starter for us, we cant ship propriety data to a third party servers.
sabareesh Dec 18, 2025 parent

this has one of the worse score in AA-Omniscience Hallucination Rate
sabareesh Dec 18, 2025 parent

Nope lower is better compared to recent open ai models this is bad. I am looking at AA-Omniscience Hallucination Rate
sabareesh Dec 17, 2025 parent

Watch out these model are hallucinating lot more https://artificialanalysis.ai/evaluations/omniscience?omnisc...
sabareesh Dec 9, 2025 parent

So is 10,000 IU of daily does ok ?
7 points Nov 28, 2025

One point I made that didn't come across: Ilya

0 comments sabareesh twitter.com
1 point Nov 20, 2025

Show HN: MCP Compact – Middleware to trim noisy browser/DOM/tool outputs

0 comments sabareesh sabareesh.com
sabareesh Nov 10, 2025 parent

Technically you kind of get this in Nevada when using Tesla insurance and if you drive 100 % FSD. If you drive manually you are pretty much doxed for random Front collision Warning which is super sensitive
sabareesh Oct 22, 2025 parent

It might be that our current tokenization is inefficient compared to how well image pipeline does. Language already does lot of compression but there might be even better way to represent it in latent space
sabareesh Oct 20, 2025 parent

Similar feeling. Seems it is good at certain things and if something doesnt work it want to do things simply and in turn becomes something that you didnt ask for and certain times opposite of what you wanted. On the other hand with codex certain time you feel the AGI but that is like 2 out of 10 sessions. This is primarily may be due to how complete the prompt and how well you define the problems.
sabareesh Oct 13, 2025 parent

"Simple, Boring Tech Stack:" Good advice but bad example, because it depends on what engineers are familiar and comfortable with and technology itself should be mature enough. You dont want to spend time building orchestrator when k8s solves it for you. Most cloud provides provide you with k8s as a service, which are miles better than using shell scripts, if you are already familiar with k8s
sabareesh Sep 19, 2025 parent

This is great,but most work is involved in curating the dataset and the objective functions for RL.
sabareesh Aug 25, 2025 parent

Looks very similar to DGX spark
sabareesh Aug 8, 2025 parent

Here is my rig, running GLM 4.5 Air. Very impressed by this model
https://sabareesh.com/posts/llm-rig/
https://huggingface.co/zai-org/GLM-4.5
sabareesh Jul 23, 2025 parent

That was confusing part of this video . May be there are some limitation on the tools he uses to tune
sabareesh Jun 11, 2025 parent

Where can i find the specs. I am actively working on some project with robot arm and found following appealing eventhough this doesnt include servo or cameras or controllers. https://www.aliexpress.us/item/3256808789646447.html?spm=a2g...
sabareesh Jun 9, 2025 parent

Good move, not sure they are exposing other modalities as well ?
sabareesh Jun 2, 2025 parent

Interesting one thing that is more satisfying with robotics is that because you can see your creations in real world
sabareesh May 30, 2025 parent

Learning, prototype and then scale it in to cloud. Also can be used as inference engine to train another model if you are using model as a judge for RL.
sabareesh May 30, 2025 parent

This is what i have https://sabareesh.com/posts/llm-rig/ All You Need is 4x 4090 GPUs to Train Your Own Model
sabareesh Mar 27, 2025 parent

Raw models may not be good enough. I wonder how thinking models do on these
sabareesh Mar 1, 2025 parent

Hate to say it but seems Safari might be the alternative. Only missing piece is ublock origin
sabareesh Jan 27, 2025 parent

I have seen several videos on this but seeing an actual demo on a homemade setup really helps understanding how it is actually done
3 points Jan 27, 2025

Homemade Silicon Chips [video]

1 comment sabareesh youtube.com
sabareesh Jan 16, 2025 parent

Seems they lost the ship , it is supposed to be v2 and had several changes

This user hasn’t submitted anything.

Preferences

Keyboard Shortcuts

Story Lists

Navigation

Miscellaneous