Profile: msoad - Hacker Neue

msoad

Joined Jan 12, 2013 8,467 karma

19 points Jul 24, 2025

Ask HN: Help me navigate a PIP at a remote startup in the Netherlands

17 comments msoad
msoad Jan 28, 2025 parent

My layman view is that more compute (more reasoning) will not solve harder problems. I'm using those models every day and when problem hits a certain complexity it will fail, no matter how much it "reasons"
msoad Jan 28, 2025 parent

I hope the hedge fund shorted NVDA to make some good money along the way too hahaha!
msoad Jan 28, 2025 parent

They could make a ton of money shorting NVDA and releasing the paper. The most honest short position ever!
msoad Jan 27, 2025 parent

We are at multiple trillion dollar investment territory, all purely based on the idea that "to make AI you need lots of GPU and power"
msoad Jan 27, 2025 parent

four GPUs are very convincing indeed! :D
msoad Jan 27, 2025 parent

I use yek
https://github.com/bodo-run/yek
msoad Jan 25, 2025 parent

Does it work nicely on Linux? I'm very curious about this
msoad Jan 25, 2025 parent

All I want is an iPhone Shortcuts script to delete messages like "Hi" and "Hey" from unknown numbers. I get so many of those and having to delete them is a pain.
Shortcuts does not allow deleting messages apparently :(
msoad Jan 23, 2025 parent

I saw this earlier. benchmarks are impressive!
Did OpenAI release anything beside this product? Any benchmarks at least to compare?
It feels like OpenAI is betting on the fact that they have a nice UI?!
2 points Jan 23, 2025

UI-Tars: Pioneering Automated GUI Interaction with Native Agents

0 comments msoad arxiv.org
msoad Jan 23, 2025 parent

This is a very uniformed response IMO. S3 seems very niche compared to Node.js compatibility. Not sure why you're attacking me for saying this?
msoad Jan 23, 2025 parent

I have not tried Bun yet but the long list of features makes me skeptical that it's all solid and bug-free. I'm wishing to be wrong. I'll give it a spin in a future project.
From a project management perspective I'm a little confused why would you spend time on S3 support while you're still not 100% Node.js compatible. Next.js is a very big ecosystem and if you can get Next.js customers onboard you'll grow much more than supporting S3.
msoad Jan 23, 2025 parent

Can someone please do some security research on this to see what it sends when it calls home?
I'm terrified to install a binary like that
msoad Jan 23, 2025 parent

I agree with Tailwind's stance on this. You really don't need @apply if you're breaking things down to smaller components. I often see people have things like <ul><li className="long_list_of_classes">text1</li><li className="long_list_of_classes">text2</li>...</ul>. This is where I think we need a linter to warn against things like that. Make those <li>'s a component!
msoad Jan 23, 2025 parent

I think everyone who worked at Google in the past has PTSD from having to migrate gRPC schemas. What a mess! Type safety doesn't have to be this costly
msoad Jan 22, 2025 parent

no... one more lane will fix the traffic. Truly American approach
Amazing to see how DeepSeek R1 is doing better than OpenAI models with much less resources
msoad Jan 22, 2025 parent

if your birth year starts with 2, I can see why you might think that
msoad Jan 22, 2025 parent

I am paying for o1 Pro but since Deepseek R1 came out I stopped using it. So there goes $200/mo of their revenue ;)
msoad Jan 22, 2025 parent

I know. It is sad. Naming can also be seen as a way of showing respect to a hugely impactful paper if you want to be positive about it.
msoad Jan 22, 2025 parent

Here is a conundrum:
With all that AI generated code being pushed, as a leader I wonder which is better? Enforce a ton of e2e so no code that is really well thought through all aspects of the solution can go past CI or does this enable AI to go even crazier and break all sort of best practices to just pass the test?
msoad Jan 21, 2025 parent

I'm not trying to be cheeky here. They have literally joked about how TypeScript versions means nothing really. So they can't just announce a new major version and drop enums completely. Maybe with a feature flag this is possible but even then, a fresh tsc --init not supporting enums is not really how TypeScript works
msoad Jan 21, 2025 parent

https://ai.meta.com/research/publications/large-concept-mode...
msoad Jan 21, 2025 parent

Google Maps is not showing Palestine on the maps app. it shows middle of the ocean when you search for it...
msoad Jan 21, 2025 parent

if how us humans reason about things is a clue, language is not the right tool to reason about things.
There is now research in Large Concept Models to tackle this but I'm not literate enough to understand what that actually means...
msoad Jan 20, 2025 parent

I'm not saying that never has happened. maybe they trained against openAI models but they are letting anyone to train from their output. I doubt they had access to GPT models to "distill"
msoad Jan 20, 2025 parent

No model really can "call home". It's the server running it. Luckily for Deepseek there are other providers that guarantee no data collection since the models are open source

msoad Jan 20, 2025 parent

> Give me five odd numbers that don't have the letter 'e' in their spelling

Compare the reasoning times!!! 84s vs 342s

R1 (Thought for 84 seconds)

      No odd number in English avoids the letter 'e' in its spelling. The request for five such numbers cannot be fulfilled.

o1 Pro (Thought for 5 minutes and 42 seconds)

      No standard English spelling of an odd number can avoid “e.” Every odd digit (one, three, five, seven, nine) already includes “e,” so once you build any odd number out of those digits, the spelled‐out form will contain “e.” As a result, there are no such odd numbers in standard English without an “e” in their name.

msoad Jan 20, 2025 parent

It already replaces o1 Pro in many cases for me today. It's much faster than o1 Pro and results are good in most cases. Still, sometimes I have to ask the question from o1 Pro if this model fails me. Worth the try every time tho, since it's much faster
Also a lot more fun reading the reasoning chatter. Kinda cute seeing it say "Wait a minute..." a lot

This user hasn’t submitted anything.

Preferences

Keyboard Shortcuts

Story Lists

Navigation

Miscellaneous