syspec
Joined 2,963 karma

  1. Yes! Completely agree! So hard to debug. Nearly impossible to set breakpoints (they disappear on refresh).
  2. Sometimes the AI is all too good at writing tests.

    I agree with the idea, and I do it too, but you need to make sure the tests don't just validate the incorrect behavior, and that the code isn't updated to pass the test in a way that actually "misses the point".

    I've had this happen to me on one or two tests every time

  3. Don't worry, the 99% reduction in battery materials is just a strategic pivot to an 'asset-light' approach. The 4680 supply chain isn't collapsing, it’s just being 'optimized' for a future where cars apparently don't need batteries—just FSD subscriptions and robotaxis that run on optimism.
  4. I used to do this with Karma test runner. The best part was how it didn't try to capture everything, so debugging with breakpoints was really easy.

    I like Vitest browser mode, but it's a pain to just "detach" for a specific frame and run that test in isolation, with my actual breakpoints.

  5. > This announcement is more than just a headline—it's validation of our pioneering work with General AI Agents.

    Did anyone else think this was satire when they read that as the second line in the announcement?

    I literally laughed, then clicked the top left logo, to check out the homepage and see if this `ManuAI` was a real website.

    ---

    You would think that they would know better and at least edit that out.

    It's not just ironic -- it's cosmically poetic.

  6. Prompt injection?
  7. All they need to do is make extensions much, much easier to build, especially extensions that render HTML.

    That's vscode's moat.

    Anytime the same extension exists in both vscode and jetbrains, the jetbrains version is clunky, crash-prone, and unstable.

    I keep Jetbrains open while using vscode, for its local history/git/etc features, but how long will that be enough to keep my subscription?

  8. According to the report, 52% of all open-source AI is used for *roleplaying*. They attribute it to fewer content filters and higher creativity.

    I'm pretty surprised by that, but I guess that also selects for people who would use openrouter

  9. There are actually a lot of those! One of the best things about using them is that you can swap models around at will.

    I love to switch models and ask them what they thought of the previous model's answer

  10. Thank you Claude.
  11. Source: "trust me bro"
  12. You keep mentioning that in this thread, but a11y on a Mac is considered the gold standard.

    Security on a Mac, the same (SIP, Keychain, Secure Enclave, great tools for fleet management)

    What specifically is in violation of "#a11y or security evaluation frameworks"?

  13. 2.5 flash and 2.5 Pro were just sitting back and watching?

    The problem with Google is that someone had to show them how to make a product out of the thing, which OpenAI did.

    Then Anthropic taught them to make a more specific product out of their models

    In every aspect, they're just playing catch up, and playing me too.

    Models are only part of the solution

  14. I have "unlimited" access to both Gemini 2.5 Pro and Claude 4.5 Sonnet through work.

    From my experience, both are capable and can solve nearly all the same complex programming requests, but time and time again Gemini spits out reams and reams of over-engineered code that totally works, but that I would never want to have to interact with.

    When looking at the code, you can't tell why it looks "gross", but then you ask Claude to do the same task in the same repo (I use Cline, it's just a dropdown change) and the code also works, but there's a lot less of it and it has a more "elegant" feeling to it.

    I know that isn't easy to capture in benchmarks, but I hope Gemini 3.0 has improved in this regard

  15. Hmm... it basically says that real power in the US isn’t with the government anymore, but with giant tech companies and the rich weirdos behind them.

    These rich weirdos run the show through apps, platforms, and algorithms that decide what people see, do, and believe.

    It’s not that democracy broke, it just quietly got replaced by a system where the rules are written by whoever owns the server.

  16. Me: I built this insane 3D multilingual AI avatar platform with real-time conversation, full enterprise stack, 47 languages, and Unreal Engine

    You: Can bypass US Treasury sanctions and open a Stripe account

  17. This post reads less like an engineering deep dive and more like a Xeon product brochure that wandered into a video game blog. They casually name-drop every Intel optimization short of tattooing "Hyperthreaded" on their foreheads.
  18. I think one thing they need to focus on is making extensions easier to create.

    I'm a big fan of IntelliJ, but VS Code is eating their lunch with AI agent plugins. I'm assuming it's because it's much easier to create them on that platform.

    The same plugin, from the same team, always works much slower on IntelliJ, feels clunkier, and is prone to crashing.

    It's a shame

  19. > about US$2.5 billion on stock-based compensation, nearly double the amount from the first half of last year.

    Wow! 2.5B in stock-based compensation

  20. Wouldn't it stand to reason that they would provide a sort of utility for a colleague as they read through the generated code?
  21. This is awesome! Downloaded it and played with it and as a non Kagi user, I'm impressed!

    But please please let me select text!

    I wanted to look up a word, so I double-tapped it to select it, then I realized none of the text in the articles is selectable

  22. This article leans HARD into the noble savage trope. It critiques Western justice for being reductive, then idealizes indigenous systems without asking the same hard questions.

    It ends up mirroring the same blind spots it thought it was exposing.

  23. You should hear the long form of the acronym!

    TACOBELL

    - Trump Always Chickens Out Before Eventually Losing Loudly

  24. When I code, typing is about 10(?) percent of the time spent.

    That's true for most people

  25. I feel exactly the same way!

    I love love love coding. When I'm done with work, and I have time between family stuff, I code to relax!

    If I have a fun project, I could easily code from when I wake until I go to sleep, and I did so before having a kid.

    It's a pure joy second only to family

  26. What's beast mode, and how do I trigger it?
  27. Does this work for others when working in other domains? When creating a Swift application, I can't imagine creating 20 agents and letting them go to town. Same for the backend of such an application if it's in say, Java+Springboot
  28. Is it possible that this is the result of basically all the benchmarks being focused on coding (and a few standardized tests)?
  29. I've been using Open WebUI and have been blown away. It's a better ChatGPT interface than ChatGPT!

    https://github.com/open-webui/open-webui

    Curious how this compares to that, which has a ton of features and runs great
