- this seems like a crazy idea as the cli client has nothing to do with how many tokens per second the api streams
- if the quality of search results today is anything to go buy -- clearly no
- > I've consistently found Gemini to be better than ChatGPT [ because ] Google has crawled the internet so they have more data to work with.
This commonly expressed non-sequitur needs to die.
First of all, all of the big AI labs have crawled the internet. That's not a special advantage to Google.
Second, that's not even how modern LLMs are trained. That stopped with GPT-4. Now a lot more attention is paid to the quality of the training data. Intuitively, this makes sense. If you train the model on a lot of garbage examples, it will generate output of similar quality.
So, no, Google's crawling prowess has little to do with how good Gemini can be.
- To be honest, no. It would just disappoint me as a customer and make me switch to a much cheaper Android.
- My biggest issue with iOS 26 is not the UI (it’s subpar compared to prior work), but the fact that it drains my battery 2x faster than before and I’m with an iPhone 16 Pro. That’s unacceptable performance degradation on a 1-year old phone.
- Sounds nice, in theory, but in practice I want to iterate on one, perhaps, two tasks at a time, and keep a good understanding of what the agent is doing, so that I can prevent it from going off the rails, making bad decisions and then building on them even further.
Worktrees and parallel agents do nothing to help me with that. It's just additional cognitive load.
- While this is cool, can anything be done about the speed of inference?
At least for my use, 200K context is fine, but I’d like to see a lot faster task completion. I feel like more people would be OK with the smaller context if the agent acts quickly (vs waiting 2-3 mins per prompt).
- This sounds like the programmer equivalent of astrology.
> Build context for the work you're doing. Put lots of your codebase into the context window.
If you don’t say that, what do you think happens as the agent works on your codebase.
- > But honestly what are the examples of people losing their jobs to software ?
Bank tellers
Travel agents
Cashiers
Bookkeeping clerks
Typists
- Most recently, AI helped me salvage and refactor a giant and completely mismanaged outsourced rewrite of a popular niche site.
Without it, the site would suffer a slow and painful death in the SERPs and would lead to about 10MM annual loss for the company.
Starting from scratch with a proper, qualified team was not possible for political reasons.
So, being able to do it as a single person, with heavy AI assistance, is a huge win.
- Sadly, I don't think this astroturfing is limited to announcement threads. It seems it is becoming increasingly hard to source real human opinions online, even on specialized forums like this or Reddit communities.
I hope that I am wrong, but, if I am not, then these companies are doing real and substantial damage to the internet. The loss of trust will be very hard to undo.
- The most alarming to me thing is that it seems to be happening at scale. This is one of dozens similar posts I've seen all over the programming communities with similar characteristics (high praise, new-ish accounts, little if any other activity).
- Yes, this is a paid comment, in the sense that it's probably a bot. 22 day old account, with 1 post, praising Claude.
For more than a year, Anthropic has engaged in an extensive guerrilla marketing effort on Reddit and similar developer-oriented platforms, aiming to persuade users that Claude significantly outperforms competitors in programming tasks, even though nearly all benchmarks indicate otherwise.
- This is the worst price to value ratio for an acquihire ever.
- Not sure why this is being downvoted, but it's absolutely true.
If you're using these models to generate code daily, the costs add up.
Sure, I'll give a really tough problem to o3 (and probably over ChatGPT, not the API), but on general code tasks, there really isn't meaningful enough difference to justify 4x the cost.
- “best where the actual logic isn’t very hard”?
yeah, well it’s also one of the top scorers on the Math olympiads
- > somehow deducted from subsidies
Do you think the EU subsidizes Meta/Apple.
- They wanted to use Claude, but had to settle for Gemini, as they just couldn't justify paying those API token prices.
- This looks like the work of a team of stable geniuses.
- alright
- No one will use this
- YC start up idea -- an app that uses AI to help you maximize your YC application
- that is an interesting insight
- So, in essence, both the input and the output are read by a LLM that's fine-tuned to censor. If it flags up content, it instructs the core model to refuse. Similar to most AI-based moderation systems. It's a bit more complicated as there's one LLM for inputs and another one for outputs, but it's not really a groundbreaking idea.
- Actually, I have a really good counter example but I'm unwilling to share it publicly, as I don't want to dox myself.
But the gist of it is that CF sales are really good at identifying users that are both locked into their offerings and big enough to be able to sign an expensive contract.
CF do have an excellent offering and workers, in particular, are amazing for many things.
Once the above conditions hit though, you will invariably get a call from the sales team. There is no free lunch.
- wait
- The reason it's free and with unlimited bandwidth is that it's not.
Unless you stay very small, you'll eventually get on the radar of the sales team and you'll realize the service is neither unlimited nor free. In fact, you'll likely have to look at a 5 or 6-figure contract to remain on the service.
- I don't understand how `serious professionals` continue to try to trash WordPress using superficial and often uneducated arguments, completely ignoring the fact that it has huge flexibility, a rich ecosystem, many options for using modern development practices and 40%+ global market share.
Because it's most likely in the training data. I.e., it stole it for you.