Comment by tenpoundhammer

tenpoundhammer Dec 11, 2025

I have been using chatGPT a ton over the last months and paying the subscription. Used it for coding, news, stock analysis, daily problems, and whatever I could think of. I decided to give Gemini a go when version three came out to great reviews. Gemini handles every single one of my use cases much better and consistently gives better answers. This is especially true for situations where searching the web for current information is important; it makes sense that Google would be better. Also, OCR is phenomenal: ChatGPT can't read my bad handwriting but Gemini can easily. Only downsides are in the polish department, there are more app bugs and I usually have to leave the happen or the session terminates. There are bugs with uploading photos. The biggest complaint is that all links get inserted into google search and then I have to manipulate them when they should go directly to the chosen website, this has to be some kind of internal org KPI nonsense. Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.

dmd Dec 11, 2025

I consistently have exactly the opposite experience. ChatGPT seems extremely willing to do a huge number of searches, think about them, and then kick off more searches after that thinking, think about it, etc., etc. whereas it seems like Gemini is extremely reluctant to do more than a couple of searches. ChatGPT also is willing to open up PDFs, screenshot them, OCR them and use that as input, whereas Gemini just ignores them.

nullbound Dec 11, 2025

I will say that it is wild, if not somewhat problematic that two users have such disparate views of seemingly the same product. I say that, but then I remember my own experience from just a few days ago. I don't pay for Gemini, but I have a paid ChatGPT sub. I tested both for the same product with seemingly the same prompt, and subbed ChatGPT subjectively beat Gemini in terms of scope, options and links with current decent deals.

It seems ( only seems, because I have not gotten around to test it in any systematic way ) that some variables like context and what the model knows about you may actually influence quality ( or lack thereof ) of the response.

martinpw Dec 11, 2025

> I will say that it is wild, if not somewhat problematic that two users have such disparate views of seemingly the same product.

This happens all the time on HN. Before opening this thread, I was expecting that the top comment would be 100% positive about the product or its competitor, and one of the top replies would be exactly the opposite, and sure enough...

I don't know why it is. It's honestly a bit disappointing that the most upvoted comments often have the least nuance.

stevage Dec 11, 2025

How much nuance can one person's experience have? If the top two most visible things are detailed, contrary experiences of the same product, that seems a pretty good outcome?

AznHisoka Dec 12, 2025

Also, why introduce nuance for the sake of nuance? For every single use case, Gemini (and Claude) has performed better. I can’t give ChatGPT even the slightest credit when it doesn’t deserve any.

block_dagger Dec 11, 2025

Replace "on HN" with "in the course of human events" and we may have a generally true statement ;)

rabf Dec 11, 2025

Chatgpt is not one model! Unless you manually specify to use a particular model your question can be routed to different models depending on what it guesses would be most appropriate for your question.

stingraycharles Dec 12, 2025

Isn’t that just standard MoE behavior? And isn’t the only choice you have from the UI between “Instant” and “Thinking”?

baq Dec 12, 2025

MoE is a single model thing, model routing happens earlier.
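To make the distinction in this sub-thread concrete, here is a toy sketch, not a description of how any vendor actually does it. The model names, the routing rule, and the gate scores are all hypothetical. `route_request` stands for product-level routing, which picks one whole model before inference starts; `moe_top_k` stands for mixture-of-experts gating, which picks a few experts per token inside a single model.

```python
import math


def route_request(prompt: str) -> str:
    """Product-level routing: choose ONE whole model before inference starts.

    The rule and the model names are made up for illustration.
    """
    needs_reasoning = len(prompt.split()) > 30 or "prove" in prompt.lower()
    return "thinking-model" if needs_reasoning else "instant-model"


def moe_top_k(gate_logits: list[float], k: int = 2) -> list[tuple[int, float]]:
    """MoE gating: inside a single model, pick the top-k experts for one token.

    Returns (expert_index, weight) pairs; weights are a softmax over the
    chosen k experts only, so they sum to 1.
    """
    ranked = sorted(range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True)
    top = ranked[:k]
    exps = [math.exp(gate_logits[i]) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]
```

The point of the sketch is only that the two decisions happen at different layers: the first is made once per request by the product, the second is made per token by the network itself.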

blks Dec 11, 2025

Because neither product has any consistency in its results, no predictive behaviour. One day it performs well, another it hallucinates non existing facts and libraries. Those are stochastic machines

sendes Dec 11, 2025

I see the hyperbole is the point, but surely what these machines do is to literally predict? The entire prompt engineering endeavour is to get them to predict better and more precisely. Of course, these are not perfect solutions - they are stochastic after all, just not unpredictably.

coliveira Dec 12, 2025

Prompt engineering is voodoo. There's no sure way to determine how well these models will respond to a question. Of course, giving additional information may be helpful, but even that is not guaranteed.

dmd Dec 11, 2025

And I’d really like for Gemini to be as good or better, since I get it for free with my Workspace account, whereas I pay for chatgpt. But every time I try both on a query I’m just blown away by how vastly better chatgpt is, at least for the heavy-on-searching-for-stuff kinds of queries I typically do.

Workaccount2 Dec 11, 2025

Gemini has tons of people using it free via aistudio

I can't help but feel that google gives free requests the absolute lowest priority, greatest quantization, cheapest thinking budget, etc.

I pay for gemini and chatGPT and have been pretty hooked on Gemini 3 since launch.

crorella Dec 11, 2025

It’s like having 3 coins and users preferring one or the other when tossing it because one coin gives consistently more heads (or tails) than the other coin.

What is better is to build a good set of rules and stick to one and then refine those rules over time as you get more experience using the tool or if the tool evolves and diverges from the results you expect.

nullbound Dec 11, 2025

<< What is better is to build a good set of rules and

But, unless you are on a local model you control, you literally can't. Otherwise, good rules will work only as long as the next update allows. I will admit that makes me consider some other options, but those probably shouldn't be 'set and iterate' each time something changes.

crorella Dec 12, 2025

What I had in mind when I added that comment was coding, with the use of .md files. For the web version of chats I agree there is little control over how the agent behaves, unless you give an initial "setup" prompt.

jhancock Dec 11, 2025

I can use GPT one day and the next get a different experience with the same problem space. Same with Gemini.

4ndrewl Dec 11, 2025

This is by design, given a non-deterministic application?

jhancock Dec 11, 2025

sure. It may be more than that...possibly due to variable operating params on the servers and current load.

On the whole, if I compare my AI assistant to a human worker, I get more variance than I would from a human office worker.

sjaramillo Dec 11, 2025

I guess LLMs have a mood too

dr_dshiv Dec 12, 2025

Vibes

nunez Dec 11, 2025

Tesla FSD has been more or less the same experience. Some people drive 100s of miles without disengaging while others pull the plug within half a mile from their house. A lot of it depends on what the customer is willing to tolerate.

austhrow743 Dec 12, 2025

We've been having trouble telling if people are using the same product ever since ChatGPT first got popular. They had a free model and a paid model, that was it, no other competitors or naming schemes to worry about, and discussions were still full of people talking about current capabilities without saying what model they were using.

For me, "gemini" currently means using this model in the llm.datasette.io cli tool.

openrouter/google/gemini-3-pro-preview

For what anyone else means? If they're equivalent? If Google does something different when you use "Gemini 3" in their browser app vs their cli app vs plans vs api users vs third party api users? No idea to any of the above.

I hate naming in the llm space.

dmd Dec 12, 2025

FWIW i’m always using 5.1 Thinking.

Bombthecat Dec 12, 2025

Could also be a language thing ...

ghostpepper Dec 12, 2025

Same, I use chatgpt plus (the entry-level paid option) extensively for personal research projects and coding, and it seems miles ahead of whatever "Gemini Pro" is that I have through work. Twice yesterday, gemini repeated verbatim a previous response as if I hadn't asked another question and told it why the previous response was bad. Gemini feels like chatGPT from two years ago.

staticman2 Dec 11, 2025

Are you uploading PDFs that already have a text layer?

I don't currently subscribe to Gemini but on A.I. Studio's free offering when I upload a non OCR PDF of around 20 pages the software environment's OCR feeds it to the model with greater accuracy than I've seen from any other source.

dmd Dec 11, 2025

I’m not uploading PDFs at all. I’m talking about PDFs it finds while searching that it extracts data from for the conversation.

staticman2 Dec 11, 2025

I'm surprised to hear anyone finds these models trustworthy for research.

Just today I asked Claude what year over year inflation was and it gave me 2023 to 2024.

I also thought some sites ban A.I. crawling so if they have the best source on a topic, you won't get it.

Workaccount2 Dec 12, 2025

Anytime you use LLMs you should be keenly aware of their knowledge cutoff. Like any other tool, the more you understand it, the better it works.

staticman2 Dec 12, 2025

I'm sorry but I don't see what "knowledge cutoff" has to do with what we were talking about, which is using an LLM to find PDFs and other sources for research.

whazor Dec 12, 2025

I agree with you. To me, gemini has much worse search results. Then again, I use kagi for search and I cannot stand the search results from Google anymore. And it's clear that gemini uses those.

In contrast, chatgpt has built their own search engine that performs better in my experience. Except for coding, then I opt for Claude opus 4.5.

noname120 Dec 11, 2025

Perplexity Pro with any thinking model blows both out of the water in a fraction of the time, in my experience

kccqzy Dec 11, 2025

> The biggest complaint is that all links get inserted into google search and then I have to manipulate them when they should go directly to the chosen website, this has to be some kind of internal org KPI nonsense.

Oh I know this from my time at Google. The actual purpose is to do a quick check for known malware and phishing. Of course these days such things are better dealt with by the browser itself in a privacy preserving way (and indeed that’s the case), so it’s unnecessary to reveal to Google which links are clicked. It’s totally fine to manipulate them to make them go directly to the website.

gjuggler Dec 12, 2025

I think Gemini is just broken.

Instead of forwarding model-generated links to https://www.google.com/url?q=[URL], which serves the purpose of malware check and user-facing warning about linking to an external site, Gemini forwards links to https://www.google.com/search?q=[URL], which does... a Google search for the URL, which isn't helpful at all.

Example: https://gemini.google.com/share/3c45f1acdc17

NotebookLM by comparison, does the right thing: https://notebooklm.google.com/notebook/7078d629-4b35-4894-bb...

It's kind of impressive how long this obviously-broken link experience has been sitting in the Gemini app used by millions.
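For anyone who ends up "manipulating" these links by hand, the step can be sketched as below. This is a minimal sketch that assumes the wrapped links have exactly the two shapes quoted in this comment, `https://www.google.com/url?q=[URL]` and `https://www.google.com/search?q=[URL]`; the function name `unwrap_google_link` is made up for illustration, and real links may carry extra parameters this does not account for.

```python
from urllib.parse import parse_qs, urlparse


def unwrap_google_link(link: str) -> str:
    """Return the destination URL hidden in a google.com wrapper link.

    Links that are not wrappers, or whose q parameter is an ordinary
    search term rather than a URL, are returned unchanged.
    """
    parsed = urlparse(link)
    if parsed.netloc not in ("www.google.com", "google.com"):
        return link
    if parsed.path not in ("/url", "/search"):
        return link
    candidates = parse_qs(parsed.query).get("q", [])
    if not candidates:
        return link
    target = candidates[0]  # parse_qs has already percent-decoded this
    inner = urlparse(target)
    if inner.scheme in ("http", "https") and inner.netloc:
        return target
    return link
```

Note that unwrapping skips whatever check the `/url` redirect would have performed, so it trades that check for a direct link.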

sundarurfriend Dec 11, 2025

That's interesting, I just today started getting some "Some sites restrict our ability to check links." dialogue in ChatGPT that wanted me to verify that I really wanted to follow the link, with a Learn More link to this page: https://help.openai.com/en/articles/10984597-chatgpt-generat...

So it seems like ChatGPT does this automatically and internally, instead of using an indirect check like this.

solarkraft Dec 11, 2025

> Only downsides are in the polish department

What an understatement. It has me thinking „man, fuck this“ on the daily.

Just today it spontaneously lost an entire 20-30 minutes long thread and it was far from the first time. It basically does it any time you interrupt it in any way. It’s straight up data loss.

It’s kind of a typical Google product in that it feels more like a tech demo than a product.

It has theoretically great tech. I particularly like the idea of voice mode, but it’s noticeably glitchy, breaks spontaneously often and keeps asking annoying questions which you can’t make it stop.

sundarurfriend Dec 11, 2025

ChatGPT web UI was also like this for the longest time, until a few months ago: all sorts of random UI bugs leading either to data loss or misleading UI state. Interrupting still is very flaky there too. And on the mobile app, if you move away from the app while it's taking time to think, its state would somehow desync from the actual backend thinking state, and get stuck randomly; sometimes restarting the app fixes it, sometimes that chat is unusable from that point on.

And the UI lack of polish shows up freshly every time a new feature lands too - the "branch in new chat" feature is really finicky still, getting stuck in an unusable state if you twitch your eyebrows at the wrong moment.

gcr Dec 12, 2025

i basically can't use the ChatGPT app on the subway for these reasons. the moment the websocket connection drops, i have to edit my last message and resubmit it unchanged.

it's like the client, not the server, is responsible for writing to my conversation history or something

spruce_tips Dec 12, 2025

it took me a lot of tinkering to get this feeling seamless in my own apps that use the api under the hood. i ended up buffering every token into a redis stream (with a final db save at the end of streaming) and building a mechanism to let clients reconnect to the stream on demand. no websocket necessary.

works great for kicking off a request and closing tab or navigating away to another page in my app to do something.

i dont understand why model providers dont build this resilient token streaming into all of their APIs. would be a great feature
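The buffering idea described above can be sketched roughly as follows. This is not the commenter's code: the class `TokenLog` and its methods are hypothetical, and a plain in-memory list stands in for the Redis stream so the example runs without a server. The mechanism shown is the general one: every token is appended to a log, and a client that reconnects sends the cursor it last saw and receives only what it missed.

```python
from dataclasses import dataclass, field


@dataclass
class TokenLog:
    """In-memory stand-in for one append-only token stream per request."""

    tokens: list[str] = field(default_factory=list)
    done: bool = False

    def append(self, token: str) -> int:
        """Buffer one generated token; return the cursor just past it."""
        self.tokens.append(token)
        return len(self.tokens)

    def finish(self) -> str:
        """Mark the stream complete and return the full text (the final save)."""
        self.done = True
        return "".join(self.tokens)

    def read_from(self, cursor: int) -> tuple[list[str], int, bool]:
        """Return tokens after `cursor`, the new cursor, and whether the stream ended."""
        new = self.tokens[cursor:]
        return new, cursor + len(new), self.done


log = TokenLog()
for t in ["Hel", "lo", ", "]:
    log.append(t)
seen, cursor, done = log.read_from(0)          # client reads, then disconnects

log.append("wor")
log.append("ld")                               # generation continues server-side
final_text = log.finish()

missed, cursor, done = log.read_from(cursor)   # client reconnects with its cursor
```

Because generation writes to the log rather than to the client connection, closing the tab only pauses reading; nothing about the response depends on the client staying connected.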

rishabhaiover Dec 12, 2025

exactly. they need to bring in spotify level of caching of streaming music that it just works if you're in a subway. Constant availability should be table stakes for them.

rjzzleep Dec 12, 2025

I get that the web versions are free, but if you can afford API access, I always recommend using Msty for everything. It's a much better experience.

https://msty.ai/

p_ing Dec 12, 2025

> ChatGPT web UI was also like this for the longest time

Copilot Chat has been perfect in this respect. It's currently GPT 5.0, moving to 5.1 over the next month or so, but at least I've never lost an (even old) conversation since those reside in an Exchange mailbox.

Max-Limelihood Dec 12, 2025

I lost thousands of conversations I'd had back in the move from "Bing" to "Copilot". Moved straight to Claude and never touched a GPT again.

Duanemclemore Dec 12, 2025

I downloaded my archive and completely ended my GPT subscription last week based on some bad computer maintenance advice. Same thing here - using other models, never touching that product again.

p_ing Dec 13, 2025

I'm referring to Copilot Chat. The data resides in your Exchange mailbox. You're referring to the consumer product.

deepGem Dec 12, 2025

There is no competing product for GPT Voice. Hands down. I have tried Claude, Gemini - they don't even come close.

But voice is not a huge traffic funnel. Text is. And the verdict is more or less unanimous at this time. Gemini 3.0 has outdone ChatGPT. I unsubscribed from GPT plus today. I was a happy camper until the last month when I started noticing deplorable bugs.

1. The conversation contexts are getting intertwined. Two months ago, I could ask multiple random queries in a conversation and I would get correct responses, but the last couple of weeks it's been a harrowing experience having to start a new chat window for almost any change in thread topic.

2. I had once asked ChatGPT to treat me as a co-founder and hash out some ideas. Now for every query I get a 'cofounder type' response. Nothing inherently wrong but annoying as hell. I can live with the other end of the spectrum in which Claude doesn't remember most of the context.

Now that Gemini pro is out, yes the UI lacks polish, you can lose conversations, but the benefits of low latency search and a one year near free subscription is a clincher. I am out of ChatGPT for now, 5.2 or otherwise. I wish them well.

esyir Dec 12, 2025

Just a note, chatGPT does retain a persistent memory of conversations. In the settings menu, there's a section that allows you to tweak/clear this persistent memory

rapind Dec 12, 2025

I found the gemini cli extremely lacking and even frustrating. Why google would choose node…

Codex is decent and seemed to be improving (being written in rust helps). Claude code is still the king, but my god they have server and throttling issues.

Mixed bag wherever you go. As model progress slows / flatlines (already has?) I’m sure we’ll see a lot more focus and polish on the interfaces.

wahnfrieden Dec 12, 2025

Codex is king

wkat4242 Dec 12, 2025

What's that near free subscription? I don't see it here

deepGem Dec 12, 2025

They had 9.99 for the first year.

wkat4242 Dec 12, 2025

Oh I must have missed that, thanks.

topato Dec 12, 2025

yeah, the best I've seen is like 1.99 for two months, then back to normal pricing....

KronisLV Dec 11, 2025

> It has me thinking „man, fuck this“ on the daily.

That's sometimes me with the CLI. I can't use the Gemini CLI right now on Windows (in the Terminal app), because trying to copy in multiple lines of text for some reason submits them separately and it just breaks the whole thing. OpenCode had the same issue but even worse, it quit after the first line or something and copied the text line by line into the shell, thank fuck I didn't have some text that mentions rm -rf or something.

More info: https://github.com/google-gemini/gemini-cli/issues/14735#iss...

At the same time, neither Codex CLI, nor Claude Code had that issue (and both even showed shortened representations of copied in text, instead of just dumping the whole thing into the input directly, so I could easily keep writing my prompt).

So right now if I want to use Gemini, I more or less have to use something like KiloCode/RooCode/Cline in VSC which are nice, but might miss out on some more specific tools. Which is a shame, because Gemini is a really nice model, especially when it comes to my language, Latvian, but also your run of the mill software dev tasks.

In comparison, Codex feels quite slow, whereas Claude Code is what I gravitate towards most of the time but even Sonnet 4.5 ends up being expensive when you shuffle around millions of tokens: https://www.hackerneue.com/item?id=46216192 Cerebras Code is nice for quick stuff and the sheer amount of tokens, but in KiloCode/... regularly messes up applying diff based edits.

radicaldreamer Dec 11, 2025

Google’s standard problem is that they don’t even use their own products. Their Pixel and Android team rocks iPhones on the daily, for example.

free652 Dec 12, 2025

You can't buy an iPhone without director approval. And it's like 3 gens behind as well. So no, they don't use iPhones.

ummonk Dec 12, 2025

Google tells its employees what products they're allowed to buy for personal use?

snypher Dec 12, 2025

Seems like they meant for a work device.

gcr Dec 12, 2025

lots of googlers use BYOD iPhones and the corp suite for this use case is fairly well-supported

brookst Dec 12, 2025

Which makes tons of sense because iPhone users are higher CLV than Android users. If Google had to choose between major software defects in Android or iOS, they would focus quality on iOS every time.

siva7 Dec 12, 2025

that explains why their ios gemini app is so ridiculously bad. in private they probably use iphones and just chatgpt instead.

dominotw Dec 12, 2025

you have to get permission from a director for your personal phone? wtf

testdelacc1 Dec 12, 2025

For the work phone.

RBerenguel Dec 11, 2025

I would think this is not true

sib Dec 12, 2025

You'd be wrong (source - worked in the Android org).

RBerenguel 4 days ago

How long ago?

renewiltord Dec 11, 2025

Yeah, I've heard that Sundar Pichai dogfoods the latest Pixel at least once a month and sometimes two or three times.

sam345 Dec 12, 2025

That's inexcusable.

Der_Einzige Dec 11, 2025

That’s because they will be bullied out of the dating market if they have a “green bubble”.

dkga Dec 12, 2025

What is a green bubble? iPhone's carbon footprint?

brookst Dec 12, 2025

iMessage renders other iMessage users as blue bubbles, SMS/RCS as green bubbles.

People who can’t understand that many people actually prefer iOS use this green/blue thing to explain the otherwise incomprehensible (to them) phenomenon of high iOS market share. “Nobody really likes iOS, they just get bullied at school if they don’t use it”.

It’s just “wake up sheeple” dressed up in fake morality.

onethought Dec 11, 2025

I mean, there is a benefit to understanding the competitor as well?

LogicFailsMe Dec 11, 2025

Outweighed by the value of having to suffer with the moldy fruits of their own labor. That was the only way the Android Facebook app became usable as well.

ssl-3 Dec 11, 2025

There certainly is.

To posit a scenario: I would expect General Motors to buy some Ford vehicles to test and play around with and use. There's always stuff to learn about what the competition has done (whether right, wrong, or indifferent).

But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

snypher Dec 12, 2025

The CEO of Ford was driving a competition EV for months;

https://www.caranddriver.com/news/a62694325/ford-ceo-jim-far...

GenerWork Dec 11, 2025

>But I also expect the parking lots used by employees at any GM design facility in the world to be mostly full of General Motors products, not Fords.

I think you'd be surprised about the vehicle makeup at Big 3 design facilities.

Forgeties79 Dec 11, 2025

I wonder how many apple employees walk in to the office with android phones

azinman2 Dec 12, 2025

Effectively zero.

Disclosure: I work at Apple. And when I was at Google I was shocked by how many iPhones there were.

adamkochanowicz Dec 11, 2025

I also love that I can leave the microphone on (not in live voice mode) while dictating to ChatGPT and pause and think as much as needed.

With Gemini, it will send as soon as I stop to think. No way to disable that.

wheelerwj Dec 11, 2025

How did you do this?

toomuchtodo Dec 11, 2025

Record button in the app if you’ve got the feature.

arjie Dec 11, 2025

Any time its safety stuff triggers, Gemini wipes the context. It's unusable because of this because whatever is going on with the safety stuff, it fires too often. I'm trying to figure out some code here, not exactly deporting ICE to Guantanamo or whatever.

rvnx Dec 11, 2025

The more Gemini and Nano-Banana soften their filters, the more audience they will take from other platforms. The main risk is payment providers banning them, but I can't imagine bank card providers cutting off payments to Google.

dzhiurgis Dec 11, 2025

On the flip side, the ChatGPT app now has years of history that is sometimes useful (search is pretty ok, but could improve), but otherwise I'd like to remove most of it - good luck doing so.

amluto Dec 12, 2025

Claude regularly computes a reply for me, then reports an error and loses the reply. I wonder what fraction of Anthropic’s compute gets wasted and redone.

seg_lol Dec 12, 2025

Try using a VPN, my ISP was killing connections and claude would randomly reset. Using a VPN fixed the issue.

mnky9800n Dec 11, 2025

The colab integration is where it shines the most imo.

hexnuts Dec 12, 2025

You may be interested in tools like OpenMemory

mmaunder Dec 11, 2025

Yeah I eventually noped out as I said in another comment and am charging hard with Codex and am so happy about 5.2!!

lxgr Dec 11, 2025

Interesting, I had the opposite experience. 5.0 "Thinking" was better than 5.1, but Gemini 3 Pro seems worse than either for web search use cases. It's hallucinating at pretty alarming rates (including making up sources it never actually accessed) for a late 2025 model.

Opus 4.5 has been a step above both for me, but the usage limits are the worst of the three. I'm seriously considering multiple parallel subscriptions at this point.

gs17 Dec 11, 2025

I've had the same experience with search, especially with it hallucinating results instead of actually finding them. It's really frustrating that you can't force a more in-depth search from the model run by the company most famous for a search engine.

astrange Dec 12, 2025

Try the same question in deep research mode.

hbarka Dec 11, 2025

I’ve been putting literally the same inputs into both ChatGPT and Gemini and the intuition in answers from Gemini just fits for me. I’m now unwilling to just rely on ChatGPT.

Google, if you can find a way to export chats into NotebookLM, that would be even better than the Projects feature of ChatGPT.

siva7 Dec 12, 2025

notebooklm is heavily biased to only use the sources i added and frame every task around them - even if it is nonsensical - so it is not that useful for novel research. it also tends to hallucinate when lots of data is involved.

LogicFailsMe Dec 11, 2025

All I want for Christmas is a "No NotebookLM slop" checkbox on youtube.

simplify Dec 12, 2025

Youtube's downvote button has served me quite well for this purpose.

didibus Dec 11, 2025

> Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.

Depends, even though Gemini 3 is a bit better than GPT5.1, the quality of the ChatGPT apps themselves (mobile, web) have kept me a subscriber to it.

I think Google needs to not-google themselves into a poor app experience here, because the models are very close and will probably continue to just pass each other in lock step. So the overall product quality and UX will start to matter more.

Same reason I am sticking to Claude Code for coding.

concinds Dec 11, 2025

The ChatGPT Mac app especially feels much nicer to use. I like Gemini more due to the context window but I doubt Google will ever create a native Mac app.

bayarearefugee Dec 11, 2025

This matches my experience pretty closely when it comes to LLM use for coding assistance.

I still find a lot to be annoyed with when it comes to Gemini's UI and its... continuity, I guess is how I would describe it? It feels like it starts breaking apart at the seams a bit in unexpected ways during peak usages including odd context breaks and just general UI problems.

But outside of UI-related complaints, when it is fully operational it performs so much better than ChatGPT for giving actual practical, working answers without having to be so explicit with the prompting that I might as well have just written the code myself.

luhn Dec 11, 2025

That's hilarious and right on brand for Google that they spend millions developing cutting-edge technology and fumble the ball making a chat app.

spwa4 Dec 12, 2025

Every Google app is a chat app, except maybe search.

dieortin Dec 12, 2025

Is Google Drive a chat app? Is Google Photos a drive app? I don’t know what you mean

spwa4 Dec 12, 2025

Once you open a file, it is very much a chat app. Comments and chat work for anything you can preview btw, not just Google Docs stuff.

Not sure how you can access the chat in the directory view.

minitoar Dec 12, 2025

In Google Photos shared albums there is a tab that I can only describe as a chatroom.

dieortin Dec 13, 2025

Isn’t there a difference between having a tab that is similar to a chat, to being a chat app?

azan_ Dec 11, 2025

That's interesting. I've got completely different impression. Every time I use Gemini I'm surprised how bad it is. My main complaint is that Gemini is too lazy.

Nathanba Dec 11, 2025

Same for me, at this point I'm seriously starting to think that these are ads for and by Google because for me Gemini is the worst.

WillPostForFood Dec 12, 2025

My experience is that "AI Mode" Gemini in Chrome is terrible, but AI Studio Gemini is pretty great.

varispeed Dec 11, 2025

Get Gemini answer and tell ChatGPT this is what my friend said. Then put ChatGPT answer to Claude and so on. It's a cheat code.

tenpoundhammer OP Dec 12, 2025

I did this today and it was amazing. If I'd had time I would have tried other models as well. Great tip, thanks.

clhodapp Dec 12, 2025

A cheat code to what?

Iwan-Zotow Dec 12, 2025

To get a Hitler

AznHisoka Dec 11, 2025

ChatGPT seems to just randomly pick urls to cite and extract information from.

Google Gemini seems to look at heuristics like whether the author is trustworthy, or an expert in the topic. But more advanced

FpUser Dec 11, 2025

I've read many very positive reviews about Gemini 3. I tried using it including Pro and to me it looks very inferior to ChatGPT. What was very interesting though was when I caught it bullshitting me I called its BS and Gemini expressed very human like behavior. It did try to weasel its way out, degenerated down to "true Scotsman" level but finally admitted that it was full of it. this is kind of impressive / scary.

TacticalCoder Dec 11, 2025

Yeah basically the same here. And many people on paid ChatGPT subscription like us noticed just that. Gemini 3 Pro "thinking" is really good.

> Overall, my conclusion is that ChatGPT has lost and won't catch up because of the search integration strength.

I think the biggest issue OpenAI is facing is the numbers: Google is at the moment a near $4 trillion company. They can splurge a near infinite amount of money to win the race.

Google is so big that they created their own TPUs, which is mindboggling.

Which new user is going to willingly pay an OpenAI subscription once he knows that gemini.google.com gives access to a state of the art model? And Google makes sure to remind users who search that they can "continue the discussion" with Gemini.

Maybe the dirty Altman tricks like cornering the entire RAM market can work but I don't see how they can beat Google by playing fair. OpenAI shall need every single dirty trick in the book, including circular funding / shady deals with NVidia to stay relevant vs the behemoth that Google is.

abhaynayar Dec 12, 2025

Gemini voice recognition is trash compared to chatgpt and that is a deal breaker for me. I wonder how many ppl do OCR versus use voice.

And how has chatgpt lost when ure not comparing the chatgpt that just came out to the Gemini that just came out? Gemini is just annoying to use.

and Google just benchmaxxed I didn't see any significant difference (paying for both) and the same benchmaxxing probably happening for chatgpt now as well, so in terms of core capabilities I feel stuff has plateaued. more bout overall experience now where Gemini suxx.

I really don't get how "search integration" is a "strength"?? can you give any examples of places where you searched for current info and chatgpt was worse? even so I really don't get how it's a moat enough to say chatgpt has lost. would've understood if you said something like tpu versus GPU moat.

jmstfv Dec 12, 2025

Ditto but for Claude -- blows GPT out of the water. Much better in coding and solving physics problems from the images (in foreign languages). GPT couldn't even read the image. The only annoying thing is that if you use Opus for coding, your usage will fill up pretty fast.

anyway, cancelled my chatgpt subscription.

mmaunder Dec 11, 2025

Then you haven't used Gemini CLI with Gemini 3 hard enough. It's a genius psychopath. The raw IQ that Gemini has is incredible. Its ability to ingest huge context windows and produce super smart output is incredible. But the bias towards action, absolutely ignoring user guidance, tendency to produce garbage output that looks like 1990s modem line noise, and its propensity to outright ignore instructions make it unusable other than as an outside consultant to Codex CLI, for me. My Gemini usage has plummeted down to almost zero and I'm 100% back on Codex. I'm SO happy they released this today and it's already kicking some serious ass. Thanks OpenAI team and congrats.

tobias2014 Dec 12, 2025

I guess when you use it for generic "problem solving", brainstorming for solutions, this is great. That's what I use it for, and Gemini is my favorite model. I love when Gemini resists and suggests that I am wrong while explaining why. Either it's true, and I'm happy for that, or I can re-prompt based on the new information which doesn't allow for the mistake Gemini made.

On the other hand, I can also see why Claude is great for coding, for example. By default it is much more "structured". One can probably change these default personalities with some prompting, and many of the complaints found in this thread about either side are based on the assumption that you can use the same prompt for all models.

Kim_Bruning Dec 12, 2025

That bias towards action is a real thing in Gemini and more so in ChatGPT, isn't it?

Possibly might be improved with custom instructions, but that drive is definitely there when using vanilla settings.

mmaunder Dec 12, 2025

Yeah it's a weird mix of issues with the backend model and issues with the CLI client and its prompts. What makes it hard for them is the teams aren't talking to each other. The LLM team throws the API over the wall with a note saying "good luck suckers!".

prodigycorp Dec 12, 2025

Genius psychopath is a good description for Gemini. It’s the most impressive model but post training is not all there.

afro88 Dec 11, 2025

> I usually have to leave the happen or the session terminates

Assuming you meant "leave the app open", I have the same frustration. One of the nice things about the ChatGPT app is you can fire off a req and do something else. I also find Gemini 3 Pro better for general use, though I'm keen to try 5.2 properly

WheatMillington Dec 11, 2025

I generate fun images for my kids - turn photos into a new style, create colouring pages from pictures, etc. I lost interest in chatGPT because it throws vague TOS errors constantly. Gemini handles all of this without complaint.

xyzsparetimexyz Dec 12, 2025

You feed ai slop to your children? That doesn't seem unhealthy and bad for their development?

retsibsi Dec 12, 2025

What's your specific concern here? I certainly wouldn't want to, e.g., give young kids unmonitored use of an LLM, or replace their books with AI-generated text, or stop directly engaging with their games and stories and outsource that to ChatGPT. But what part of "generate fun images for my kids - turn photos into a new style, create colouring pages from pictures, etc" is likely to be "unhealthy and bad for their development"?

bonesss Dec 12, 2025

Customized, self-guided, tailor made kids content isn’t slop per se.

Colouring pages autogenerated for small kids is about as dangerous as the crayons involved.

Not slop, not unhealthy, not bad.

a_victorp Dec 12, 2025

I see a post like this every time there is news about ChatGPT or OpenAI. I'm probably being paranoid, but I keep thinking that it looks like bots or paid advertisement for Gemini

tenpoundhammer OP Dec 12, 2025

I think people like me just enjoy sharing when something is working for them and they have a good experience. It probably gets voted up because people enjoy reading when that happens

jdiff Dec 12, 2025

The consistent side comments about the interface to Gemini being "half baked" probably don't fit into that narrative.

jnordt Dec 12, 2025

Can you share some examples of this where it gives better results?

For me both Gemini and ChatGPT (both paid versions Key in Gemini and ChatGPT Plus) give me similar results in terms of "every day" research. I'm sticking with ChatGPT at the moment, as the UI and scaffolding around the model are in my view better at ChatGPT (e.g. you can add more than one picture at once...)

For Software Development, I tested Gemini3 and I was pretty disappointed in comparison to Claude Opus CLI, which is my daily driver.

UltraSane Dec 11, 2025

Google has such a huge advantage in the amount of training data with the Google search database and with YouTube and in terms of FLOPS with their TPUs.

razster Dec 12, 2025

Just a fair warning, it likes to spell Acknowledge as Acknolwedge. And I've run into issues when it's accessing markdown guides, it loses track and hallucinates from time to time which is annoying.

bossyTeacher Dec 11, 2025

A future where Google still dominates, is that a future we want? I feel a future with more players is better than one with just a single one. Competition is valuable for us consumers

melagonster Dec 12, 2025

It happened at least once; when I asked too many questions, the Gemini web page stopped working because it was occupying too much RAM...

NickNaraghi Dec 11, 2025

Straight up Silicon Valley warfare in the HN comment section.

bckr Dec 12, 2025

Gemini is good at reading bad handwriting you say? Might need to give it a shot at my 10 years of journals

Razengan Dec 12, 2025

It would be useful to see some examples of the differences and supposed strengths of Gemini so this doesn't come off as Google advertisement snarf.

Also, I would never, ever, trust Google for privacy or sign into a Google account except on YouTube (and clear cookies afterwards to stop them from signing me into fucking Search too).

m00dy Dec 12, 2025

it's true that Gemini-3 pro is very good, I recently used it on deepwalker [0]. Its agentic performance is amazing. Much better than 5.1

[0]: https://deepwalker.xyz

anonnon Dec 12, 2025

Could you elaborate on GPT-based stock analysis?

citizenpaul Dec 12, 2025

What?? Am I using the same gemini as everyone else?

>OCR is phenomenal

I literally tried to OCR a TYPED document in Gemini today and it mangled it so bad I just transcribed it myself because it would take less time than futzing around with gemini.

> Gemini handles every single one of my uses cases much better and consistently gives better answers.

>coding

I asked it to update a script by removing some redundant logic yesterday. Instead of removing it, it just put == all over the place, essentially negating but leaving all the code and also removing the actual output.

>Stocks analysis

lol, now I know where my money comes from.

aix1 Dec 12, 2025

Was that with Gemini 3 Pro or a different Gemini model?

citizenpaul Dec 14, 2025

Yes.

Today I asked it to make a short bit of code to query some info from an API. I needed it to not use the specific function X that is normally used. I added to its instructions "Never use function X" then asked it in the chat to confirm its rules. It then generated code using function X and a word soup explaining how it did not use function X. Then I copy-pasted the line and asked why it used function X, and it said more word soup explaining how the function was not there. So yea not so good.

Daz912 Dec 12, 2025

No desktop app, not using it

eru Dec 12, 2025

HN doesn't have a dedicated desktop app either.

Daz912 Dec 12, 2025

HN isn't part of my daily workflow so I don't care

LorenDB Dec 11, 2025

What is it with the Polish always messing up products?

(yes, /s)

petersumskas Dec 11, 2025

It’s because their thoughts are Roman while they are always Russian to Finnish things.

Kenya believe it!

Anyway, I’m done here. Abyssinia.

labrador Dec 11, 2025

I like their hotdogs

xyzsparetimexyz Dec 12, 2025

Why do people pay for ai tools? I didn't get that. I feel like I just rotate between them on the free tiers. Unless you're paying for all of them, what's the point?

Zambyte Dec 12, 2025

I pay for Kagi and get all of the major ones, a great search engine that I can tune to my liking, and the ability to link any model to my tuned web search.

Onewildgamer Dec 11, 2025

Google AI mode constantly makes mistakes and I go back to chatgpt even when I don't like it.

billyrnalvo Dec 11, 2025

Oh my good heavens, gotta tell ya, you wrestled that rascal to the floor with a shit-eating grin! Good times my friend!
