- I had Gemini ingest our huge AWS CloudFormation repo. I had it describe each infrastructure component, how it relates to the others, the creation hierarchy, and IAM.
I got a nice, comprehensive infrastructure requirements document out of this.
Now I am using it to create a Terraform repo, deploying it via OpenTofu, and comparing the result to my existing AWS CloudFormation setup. This part is still a WIP.
- Yes, the cost of building software dropped by 90%.
However, the cost of software maintenance went up by 1000%. Let's hope you never need to add a new business rule or user interface to your vibe-coded software.
- I am curious: could GenAI have written the paper "Attention Is All You Need"? We were trapped in CNN and RNN architectures for a while. Could GenAI have arrived at a better architecture?
- BS. I grew up in Delhi. We used to have a large open space where the neighborhood kids and I played cricket. Eventually the whole area was converted to slums with people from Bangladesh. They took over the whole area. I was too young to care about ethnicity, but the loss of my cricket field still bothers me. My neighbor was a bank manager, and he once said that government politicians forced him to give "loans" to Bangladeshi people, with no documents and only their thumbprint, before elections to ensure victory for the ruling party.
- When I ask for patterns, I am seeking help with recurring problems that I have encountered. One example is context management: small LLMs (ones with a small context size) break, get confused, and forget the work they have done or the original goal.
- How? Please don't say "use the langxxx library".
I am looking for a language- and library-agnostic pattern, like MVC for web applications, or the Gang of Four patterns, but for building agents.
- Fine. Say your country has several years of drought and bad harvests. What happens then? Do you trade then?
Or let's say that, due to weather, your farmers cannot grow enough oranges or some other fruit, which drives up local prices. Should only the richest people in your country get to eat fruit?
Or you discover lithium deposits that your national industry cannot use. Should you let them just sit there, knowing they could make your province prosperous if traded?
- I am here to hear from folks running LLMs on the Framework Desktop (128GB). Is it usable for agentic coding?
- >> Have you measured your latency, and how sensitive are you to it?
Not sensitive to latency at all. My users would rather have well-researched answers than poor answers.
Also, I use batch-mode APIs for chunking; it is so much cheaper.
- Thanks for sharing. TIL about rerankers.
Chunking strategy is a big issue. I got acceptable results by shoving large texts into Gemini Flash and having it summarize and extract chunks, instead of using whatever text splitter I tried. I use the method published by Anthropic, https://www.anthropic.com/engineering/contextual-retrieval, i.e. I include the full summary along with the chunk for each embedding.
I also created a tool that lets the LLM do vector search on its own.
I do not use LangChain or Python. I use Clojure plus the LLMs' REST APIs.
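A minimal sketch of the "summary plus chunk" step described above, in Python for illustration only (the setup in this comment is Clojure). The names `ContextualChunk` and `build_contextual_chunks` and the exact prefix format are assumptions, not part of Anthropic's write-up or any library. The summary and chunks are assumed to come from an earlier LLM call, and `embed_input` is the string that would be sent to whatever embedding endpoint is in use.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ContextualChunk:
    doc_id: str
    index: int        # position of the chunk in the original chunk list
    text: str         # the chunk on its own, returned to the user or LLM at query time
    embed_input: str  # summary + chunk, the string that actually gets embedded


def build_contextual_chunks(doc_id: str, summary: str, chunks: List[str]) -> List[ContextualChunk]:
    """Prefix every non-empty chunk with the document summary before embedding."""
    summary = summary.strip()
    result = []
    for index, chunk in enumerate(chunks):
        chunk = chunk.strip()
        if not chunk:
            # Skip empty chunks; they would add noise to the index.
            continue
        # Hypothetical prefix format; any consistent layout would do.
        embed_input = f"Document summary: {summary}\n\nChunk: {chunk}"
        result.append(ContextualChunk(doc_id, index, chunk, embed_input))
    return result
```

Only `embed_input` is embedded; `text` is kept alongside the vector, so a search hit returns the chunk without repeating the summary.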
- You mean multi-cloud strategy! You wanna know how you got here?
See, the sales team from Google flew one executive out to the NBA Finals, the Azure sales team flew another executive out to the NFL Super Bowl, and the AWS team flew yet another executive out to the Wimbledon finals. And that's how you end up with a multi-cloud strategy.
- >> The CCP literally revoked the visas of key DeepSeek engineers. That's all we need to know.
I don't follow. Why would DeepSeek engineers need a visa from the CCP?
- I have a branch office in the boondocks with a limited internet connection. The branch office cannot manage an RDBMS or access cloud services. They can use a SQLite app on the LAN, and we could do reconciliation at the end of the business day.
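A rough sketch of what that end-of-day reconciliation could look like, assuming a single SQLite file at the branch and a `synced` flag on each row. The `txn` table, its columns, and the function names are all hypothetical; how the exported batch reaches head office (file copy, upload when the link is up) is left out.

```python
import sqlite3


def init_db(conn: sqlite3.Connection) -> None:
    """Create the hypothetical branch transaction table if it is missing."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS txn (
               id INTEGER PRIMARY KEY AUTOINCREMENT,
               account TEXT NOT NULL,
               amount_cents INTEGER NOT NULL,
               synced INTEGER NOT NULL DEFAULT 0
           )"""
    )
    conn.commit()


def record_txn(conn: sqlite3.Connection, account: str, amount_cents: int) -> None:
    """Record a transaction locally during the business day."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO txn (account, amount_cents) VALUES (?, ?)",
            (account, amount_cents),
        )


def export_unsynced(conn: sqlite3.Connection) -> list:
    """Return every row that has not yet been sent to head office."""
    cur = conn.execute(
        "SELECT id, account, amount_cents FROM txn WHERE synced = 0 ORDER BY id"
    )
    return cur.fetchall()


def mark_synced(conn: sqlite3.Connection, ids: list) -> None:
    """Flag rows as reconciled once head office has confirmed receipt."""
    with conn:
        conn.executemany(
            "UPDATE txn SET synced = 1 WHERE id = ?", [(i,) for i in ids]
        )
```

Rows are marked as synced only after the export has been confirmed, so a failed transfer is simply retried the next day with the same rows.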
- https://github.com/microsoft/graphrag
This is not agentic, but it gave pretty good results when I did a PoC.
- Most developers can't do much work without an IDE and Chrome + Google.
Would you say that their work has no value?
- This is exactly my experience. We wanted to modernize a Java codebase by removing Java JNDI global variables. This is a simple though tedious task. We tried Claude Code and Gemini. The results from both were hilarious.
- I would also love to see how the model performs on management-type presentations. I am referring to Gartner, McKinsey, BCG, etc. presentations.
- Not always.
You could paste your entire repo into Gemini and it could map your forest and also identify the "trees", assuming your codebase is smaller than Gemini's context window.
Sometimes it makes sense to upload one package's code into Gemini and have it summarize and identify the key ideas and functions, then repeat this for every package in the repository, then combine the results. It sounds tedious, but it is a rather small Python program that does this for me.
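The per-package approach above can be sketched roughly as follows. This is not the actual program mentioned in the comment: `collect_package_sources` and `summarize_repo` are made-up names, each directory is treated as a "package", and `summarize` is a placeholder for whatever function sends a prompt to Gemini (or any other model) and returns its text reply.

```python
from pathlib import Path
from typing import Callable, Dict, Tuple


def collect_package_sources(repo_root: str, suffixes=(".py",)) -> Dict[str, str]:
    """Concatenate source files per directory, treating each directory as a package."""
    root = Path(repo_root)
    packages: Dict[str, list] = {}
    for path in sorted(root.rglob("*")):
        if path.is_file() and path.suffix in suffixes:
            package = path.parent.relative_to(root).as_posix()
            source = path.read_text(encoding="utf-8", errors="replace")
            packages.setdefault(package, []).append(f"# file: {path.name}\n{source}")
    return {package: "\n\n".join(parts) for package, parts in packages.items()}


def summarize_repo(
    repo_root: str,
    summarize: Callable[[str], str],  # assumption: prompt in, model reply out
    suffixes=(".py",),
) -> Tuple[Dict[str, str], str]:
    """Summarize each package on its own, then combine the summaries into one overview."""
    per_package = {}
    for package, source in collect_package_sources(repo_root, suffixes).items():
        prompt = (
            f"Summarize the key ideas and functions of package '{package}':\n\n{source}"
        )
        per_package[package] = summarize(prompt)

    combined = "\n\n".join(
        f"## {package}\n{summary}" for package, summary in sorted(per_package.items())
    )
    overview = summarize(
        "Combine these package summaries into one overview of the repository:\n\n"
        + combined
    )
    return per_package, overview
```

Each call to `summarize` sees only one package (or only the summaries), so every request stays well under the context window even when the whole repository would not fit.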
- This price drop is nice, but I wonder how long it will last. Their prices used to be very low, then they almost doubled, and now they have dropped again.
- >>After all, after nazi-salute, mecha-hitler, etc blew up, by just how much did the demand for Teslas fall?
I had been saving up for a Tesla but now I am looking elsewhere. I think a lot of people are doing the same here in Canada. You can grok the actual numbers if you want.
- Not the above poster. I would like to run agents with local multimodal LLMs to process huge numbers of sensitive files for an org (summarization, knowledge extraction, answering user questions, etc.). Any ideas?
- Who is paying for those tokens? I had a long conversation with "Devin" and I must have burnt up a large number of tokens. In any case, thank you "Devin"
- Gemini, ChatGPT, and GitHub Copilot subscriptions also provide similar functionality.
- The next cheapest capable model is GLM 4.5, at $0.6 per million tokens in and $2.2 per million tokens out. Glad to see DeepSeek is still the value king.
But I am still disappointed with the price increase.
- I am not interested in the results of the 95%.
I want to know more about the 5% who got it right. What are their use cases?
- Does anyone else miss o3?
I swear I had an understanding of how to get deep analytical thinking out of o3. I am absolutely struggling to get the same results with GPT-5. The new model feels different and frustrating to use.
- My apologies.
This is a better repo for learning about Claude Code internals:
https://github.com/ghuntley/claude-code-source-code-deobfusc...
- I have been following along with the code in this repo: https://github.com/ghuntley/claude-code-source-code-deobfusc...
The author has done a pretty good job of reverse engineering Claude Code and explaining the architecture.
update: changed the link to a better repo
My last use case was like this: I had an old codebase that used backbone.js with jQuery, plus a bunch of old JS with little documentation, to generate the UI for a Clojure web application.
Gemini was able to unravel this hairball of code and guide me step by step to htmx. I am not using AI Studio; I am using a Gemini subscription.
Since I patch the code manually, it's like pair programming with an incredibly patient and smart programmer.
For the record, I am too old for vibe coding. I like to maintain total control over my code and all the abstractions and logic.