- I worked in finance on the other side of the pond - developers wanted to constantly bring in and use new services but also didn't want any of the responsibility or the work needed to make compliance happy (or even in that particular company shoulder the costs). When me and other folks where brought in it to fix the "cloud strategy" it was a complete shitshow and heads actually rolled when we wrote a tool to assign costs to applications. But we had to start almost from scratch and limit usable services as we developed strategies and blueprints for each...
The complete, unapologetic desire of devs and security teams (but also many infra teams) to not have any kind of ownership was horrifying to me.
In the end there's not a single solution or strategy, it really goes back to the organization and where your weaknesses and strength are as an org. If you have a gazillion consultants following the "best practice" of the day and exceptions on top of exceptions you are dead, devops or otherwise. You will still make billions if you are the right company though regardless of your software practices, so...
- They should be funded by the companies using them. Do you believe any of the fortune top100 would be greatly impacted by funding libxml2? They probably all rely on it, one way or the other.
The foundation of the internet is something that gets bigger and bigger every year. I understand the sentiment and the reasoning of declaring software a "public good", but it won't scale.
- Isn't this intuitively true?
Building a nuclear power plant incurs in a massive set up stage with a lot of unknowns unknowns and requiring impressive material engineering and QC.
Solar is much more "incremental", you can almost start producing electricity and recouping costs immediately.
But a nuclear reactor is an extremely dense power generator compared to a solar panel plant by orders of magnitude. I'm not really sure why are they compared this way.
- Suddenly? That's the level of quality that is standard in all software projects I've ever seen since I've started working in IT.
Enshittification is all around us and is unstoppable. Because we have deadlines to hit and goals to shows we reached to the VP. We broke everything and the software is just half working? Come on that's an issue for the support and ops teams. On to the next beautiful feature we can put on marketing slides!
- I have plenty of hard disagreements on the "user experience improvements" in Linux. "Adding a skin" is not easy and making the experience somewhat coherent is extremely hard (GNOME is sort of successful at an extreme cost and plenty of limitations, KDE is still an incoherent mess with plenty of bad defaults starting from the base CDDM skin). It's full of things like the missing icon view in the GNOME/GTK file chooser [1] and while it's true that Windows11 is atrocious, all those little things add up.
I actually recovered a laptop my family was using to launch firefox by installing linux on it (soldered ram went bad, linux is the only OS I could use to tell it to skip the bad blocks through kernel command line) but I hold no illusion about its level of "user experience". Just look at the comments in this recent thread [2]. And as a power user I am baffled by some of the choices at the kernel level (which I mentioned in that thread) and others closer to the user by distros (ubuntu and snaps, name an iconic duo), or things like flatpak not being close to ready and still shoved down user's throats...
I spent years when I was younger submitting bug reports for the papercuts I noticed - some ignored for years, some closed and forgotten forever when some project decided to move on from bugzilla - and I have no more time or energy to continue doing so. The maintainers after all write the code, I'm just a user and get no voice :)
I've been reading about the "year of linux" for years now, it's a meme for a reason. People that are not "prosumer" will keep using the preinstalled OS even if it's garbage - assuming they buy a laptop or desktop at all - and the prosumer will probably keep an OSX or a Windows machine close by anyway. Linux is usable as a browser kiosk sure but there is still plenty of friction on everything else. Enshittification will continue, and possibly infect also linux.
- The last rewrite I've seen completed (which was justified to a point as the previous system had some massive issues) took 3 years and burned down practically an entire org (multiple people left, some were managed out including two leads, the director was ejected after 18ish months) which was healthy-ish and productive before the rewrite. It's still causing operational pain and does not fully cover all edge cases.
I'm seeing another now in $current_job and I'm seeing similar symptoms (though the system being rewritten is far less important) and customers of the old system essentially abandoned to themselves and marketing and sales are scrambling to try to retain them.
Anecdotal experience is not so good. Rewriting a tiny component? Ok. Full on rewrite of a big system? I feel it's a bad idea and the wisdom holds true.
- This is what home automation was supposed to be. You shouldn't be looking at it, it should just help you silently and reduce the amount of things you have to care about.
Turning lights off, closing shutters when it goes dark, handling temperature and CO2 concentrations, etc.
I feel people have a need to look at dashboards, have screens, etc (maybe it's some sort of sympathetic reaction about looking at dashboards all day at work?) instead of letting go. Dashboards should be looked at if something is wrong and automation is failing.
- As a non-blind user, the title expresses my feelings too. And I feel like it's getting worse over time, not better.
From little things to kernel lockdown breaking hibernate on a fully encrypted system just because you should be happy to get your laptop battery killed by s2idle or disable secure boot. Yay, security.
I can only imagine the pain of all the accessibility issues on top of what I experience.
- IIRC eBPF and DTrace are (no longer) solving a similar problem, eBPF has become far bigger than just tracing, it's now a way to have user space code "driving" kernel decisions. I'm not sure they can be compared this way - and even if we do, the user base of DTrace is infinitesimally smaller of the one of eBPF.
- > Every minute you spend doing on-call work is time that you can't spend on the things you've actually been assigned to do.
In my experience at least if you're oncall during a sprint you would have less work assigned to you than otherwise (2 week sprint and 1 week you are oncall? 50% allocation) as the expectation is that week you will spend responding to alerts, or investigating issues, or even improving alerting and dashboard and fixing bugs. If this does not happen, devs don't push for it and management is completely blind to it you have an organization issue. If leadership does not care about the problem it's time to jump ship ASAP.
But I've seen people stubbornly defending an alert on >60% CPU usage of their 1 CPU allocated kubernetes pods where there was no impact in p99.9 latency (which was measured and was the actual metric that mattered as agreed with the rest of the business and internal customers of the service). Or alerting on each single pod restart. That is self inflicted pain.
- I guess what you are saying is the problem is the company culture - from a technical operations point of view at least - sucks. An no one wants or can put the effort into fixing it.
I see normally in oncall threads people complaining about "I got paged by an alerts because of another system X" - but in at least in a big enough organization this should not happen and it's an organizational failure. There should be an operations center on 24h/24h able to triage, escalate and evaluate, possibly not staffed only with L1 techs and given enough freedom to actually improve and automate. I know there are places where that is not true, and I ran away screaming from some in my career once I understood tech leadership had no understanding why it was needed.
But you would be surprised how much of the oncall pain is actually self inflicted by application teams themselves (some examples I encountered in the last year: TCP connect timeouts in the minutes and with no retries, no retry policies in general and things that should be idempotent that are not, no circuit breaker strategies, connection pools churning as they're shared between 10+ remote endpoints, wrong expectations about transaction isolation levels and how to handle conflicts at least in simple scenarios).
- You can somewhat reliably evaluate what works and what does not, what brings you forward and what is slack you can cut. It takes time (6-12 months at least) of very engaged work of a very engaged team - which you can bring with you.
You can go the hatchet way - I am strongly unconvinced it is indicative of anything resembling good management, mind - but most people and companies cannot rely on banks or investment firms loaning them 40 billion dollars and accepting passively a mark down of their mone~ to 1/4 of the value they loaned down the line. CEOs are ousted by investment firms for a far smaller drop in value all the time.
- > But it's not the case that EU nations and Canada sent their soldiers to die in Iraq
They did, not all of them but many did. On Canada I may be wrong, sure. I believe even Ukraine has KIAs in Iraq.
> France in particular sided with Russia to block the Security Council from authorizing military action, leading to substantial tensions with the US and widespread disapproval from the public on both sides. European demonstrations against the war remain one of the largest mass movements in history.
The Iraq and Afghanistan wars broke the model the US and EU had been trying to push until that moment, alienating the south of the world from it and providing certain countries with a justification for their future actions. France had the right of it in the UN assembly.
People were angry back then for similar reasons they're angry and shocked now, and once again it has to do with expectations.
I also don't believe the Iraq war alone is not really enough to deny the alignment between EU and US foreign policy in the last 30 years or so anyway. You won't have complete agreement with 30 nations involved ever.
> But the North Atlantic Treaty simply does not contain a promise to align foreign policy in this way.
I think this in your original comment highlights your surprise at what those people believe, or at least your not understanding it?
- There was effectively uniform foreign policy between the US and its allies for the last thirty years, even under the first Trump presidency, and this included at least a certain degree of interventionism (first Iraq war, Yugoslavia...) which solidified international institutions (differently from the second Iraq war and Afghanistan, which weakened them).
Even if they didn't agree, EU nations and Canada at least sent their soldiers to die in Iraq and Afghanistan anyway.
Why are you surprised people expect such policy alignment after thirty years of it?
Why are you surprised people consider this a betrayal of what NATO stood for in the past, as a proxy of the democracies of the west? Just because there is no violation of the letter of the treaty?
- The NATO treaty doesn't imply in his wording any obligation for a military reaction to an invasion of a member of NATO. There's no penalty to just respond with a strongly worded letter, but there's an expectation an ally will react militarly.
Will your allies trust you any longer if you just follow the letter of the treaty? I don't think they will. More critically, nor will anyone else.
The US have historically positioned themselves as "defenders of democracy" and have multiple times used that positioning actively. It's inevitable for an expectation to be there for them to do just that. The US is free to violate expectations and just follow the letter of the treaties it has, it is a sovereign nation after all, but the surprised and frankly childish "we have no obligation!" reaction to the blowback is more unreasonable than the expectations for its support of Ukraine, particularly in how it has been handled politically.
- > All wallets could become worthless in a weekend if a government makes the wrong stride in quantum research..
No need for the wrong stride in quantum research. A government can make it illegal and seize the exchanges and watch actual value go to a round 0 in seconds.
A nation's government has a vested interest in preserving the value and viability of its own currency, not Bitcoin's.
- When coding, you trust the CPU to do correct math. The kernel to properly allocate memory. The network stack to send packets and pass you only the ones with valid checksum.
Much of our world operates on trust (and eventually verify), same as any application does. It's exceedingly costly to do otherwise.
- We will never know, but I wonder if it could be a power/signaling or VRM issue - the CPU non getting hot doesn't mean something else on the board has gone out of spec and into catastrophic failure.
Motherboard issues around power/signaling are a pain to diagnose, they will emerge as all sort of problems apparently related to other components (ram failing to initialize and random restarts are very common in my experience) and you end up swapping everything before actually replacing the MB...
- In my last job we ran centralized clusters for all teams. They got X namespaces for their applications, and we made sure they could connect to the databases (handled by another team, though there were discussion of moving them onto dedicated clusters). We had basic configuration setup for them and offered "internal consultants" to help them onboard. We handled maintenance, upgrades and if needed migrations between clusters.
We did not have a cluster just for a single application (with some exceptions because those applications were incredibly massive in pod numbers) and/or had patterns that required custom handling and pre-emptive autoscaling (which we wrote code for!).
Why are so many companies running a cluster for each application? That's madness.
- You arent' forced to use service mesh and complex secrets management schemes. If you add them to the cluster is because you value what they offer you. It's the same thing as kubernetes itself - I'm not sure what people are complaining about, if you don't need what kubernetes offers, just don't use it.
Go back to good ol' corsync/pacemaker clusters with XML and custom scripts to migrate IPs and set up firewall rules (and if you have someone writing them for you, why don't you have people managing your k8s clusters?).
Or buy something from a cloud provider that "just works" and eventually go down in flames with their indian call centers doing their best but with limited access to engineering to understand why service X is misbehaving for you and trashing your customer's data. It's trade-offs all the way.
- The k/v store offers primitives to make that happen, but for non-critical controllers you don't want to deal with things like that they can go down and will be restarted (locally by kubelet/containerd) or rescheduled. Whatever resource they monitor will just not be touched until they get restarted.
- I am fairly simplifying, but you are expected to know your direct dependencies (and normally wil), pagers have embedded escalation rules with prinaries and secondaries, etc. The tooling once you know what to do is better than anything outside of FAANGs I've seen in terms of integration and reliability.
Escalation teams are usually reserved for the "oh fuck" situations, like "I don't work on this site but I found it broken" or "hey I think we are going to lose soon this availability zone" or "I am panicking and have no idea how to manage this incident, please help me".
They're a glue mechanism to prevent silos and paralysis during an event, usually pretty good engineers too.
- This has been done forever. Ops team had cronjobs to restart misbehaving applications out of business hours since before I started working. In a previous job, the solution for disks being full on a VM on-prem (no, not databases) was an automatic reimage. I've seen scheduled index rebuilds on Oracle. The list goes on.
- You should look where the economy is growing and where the salaries are growing. It's not uniform at all.
The entire situation (as an EU country citizen who moved to another EU country) and the narratives around it are funny to me because they're the same as the ones going around for years in my birth country.
"Side X should learn they should get better candidates, otherwise people are not going to show up" way of thinking included, which has only led to further decline as the "conservatives" win and make the situation worse taking more and more seats and control in state controlled companies while at the same time pushing their own companies to absorb more and more of the budget. Yeah, not showing up because you did not like the candidate was a great success - if you wanted the decline to accelerate, that is.
Well, good luck US friends, to you and us all.
You can use suspend+hibernate to accomplish that and it works well. Unless the gods of kernel lockdown decide you cannot for your own good (and it doesn't matter if your disk is fully encrypted, you're not worthy anyway) of course. It's their kernel running on your laptop after all.