…hail Hydra?
All week I’ve been sending these to my coworker, currently hip deep in AI propaganda, at the Microsoft Power Platform conference in Vegas.
No problem! I can just kill the process in the…shit.
I’ve never noticed this. Mostly because I basically never close Task Manager. Because programs hanging is common enough that it’s actually useful to have Task Manager open on a separate screen.
On an unrelated note, I must set myself a reminder for tomorrow to give installing Linux another go…
From what I saw about this bug, it’s not just that it duplicates the window when closed. You had to have typed “X” into it first and then closing it would cause the duplication behavior.
Well that is unacceptable behavior
They’re ruining LinkedIn and GitHub as well.
No way… Are you serious?
I’m not sure it was possible to ruin something like LinkedIn …
Vibe coding or no, this is a massive QA failure.
That is the reality.
The problem isn’t “vibe coding” (anyone who has ever managed early career staff will be able to attest that… the bar is REAL fucking low). The problem is a complete lack of testing or any sort of “investment” in caring if production breaks.
A lot of it is general apathy induced by… gestures around. But it very much goes beyond just the obnoxious rise in brain drains over “vibe coding”. Personally speaking, I am THIS fucking close to driving over to my company’s head of IT’s house and burning it down with him in it (For legal purposes, this is a joke) as that entire team continues to think “We’ll just wait until people tell us what is broken” is at all fucking acceptable.
But pretty much any SDLC is going to be built around code review. And code review is how you handle developers of different skill and sanity levels. Whether they are old hats who have been in the basement since before you were born, youngins who can’t stop talking about Rust, or chatbots.
And automatic test systems. It’s not that hard.
At least 1 out of every 5 comments I see from coderabbitai leads me down a rabbit hole checking whether the suggestion is correct. Validating these suggestions can waste so much time, only to find out they’re complete BS.
Unfortunately a lot of people are trying to outsource code review to LLMs as well. Also, LLM generated code is more likely to have subtle errors that a human would be very unlikely to make in otherwise mundane code. Errors that are easy to gloss over if you don’t take a magnifying glass to it. My current least favorite thing is LLM generated unit tests that don’t actually test what they say they do.
Yeah I hate when I read through a unit test and realize it doesn’t actually test anything other than itself.
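For anyone who hasn’t run into one of these, here is a minimal sketch of what a test that only tests itself can look like. Everything in it is made up for illustration (`apply_discount`, the numbers, the deliberate bug); it isn’t taken from any real codebase.

```python
from unittest import mock


def apply_discount(price, percent):
    """Hypothetical function under test, with a deliberate bug: it adds the discount."""
    return price + price * percent / 100


def hollow_test():
    # The real function is never called. The mock returns the expected value,
    # so this check passes no matter what apply_discount actually does.
    fake_discount = mock.Mock(return_value=90.0)
    return fake_discount(100.0, 10) == 90.0


def real_test():
    # Calls the actual code, so the deliberate bug above makes this check fail.
    return apply_discount(100.0, 10) == 90.0
```

The hollow version stays green forever, which is exactly why it slips through a rubber-stamp review.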
Shit code review is not code review. If you just rubber stamp everything or outsource it to someone who will, you aren’t doing code review.
Aside from that:
LLM generated code is more likely to have subtle errors that a human would be very unlikely to make in otherwise mundane code.
Citation requested
My current least favorite thing is LLM generated unit tests that don’t actually test what they say they do.
If I had a nickel for every single time I had to explain to someone that their unit test doesn’t do anything or that they literally just copied the output and checked against it (and that they are dealing with floating points so that is actually really stupid)… I’d probably go buy some Five Guys for lunch.
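A rough sketch of the floating point part, for anyone wondering why checking against a copied value is a bad idea. The `total` helper is hypothetical; the only real API used is `math.isclose` from the standard library, and the tolerance is an arbitrary pick.

```python
import math


def total(values):
    """Hypothetical function under test: sums a list of floats."""
    return sum(values)


def exact_match():
    # Compares against the "obvious" decimal answer with ==.
    # Binary floats cannot represent 0.1 or 0.2 exactly, so this is False.
    return total([0.1, 0.2]) == 0.3


def tolerant_match():
    # Compares within a relative tolerance instead of demanding bit equality.
    return math.isclose(total([0.1, 0.2]), 0.3, rel_tol=1e-9)
```

Pasting the exact printed output into the assertion makes the first version pass, but then the test only pins down one machine’s rounding, not the behavior you care about.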
It’s like saying that the problem is that you are using robots to assemble cybertrucks rather than people. The problem isn’t who is super glueing sharp jagged metal together. The problem is that your product is fundamentally shite and should never have reached production in the first place. And you need to REALLY work through your design flows and so forth.
Citation requested
I keep seeing it over and over again. Anyone that actually has to deal with coworkers using this bullshit that isn’t also in the cult is going to recognize it.
If I had a nickel for every singl yada yada yada
Sure, there have always been better and worse developers. LLMs are making developers that used to be better, worse.
Bad developers just do whatever. It doesn’t matter if they wrote the code themselves or if a tool wrote it for them. They aren’t going to be more or less detail oriented whether it is an LLM, a doxygen plugin, or their own fingers that made the code.
Which is the problem when people make claims like that. It is nonsense and anyone who has ACTUALLY worked with early career staff can tell you… those kids aren’t writing much better code than chatgpt and there is a reason so many of them have embraced it.
But it also fundamentally changes the conversation. It stops being “We should heavily limit the use of generative AI in coding because it prevents people from developing the skills they need to evaluate code” and instead becomes “We need generative AI to be better”.
It was the exact same thing with “AI can’t draw hands”. Everyone and their mother insisted on that. Most people never thought about why basically all cartoons are four fingered hands and so forth. So, when the “studio ghibli filter” was made? It took off like hotcakes because “Now AI can do hands!” and there was no thought towards the actual implications of generative AI.
Nothing outside of the first paragraph here is terribly meaningful, and the first paragraph is just trying to talk past what I said before. I’ll reiterate, very clearly.
I have observed several of my coworkers who used to be really good at their jobs get worse at their jobs (and make me spend more time ensuring code quality) since they started using LLM tools. That’s it. That’s all I care about. Maybe they’ll get better. Maybe they won’t. But right now I’d strongly prefer people not use them, because people using them has made my experience worse.
The problem isn’t who is super glueing sharp jagged metal together.
I know it’s not related, but I’m curious about this part.
I know it has an aluminum based frame which should inhibit its use to haul heavy loads, but what else?
Well, who’d ever think of testing that closing the app works. Some things you have to take for granted or you’ll never test anything meaningful.
If you are a company the size of Microsoft, you have more than enough resources to test absolutely everything.
They used to have very comprehensive automated testing processes to exercise all sorts of things. Unfortunately, like many tech companies these days (Apple, Google, etc.), they’re punting on QA as a concept because they just don’t care. What are you going to do, go use another oligopoly platform?
Well, there may have been a period when MS was trying to improve product quality, and in that time, yes maybe they did have very comprehensive automated testing processes. But before that, up to the time of Windows 7 I guess, their quality was dog shit.
In the early days, MS was an undisputed monopoly though, and not only did they not test thoroughly, they hardly even tried to fix bugs - the userbase had to take care of that too. Earlier versions of Windows had all sorts of workarounds and 3rd party tools to try and get things to work properly.
I suspect that once they’d achieved their objective of improving quality, there just weren’t the incentives there any more for middle management to allocate resources to things like comprehensive tests.
You will never have resources to “test absolutely everything”. It is ALWAYS about building out personas and deriving tests from those.
What this tells us is that one of two things happened:
- This was not tested at all
- The testing harness resets the environment after every check (e.g. “does process close when killed”) rather than involving a manual reset (i.e. “close and re-open task manager”)
The latter is a lot more common than you would think since it makes it much easier to automate these harnesses rather than having a human at a VM. But… this is what happens when you don’t step through the entire workflow.
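To make the second option concrete, here is a toy sketch. `FakeApp` and both checks are invented for illustration and say nothing about how Microsoft’s harness actually works; the simulated bug is simply “close leaves the process behind”.

```python
class FakeApp:
    """Toy stand-in for an app whose close path leaks a process (simulated bug)."""

    def __init__(self):
        self.processes = 0

    def open(self):
        self.processes += 1

    def close(self):
        # Simulated bug: the window goes away but the process count never drops.
        pass


def isolated_check():
    # Harness style 1: fresh environment per check, so leftovers are never seen.
    app = FakeApp()
    app.open()
    return app.processes == 1


def workflow_check(cycles=3):
    # Harness style 2: reuse one environment and walk the open/close workflow.
    app = FakeApp()
    for _ in range(cycles):
        app.open()
        app.close()
    return app.processes == 0
```

The isolated check passes every time because each run starts from a clean slate. Only the workflow check, which opens and closes repeatedly in the same environment, notices the leftovers.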
Didn’t Microsoft fire their dedicated human testing team in about 2014?
Probably?
This is the kind of thing that a LOT of companies outsource. Mostly for ill.
Latest news is that Xbox Games Division had been required to aim for a 30% profit margin for the last several years.
That’s why everything sucks and blew up.
That’s a fucking insane baseline target.
Line for next quarter profit must go up, therefore, cut costs.
This is obviously self-destructive in the long run, but that doesn’t matter, what matters is C Suite’s golden parachutes.
Isn’t that the final step in the testing plan for every app though? The first step is always opening it.
A bug like this means literally nobody tested it at all on this build, or was so apathetic they didn’t file an obvious issue.
For people to test, you need management that is willing to invest in QA. But that incentive disappears for a corporation when there’s no free market of competitors who can poach your customers by making a better quality product or service.
The person who fiddles with the always on top option in this monstrosity of a compositor maybe?
You think the QA was done by a person? I’ll bet this was Vibe QA’d.
The public, aka Microsoft QA Team, found the bug. It’s a QA success!

Last issue I had with Windows was that I couldn’t update because my EFI partition was too small (the partition is created automatically on install). Why do I never get the funny ones?

You can’t add AI chatbot to task manager without breaking a few eggs.
Oh no, Windows 10 won’t get updates from their vibe coding era!
It’s being done on purpose. Planned obsolescence should be made illegal.
What does this have to do with planned obsolescence?
What would they gain by doing this?
Forcing a Windows upgrade.
To another version full of errors due to the same faults? This is the sort of thing that has people looking for alternatives
You mean a feature update?
I was recently pulling my hair out because a recent update caused a bug in the Windows recovery media creator that prevented the Windows RE from recognizing USB input devices. The few weeks between the faulty update and the fix just happened to coincide with me needing to use it.
WTF are they doing messing with something as basic as the Recovery Environment? And further, in a way that breaks its utility entirely?
The version of Windows 10 on my Lenovo laptop can’t see wifi. I don’t like factory resetting that machine (actually, I can’t anymore, since I wiped the recovery partition when I moved it to Linux). My Framework laptop came with no OS, so it was immune to bad factory OS version problems.
Marketing discovered AI can code (terribly).
Captive audience on the contracts side, so they can do whatever they want as crappy as they want and the contracts still generate revenue.
You wanted a usable product? Stay away from Big Tech these days.
And no regression tests that caught it
They probably don’t even test
They vibed their way out of the tests
They ask Copilot if the code works
Will windows run ok? Yes? Good.












