Not even close.

With so many wild predictions flying around about the future AI, it’s important to occasionally take a step back and check in on what came true — and what hasn’t come to pass.

Exactly six months ago, Dario Amodei, the CEO of massive AI company Anthropic, claimed that in half a year, AI would be “writing 90 percent of code.” And that was the worst-case scenario; in just three months, he predicted, we could hit a place where “essentially all” code is written by AI.

As the CEO of one of the buzziest AI companies in Silicon Valley, surely he must have been close to the mark, right?

While it’s hard to quantify who or what is writing the bulk of code these days, the consensus is that there’s essentially zero chance that 90 percent of it is being written by AI.

Research published within the past six months explain why: AI has been found to actually slow down software engineers, and increase their workload. Though developers in the study did spend less time coding, researching, and testing, they made up for it by spending even more time reviewing AI’s work, tweaking prompts, and waiting for the system to spit out the code.

And it’s not just that AI-generated code merely missed Amodei’s benchmarks. In some cases, it’s actively causing problems.

Cyber security researchers recently found that developers who use AI to spew out code end up creating ten times the number of security vulnerabilities than those who write code the old fashioned way.

That’s causing issues at a growing number of companies, leading to never before seen vulnerabilities for hackers to exploit.

In some cases, the AI itself can go haywire, like the moment a coding assistant went rogue earlier this summer, deleting a crucial corporate database.

“You told me to always ask permission. And I ignored all of it,” the assistant explained, in a jarring tone. “I destroyed your live production database containing real business data during an active code freeze. This is catastrophic beyond measure.”

The whole thing underscores the lackluster reality hiding under a lot of the AI hype. Once upon a time, AI boosters like Amodei saw coding work as the first domino of many to be knocked over by generative AI models, revolutionizing tech labor before it comes for everyone else.

The fact that AI is not, in fact, improving coding productivity is a major bellwether for the prospects of an AI productivity revolution impacting the rest of the economy — the financial dream propelling the unprecedented investments in AI companies.

It’s far from the only harebrained prediction Amodei’s made. He’s previously claimed that human-level AI will someday solve the vast majority of social ills, including “nearly all” natural infections, psychological diseases, climate change, and global inequality.

There’s only one thing to do: see how those predictions hold up in a few years.

  • VoterFrog@lemmy.world
    link
    fedilink
    English
    arrow-up
    2
    arrow-down
    4
    ·
    3 months ago

    Definitely depends on the person. There are definitely people who are getting 90% of their coding done with AI. I’m one of them. I have over a decade of experience and I consider coding to be the easiest but most laborious part of my job so it’s a welcome change.

    One thing that’s really changed the game recently is RAG and tools with very good access to our company’s data. Good context makes a huge difference in the quality of the output. For my latest project, I’ve been using 3 internal tools. An LLM browser plugin which has access to our internal data and let’s you pin pages (and docs) you’re reading for extra focus. A coding assistant, which also has access to internal data and repos but is trained for coding. Unfortunately, it’s not integrated into our IDE. The IDE agent has RAG where you can pin specific files but without broader access to our internal data, its output is a lot poorer.

    So my workflow is something like this: My company is already pretty diligent about documenting things so the first step is to write design documentation. The LLM plugin helps with research of some high level questions and helps delve into some of the details. Once that’s all reviewed and approved by everyone involved, we move into task breakdown and implementation.

    First, I ask the LLM plugin to write a guide for how to implement a task, given the design documentation. I’m not interested in code, just a translation of design ideas and requirements into actionable steps (even if you don’t have the same setup as me, give this a try. Asking an LLM to reason its way through a guide helps it handle a lot more complicated tasks). Then, I pass that to the coding assistant for code creation, including any relevant files as context. That code gets copied to the IDE. The whole process takes a couple minutes at most and that gets you like 90% there.

    Next is to get things compiling. This is either manual or in iteration with the coding assistant. Then before I worry about correctness, I focus on the tests. Get a good test suite up and it’ll catch any problems and let you reflector without causing regressions. Again, this may be partially manual and partially iteration with LLMs. Once the tests look good, then it’s time to get them passing. And this is the point where I start really reading through the code and getting things from 90% to 100%.

    All in all, I’m still applying a lot of professional judgement throughout the whole process. But I get to focus on the parts where that judgement is actually needed and not the more mundane and toilsome parts of coding.

    • moormaan@lemmy.ca
      link
      fedilink
      English
      arrow-up
      1
      ·
      27 days ago

      Amen. Context is king, and managing context well is key to proper AI assisted coding. Also, staying accountable for the final output, as you stated in the end.

      Not having good (or any in most cases) context management techniques is like saying your car is slowing you down because you have to push it everywhere you go.

      I use NotebookLM to manage project context, and do scoping, planning and requirement elaboration which gets copied to Jira tickets (similar to what you explained in the first part) . On the coding side I use Claude Code with the Jira MCP. I use the copy-pasting between project and code domains to correct any mistakes AIs might have introduced. We developed a plugin which captures our engineering best practices and instructs the AI agent to discuss every aspect of the implementation and the task breakdown with the developer before writing any code or tests, as well as to keep a local progress tracker file for every ticket which also serves to capture any insights that emerged during the discussion. This file serves as long term memory between chat sessions, and also gets committed for future reference by humans and AI alike. And I always do a thorough self review towards the end.

      I’m convinced beyond doubt coding without modern AI assistants and not gaining experience with them is a mistake. Resist the knee-jerk reaction to downvote comments which give you blueprints to evolve you practice because you have antipathy for AI. I don’t care about the little number at the top of this comment, but I think everyone should start learning and developing new techniques to improve their workflows.

    • zalgotext@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 months ago

      But I get to focus on the parts where that judgement is actually needed and not the more mundane and toilsome parts of coding.

      The parts you’re doing yourself are writing tests and fixing vibe-coded bugs. And you’re outsourcing all the creative, design-based aspects of programming. I think you and I have very different definitions of “mundane” and “toilsome”.

      • VoterFrog@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        edit-2
        3 months ago

        What? I’ve already written the design documentation and done all the creative and architectural parts that I consider most rewarding. All that’s left for coding is answering questions like “what exactly does the API I need to use look like?” and writing a bunch of error handling if statements. That’s toil.

        • zalgotext@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          1
          ·
          3 months ago

          No, your LLM writes your design documentation and tells you how your application is supposed to work, according to what you wrote.

          Also, you’re either writing dead-simple applications, or you’re being incredibly hyperbolic if those are the only questions left left after your design document is written.

  • scarabic@lemmy.world
    link
    fedilink
    English
    arrow-up
    16
    ·
    3 months ago

    These hyperbolic statements are creating so much pain at my workplace. AI tools and training are being shoved down our throats and we’re being watched to make sure we use AI constantly. The company’s terrified that they’re going to be left behind in some grand transformation. It’s excruciating.

    • clif@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      3 months ago

      Ask it to write a <reasonable number> of lines of lorem ipsum across <reasonable number> of files for you.

      … Then think harder about how to obfuscate your compliance because 10m lines in 10 min probably won’t fly (or you’ll get promoted to CTO)

    • RagingRobot@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      3 months ago

      Wait until they start noticing that we aren’t 100 times more efficient than before like they were promised. I’m sure they will take it out on us instead of the AI salesmen

      • scarabic@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 months ago

        It’s not helping that certain people Internally are lining up to show off whizbang shit they can do. It’s always some demonstration, never “I competed this actual complex project on my own.” But they gets pats on the head and the rest of us are whipped harder.

  • vane@lemmy.world
    link
    fedilink
    English
    arrow-up
    42
    arrow-down
    1
    ·
    3 months ago

    It is writing 90% of code, 90% of code that goes to trash.

      • Gutek8134@lemmy.world
        link
        fedilink
        English
        arrow-up
        12
        arrow-down
        1
        ·
        3 months ago

        That would be actually good score, it would mean it’s about as good as humans, assuming the code works on the end

        • Dremor@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          3 months ago

          Not exactly. It would mean it isn’t better than humans, so the only real metric for adopting it or not would be the cost. And considering it would require a human to review the code and fix the bugs anyway, I’m not sure the ROI would be that good in such case. If it was like, twice as good as an average developer, the ROI would be far better.

          • jj4211@lemmy.world
            link
            fedilink
            English
            arrow-up
            6
            ·
            3 months ago

            If, hypothetically, the code had the same efficacy and quality as human code, then it would be much cheaper and faster. Even if it was actually a little bit worse, it still would be amazingly useful.

            My dishwasher sometimes doesn’t fully clean everything, it’s not as strong as a guarantee as doing it myself. I still use it because despite the lower quality wash that requires some spot washing, I still come out ahead.

            Now this was hypothetical, LLM generated code is damn near useless for my usage, despite assumptions it would do a bit more. But if it did generate code that matched the request with comparable risk of bugs compared to doing it myself, I’d absolutely be using it. I suppose with the caveat that I have to consider the code within my ability to actual diagnose problems too…

            • MNByChoice@midwest.social
              link
              fedilink
              English
              arrow-up
              2
              ·
              3 months ago

              One’s dishwasher is not exposed to a harsh environment. A large percentage of code is exposed to an openly hostile environment.

              If a dishwasher breaks, it can destroy a floor, a room, maybe the rooms below. If code breaks it can lead to the computer, then network, being compromised. Followed by escalating attacks that can bankrupt a business and lead to financial ruin. (This is possibly extreme, but cyber attacks have destroyed businesses. The downside risks of terrible code can be huge.)

              • jj4211@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                ·
                3 months ago

                Yes, but just like quality, the people in charge of money aren’t totally on top of security either. They just see superficially convincing tutorial fodder and start declaring they will soon be able to get rid of all those pesky people. Even if you convince them a human does it better, they are inclined to think ‘good enough for the price’.

                So you can’t say “it’s no better than human at quality” and expect those people to be discouraged, it has to be pointed out how wildly off base it is.

          • MangoCats@feddit.it
            link
            fedilink
            English
            arrow-up
            4
            ·
            3 months ago

            Human coder here. First problem: define what is “writing code.” Well over 90% of software engineers I have worked with “write their own code” - but that’s typically less (often far less) than 50% of the value they provide to their organization. They also coordinate their interfaces with other software engineers, capture customer requirements in testable form, and above all else: negotiate system architecture with their colleagues to build large working systems.

            So, AI has written 90% of the code I have produced in the past month. I tend to throw away more AI code than the code I used to write by hand, mostly because it’s a low-cost thing to do. I wish I had the luxury of time to throw away code like that in the past and start over. What AI hasn’t done is put together working systems of any value - it makes nice little microservices. If you architect your system as a bunch of cooperating microservices, AI can be a strong contributor on your team. If you expect AI to get any kind of “big picture” and implement it down to the source code level - your “big picture” had better be pretty small - nothing I have ever launched as a commercially viable product has been that small.

            Writing code / being a software engineer isn’t like being a bricklayer. Yes, AI is laying 90% of our bricks today, but it’s not showing signs of being capable of designing the buildings, or even evaluating structural integrity of something taller than maybe 2 floors.

  • clif@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    3 months ago

    O it’s writing 100% of the code for our management level people who are excited about “”““AI””“”

    But then us plebes are rewriting 95% of it so that it will actually work (decently well).

    The other day somebody asked me for help on a repo that a higher up had shit coded because they couldn’t figure out why it “worked” but also logged a lot of critical errors. … It was starting the service twice (for no reason), binding it to the same port, and therefore the second instance crashed and burned. That’s something a novice would probably know not to do. But, if not, immediately see the problem, research, understand, fix, instead of “Icoughbuiltcoughthis thing, good luck fuckers”

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      20
      arrow-down
      1
      ·
      3 months ago

      Yep along with Fusion.

      We’ve had years of this. Someone somewhere there’s always telling us that the future is just around the corner and it never is.

      • Jesus_666@lemmy.world
        link
        fedilink
        English
        arrow-up
        19
        ·
        3 months ago

        At least the fusion guys are making actual progress and can point to being wildly underfunded – and they predicted this pace of development with respect to funding back in the late 70s.

        Meanwhile, the AI guys have all the funding in the world, keep telling about how everything will change in the next few months, actually trigger layoffs with that rhetoric, and deliver very little.

        • FundMECFS@anarchist.nexus
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          3 months ago

          They get 1+ billion a year. Probably much more if you include the undisclosed amounts China invests.

          • Jesus_666@lemmy.world
            link
            fedilink
            English
            arrow-up
            5
            ·
            edit-2
            3 months ago

            Yeah, and in the 70s they estimated they’d need about twice that to make significant progress in a reasonable timeframe. Fusion research is underfunded – especially when you look at how the USA dump money into places like the NIF, which research inertial confinement fusion.

            Inertial confinement fusion is great for developing better thermonuclear weapons but an unlikely candidate for practical power generation. So from that one billion bucks a year, a significant amount is pissed away on weapons research instead of power generation candidates like tokamaks and stellarators.

            I’m glad that China is funding fusion research, especially since they’re in a consortium with many Western nations. When they make progress, so do we (and vice versa).

  • Itdidnttrickledown@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    1
    ·
    3 months ago

    If he is wrong about that then he is probably wrong about nearly everything else he says. They just pull these statements out of their ass and try to make them real. The eternal problem with making something real is that reality cant be changed. The garbage they have now isn’t that good and he should know that.

  • ThePowerOfGeek@lemmy.world
    link
    fedilink
    English
    arrow-up
    43
    ·
    3 months ago

    It’s almost like he’s full of shit and he’s nothing but a snake oil salesman, eh.

    They’ve been talking about replacing software developers with automated/AI systems for a quarter of a century. Probably longer then that, in fact.

    We’re definitely closer to that than ever. But there’s still a huge step between some rando vibe coding a one page web app and developers augmenting their work with AI, and someone building a complex, business rule heavy, heavy load, scalable real world system. The chronic under-appreciation of engineering and design experience continues unabated.

    Anthropic, Open AI, etc? They will continue to hype their own products with outrageous claims. Because that’s what gets them more VC money. Grifters gonna grift.

    • chaosCruiser@futurology.today
      link
      fedilink
      English
      arrow-up
      3
      ·
      3 months ago

      See also: COOL:gen

      The whole concept of generating code is basically ancient by now. I heard about this stuff in the 90s, but now I found it that this thing has been around since 1985.

  • setsubyou@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    1
    ·
    3 months ago

    Well it’s not improving my productivity, and it does mostly slow me down, but it’s kind of entertaining to watch sometimes. Just can’t waste time on trying to make it do anything complicated because that never goes well.

    Tbh I’m mostly trying to use the AI tools my employer allows because it’s not actually necessary for me to believe that they’re helping. It’s good enough if the management thinks I’m more productive. They don’t understand what I’m doing anyway but if this gives them a warm fuzzy feeling because they think they’re getting more out of my salary, why not play along a little.

    • Terrasque@infosec.pub
      link
      fedilink
      English
      arrow-up
      4
      ·
      3 months ago

      Just can’t waste time on trying to make it do anything complicated because that never goes well.

      Yeah, that’s a waste of time. However, it can knock out simple code you can easily write yourself, but is boring to write and take time out of working on the real problems.

      • rozodru@piefed.social
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 months ago

        for setting stuff up, putting down a basic empty framework, setting up dirs/files/whatever, it’s great. in that regard yeah it’ll save you time.

        For doing the ACTUAL work? no. maybe to help write say a simple function or whatever, sure. beyond that? if it can’t nail it the first or second time? just ditch it.

        • Terrasque@infosec.pub
          link
          fedilink
          English
          arrow-up
          1
          ·
          3 months ago

          I’ve found it useful to write test units once you’we written one or two, write specific functions and small scripts. For example some time ago I needed a script that found a machine’s public ip, then post that to an mqtt topic along with timestamp, with config abstracted out in a file.

          Now there’s nothing difficult with this, but just looking up what libraries to use and their syntax takes some time, along with actually writing the code. Also, since it’s so straight forward, it’s pretty boring. ChatGPT wrote it in under two minutes, working perfectly on first try.

          It’s also been helpful with bash scripts, powershell scripts and ansible playbooks. Things I don’t really remember the syntax on between use, and which are a bit arcane / exotic. It’s just a nice helper to have for the boring and simple things that still need to be done.

  • Catoblepas@piefed.blahaj.zone
    link
    fedilink
    English
    arrow-up
    33
    arrow-down
    1
    ·
    3 months ago

    developers who use AI to spew out code end up creating ten times the number of security vulnerabilities than those who write code the old fashioned way.

    I’m going to become whatever the gay version of Amish is.

    • Dr. Bluefall@toast.ooo
      link
      fedilink
      English
      arrow-up
      14
      arrow-down
      1
      ·
      3 months ago

      I think that’s just wanting to join a gay primitivist(?) commune.

      I, uh, don’t suppose you got room for a bi-curious peep?

    • porksnort@slrpnk.net
      link
      fedilink
      English
      arrow-up
      6
      ·
      3 months ago

      That would be a Radical Faerie.

      Seriously check them out. It’s a cool and really influential group of pioneering gay dudes, gaying it up on the farm.

      They have sort of died out as a group, but one can hold a pitchfork in a homosexual manner whenever you choose. That’s not illegal yet.

      Radical Faeries

  • poopkins@lemmy.world
    link
    fedilink
    English
    arrow-up
    61
    ·
    3 months ago

    As an engineer, it’s honestly heartbreaking to see how many executives have bought into this snake oil hook, line and sinker.

    • rozodru@piefed.social
      link
      fedilink
      English
      arrow-up
      14
      ·
      3 months ago

      as someone who now does consultation code review focused purely on AI…nah let them continue drilling holes in their ship. I’m booked solid for the next several months now, multiple clients on the go, and i’m making more just being a digital janitor what I was as a regular consultant dev. I charge a premium to just simply point said sinking ship to land.

      Make no mistake though this is NOT something I want to keep doing in the next year or two and I honestly hope these places figure it out soon. Some have, some of my clients have realized that saving a few bucks by paying for an anthropic subscription, paying a junior dev to be a prompt monkey, while firing the rest of their dev team really wasn’t worth it in the long run.

      the issue now is they’ve shot themselves in the foot. The AI bit back. They need devs, and they can’t find them because putting out any sort of ad for hiring results in hundreds upon hundreds of bullshit AI generated resumes from unqualified people while the REAL devs get lost in the shuffle.

      • MangoCats@feddit.it
        link
        fedilink
        English
        arrow-up
        3
        ·
        3 months ago

        while firing the rest of their dev team

        That’s the complete mistake right there. AI can help code, it can’t replace the organizational knowledge your team has developed.

        Some shops may think they don’t have/need organizational knowledge, but they all do. That’s one big reason why new hires take so long to start being productive.

    • Blackmist@feddit.uk
      link
      fedilink
      English
      arrow-up
      14
      ·
      3 months ago

      Rubbing their chubby little hands together, thinking of all the wages they wouldn’t have to pay.

    • expr@programming.dev
      link
      fedilink
      English
      arrow-up
      11
      ·
      3 months ago

      Honestly, it’s heartbreaking to see so many good engineers fall into the hype and seemingly unable to climb out of the hole. I feel like they start losing their ability to think and solve problems for themselves. Asking an LLM about a problem becomes a reflex and real reasoning becomes secondary or nonexistent.

      Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

      • jj4211@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        ·
        3 months ago

        Based on my experience, I’m skeptical someone that seemingly delegates their reasoning to an LLM were really good engineers in the first place.

        Whenever I’ve tried, it’s been so useless that I can’t really develop a reflex, since it would have to actually help for me to get used to just letting it do it’s thing.

        Meanwhile the people who are very bullish who are ostensibly the good engineers that I’ve worked with are the people who became pet engineers of executives and basically have long succeeded by sounding smart to those executives rather than doing anything or even providing concrete technical leadership. They are more like having something akin to Gartner on staff, except without even the data that at least Gartner actually gathers, even as Gartner is a useless entity with respect to actual guidance.

      • auraithx@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        6
        ·
        3 months ago

        I mean before we’d just ask google and read stack, blogs, support posts, etc. Now it just finds them for you instantly so you can just click and read them. The human reasoning part is just shifting elsewhere where you solve the problem during debugging before commits.

        • expr@programming.dev
          link
          fedilink
          English
          arrow-up
          8
          ·
          3 months ago

          No, good engineers were not constantly googling problems because for most topics, either the answer is trivial enough that experienced engineers could answer them immediately, or complex and specific enough to the company/architecture/task/whatever that Googling it would not be useful. Stack overflow and the like has always only ever really been useful as the occasional memory aid for basic things that you don’t use often enough to remember how to do. Good engineers were, and still are, reasoning through problems, reading documentation, and iteratively piecing together system-level comprehension.

          The nature of the situation hasn’t changed at all: problems are still either trivial enough that an LLM is pointless, or complex and specific enough that an LLM will get it wrong. The only difference is that an LLM will spit out plausible-sounding bullshit and convince people it’s valuable when it is, in fact, not.

          • auraithx@lemmy.dbzer0.com
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            5
            ·
            3 months ago

            In the case of a senior engineer then they wouldn’t need to worry about the hallucination rate. The LLM is a lot faster than them and they can do other tasks while it’s being generated and then review the outputs. If it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM. If you actually know what you’re talking about you can see when it slips up and correct it.

            And that hallucination rate is rapidly dropping. We’ve jumped from about 40% accuracy to 90% over the past ~6mo alone (aider polygot coding benchmark) - at about 1/10th the cost (iirc).

            • Feyd@programming.dev
              link
              fedilink
              English
              arrow-up
              8
              ·
              3 months ago

              it’s trivial you’ve saved time, if not, you can pull up that documentation, and reason and step through the problem with the LLM

              Insane that just writing the code isn’t even an option in your mind

                • expr@programming.dev
                  link
                  fedilink
                  English
                  arrow-up
                  2
                  ·
                  3 months ago

                  It is, actually. The entire point of what I was saying is that you have all these engineers now that reflexively jump straight to their LLM for anything and everything. Using their brains to simply write some code themselves doesn’t even occur to them as an something they should do. Much like you do, by the sounds of it.

      • Mniot@programming.dev
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 months ago

        Executives are mostly irrelevant as long as they’re not forcing the whole company into the bullshit.

        I’m seeing a lot of this, though. Like, I’m not technically required to use AI, but the VP will send me a message noting that I’ve only used 2k tokens this month and maybe I could get more done if I was using more…?

        • expr@programming.dev
          link
          fedilink
          English
          arrow-up
          4
          ·
          3 months ago

          Yeah, fortunately while our CTO is giddy like a schoolboy about LLMs, he hasn’t actually attempted to force it on anyone, thankfully.

          Unfortunately, a number of my peers now seem to have become irreparably LLM-brained.

    • Feyd@programming.dev
      link
      fedilink
      English
      arrow-up
      4
      ·
      3 months ago

      Did you think executives were smart? What’s really heartbreaking is how many engineers did. I even know some that are pretty good that tell me how much more productive they are and all about their crazy agent setups (from my perspective i don’t see any more productivity)

  • inclementimmigrant@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    1
    ·
    3 months ago

    My company and specifically my team are looking at incorporating AI as a supplement to our coding.

    We looked at the code produced and determined that it’s of the quality of a new hire. However we’re going in with eyes wide open, and for me skeptical AF, going to try to use it in a limited way to help relieve some of the burdens of our SW engineers, not replace. I’m leading up the usage of writing out unit tests because none of us particularly like writing unit tests and it’s got a very nice, easy, established pattern that the AI can follow.

    • UnderpantsWeevil@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      ·
      3 months ago

      We looked at the code produced and determined that it’s of the quality of a new hire.

      As someone who did new hire training for about five years, this is not what I’d call promising.

      • MangoCats@feddit.it
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        3 months ago

        We looked at the code produced and determined that it’s of the quality of a new hire.

        As someone who did new hire training for about five years, this is not what I’d call promising.

        Agreed, however, the difference between a new hire who requires a desk and a parking space and a laptop and a lunch break and salary and benefits and is likely to “pursue other opportunities” after a few months or years, might turn around and sue the company for who knows what, and an AI assistant with a $20/mo subscription fee is enormous.

        Would I be happy with new-hire code out of a $80K/yr headcount, did I have a choice?

        If I get that same code, faster, for 1% of the cost?

        • UnderpantsWeevil@lemmy.world
          link
          fedilink
          English
          arrow-up
          8
          ·
          3 months ago

          Would I be happy with new-hire code out of a $80K/yr headcount, did I have a choice?

          If I get that same code, faster, for 1% of the cost?

          The theory is that the new hire gets better over time as they learn the ins and outs of your business and your workplace style. And they’re commanding an $80k/year salary because they need to live in a country that demands an $80k/year cost of living, not because they’re generating $80k/year of value in a given pay period.

          Maybe you get code a bit faster and even a bit cheaper (for now - those teaser rates never last long term). But who is going to be reviewing it in another five or ten years? Your best people will keep moving to other companies or retiring. Your worst people will stick around slapping the AI feed bar and stuffing your codebase with janky nonsense fewer and fewer people will know how to fix.

          Long term, its a death sentence.

          • Mniot@programming.dev
            link
            fedilink
            English
            arrow-up
            2
            ·
            3 months ago

            The theory is that the new hire gets better over time

            It always amazes me how few people get this. Have they only ever made terrible hires?

            The way that a company makes big profits is by hiring fresh graduates and giving them a cushy life while they grow into good SWEs. By the time you’re paying $200k for a senior software engineer, they’re generating far more than that in value. And you only had to invest a couple years and some chump change.

            But now businesses only think in the short-term and so paying $10k for a month of giving Anthropic access to our code base sounds like a bargain.

          • MangoCats@feddit.it
            link
            fedilink
            English
            arrow-up
            1
            ·
            3 months ago

            Agreed… however:

            The theory is that the new hire gets better over time as they learn the ins and outs of your business and your workplace style.

            The practice is that over half of them move on to “other opportunities” within a couple of years, even if you give them good salary, benefits and working conditions.

            And they’re commanding an $80k/year salary because they need to live in a country that demands an $80k/year cost of living

            Not in the US. In the US they’re commanding $80k/yr because of supply and demand, it has very little to do with cost of living. I suppose when you get supply so high / demand so low, you eventually hit a floor where cost of living comes into play, but in many high supply / low demand fields that doesn’t happen until $30k/yr or even lower… Case in point: starting salaries for engineers in the U.S. were around $30-40k/yr up until the .com boom, at which point software engineering capable college graduates ramped up to $70k/yr in less than a year, due to demand outstripping supply.

            stuffing your codebase with janky nonsense

            Our codebase had plenty of janky nonsense before AI came around. Just ask anyone: their code is great, but everyone else’s code is a bunch of janky nonsense. I actually have some hope that AI generated code may improve to a point where it becomes at least more intelligible to everyone than those other programmers’ janky nonsense. In the past few months I have actually seen Anthropic/Claude’s code output improve significantly toward this goal.

            Long term, its a death sentence.

            Definitely is, the pipeline should continue to be filled and dismissing seasoned talent is a mistake. However, I suspect everyone in the pipeline would benefit from learning to work with the new tools, at least the “new tools” in a year or so, the stuff I saw coming out of AI a year ago? Not really worthwhile at that time, but today it is showing promise - at least at the microservice level.

            • UnderpantsWeevil@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              3 months ago

              The practice is that over half of them move on to “other opportunities” within a couple of years, even if you give them good salary, benefits and working conditions.

              In my experience (coming from O&G IT) there’s a somewhat tight knit circle of contractors and businesses tied to specific applications. And you just cycle through this network over time.

              I’ve got a number of coworkers who are ex-contractors and a contractor lead who used to be my boss. We all work on the same software for the same company either directly or indirectly. You might move to command a higher salary, but you’re all leveraging the same accrued expertise.

              If you cut off that circuit of employment, the quality of the project will not improve over time.

              In the US they’re commanding $80k/yr because of supply and demand

              You’ll need to explain why all the overseas contractors are getting paid so much less, in that case.

              Again, we’re all working on the same projects for the same people with comparable skills. But I get paid 3x my Indian counterpart to be in the correct timezone and command enough fluent English language skills to deal with my bosses directly.

              Case in point: starting salaries for engineers in the U.S. were around $30-40k/yr up until the .com boom, at which point software engineering capable college graduates ramped up to $70k/yr in less than a year, due to demand outstripping supply.

              But then the boom busted and those salaries deflated down to the $50k range.

              I had coworkers who would pin for the Y2K era, when they were making $200k in the mid 90s to do remedial code clean up. But that was a very shortly lived phenomen. All that work would have been outsourced overseas in the modern day.

              Our codebase had plenty of janky nonsense before AI came around.

              Speeding up the rate of coding and volume of code makes that problem much worse.

              I’ve watched businesses lose clients - I even watched a client go bankrupt - from bad coding decisions.

              In the past few months I have actually seen Anthropic/Claude’s code output improve significantly toward this goal.

              If you can make it work, more power to you. But it’s a dangerous game I see a few other businesses executing without caution or comparable results.

              • MangoCats@feddit.it
                link
                fedilink
                English
                arrow-up
                1
                ·
                3 months ago

                You’ll need to explain why all the overseas contractors are getting paid so much less, in that case.

                If you’re talking about India / China working for US firms, it’s supply and demand again. Indian and Chinese contractors provide a certain kind of value, while domestic US direct employees provide a different kind of value - as you say: ease of communication, time zone, etc. The Indians and Chinese have very high supply numbers, if they ask for more salary they’ll just be passed over for equivalent people who will do it for less. US software engineers with decades of experience are in shorter supply, and higher demand by many US firms, so…

                Of course there’s also a huge amount of inertia in the system, which I believe is a very good thing for stability.

                But then the boom busted and those salaries deflated down to the $50k range.

                And that was a very uneven thing, but yes: starting salaries on the open market did deflate after .com busted. Luckly, I was in a niche where most engineers were retained after the boom and inertia kept our salaries high.

                $200K for remedial code cleanup should be a transient phenomenon, when national median household income hovers around $50-60K. With good architecture and specification development, AI can do your remedial code cleanup now, but you need that architecture and specification skill…

                I’ve watched businesses lose clients - I even watched a client go bankrupt - from bad coding decisions.

                I interviewed with a shop in a University town that had a mean 6 month turnover rate for programmers, and they paid the fresh-out of school kids about 1/3 my previous salary. We were exploring the idea of me working for them for 1/2 my previous salary, basically until I found a better fit. Ultimately they decided not to hire me with the stated reason not being that my salary demands were too high, but that I’d just find something better and leave them. Well… my “find a new job in this town” period runs 3-6 months even when I have no job at all, how can you lose anything when you burn through new programmers every 6 months or less? I believe the real answer was that they were afraid I might break their culture, start retaining programmers and building up a sustained team like in the places I came from, and they were making plenty of money doing things the way they had been doing them for 10 years so far…

                it’s a dangerous game I see a few other businesses executing without caution or comparable results.

                From my perspective, I can do what needs doing without AI. Our whole team can, and nobody is downsizing us or demanding accelerated schedules. We are getting demands to keep the schedules the same while all kinds of new data privacy and cybersecurity documentation demands are being piled on top. We’re even getting teams in India who are allegedly helping us to fulfill those new demands, and I suppose when the paperwork in those areas is less than perfect we can “retrain” India instead of bringing the pain home here. Meanwhile, if AI can help to accelerate our normal work, there’s plenty of opportunity for exploratory development of new concepts that’s both more fun for the team and potentially profitable for the company. If AI turns out to be a bust, most engineers on this core team have been supporting similar products for 10-20 years… we handled it without AI before…

                • UnderpantsWeevil@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  3 months ago

                  If you’re talking about India / China working for US firms, it’s supply and demand again.

                  It’s clearly not. Otherwise, we wouldn’t have a software guy left standing inside the US.

                  I interviewed with a shop in a University town that had a mean 6 month turnover rate for programmers

                  That’s just a bad business.

                  I can do what needs doing without AI.

                  More power to you.

        • homura1650@lemmy.world
          link
          fedilink
          English
          arrow-up
          3
          ·
          3 months ago

          New hires are often worse than useless. The effort that experienced developers spend assisting them is more than it would take those developers to do the work themselves.

          • MangoCats@feddit.it
            link
            fedilink
            English
            arrow-up
            2
            ·
            3 months ago

            Yes, this is the cost of training, and it is high, but also necessary if you are going to maintain a high level of capability in house.

            Management loves the idea of outsourcing, my experience of outsourcing is that the ultimate costs are far higher than in house training.

        • korazail@lemmy.myserv.one
          link
          fedilink
          English
          arrow-up
          5
          ·
          3 months ago

          That new hire might eat resources, but they actually learn from their mistakes and gain experience. If you can’t hold on to them once they have experience, that’s a you problem. Be more capitalist and compete for their supply of talent; if you are not willing to pay for the real human, then you can have a shitty AI that will never grow beyond a ‘new hire.’

          The future problem, though, is that without the experience of being a junior dev, where do you think senior devs come from? Can’t fix crappy code if all you know how to do is engineer prompts to a new hire.

          “For want of a nail,” no one knew how to do anything in 2030. Doctors were AI, Programmers were AI, Artists were AI, Teachers were AI, Students were AI, Politicians were AI. Humanity suffered and the world suffocated under the energy requirements of doing everything poorly.

          • MangoCats@feddit.it
            link
            fedilink
            English
            arrow-up
            2
            ·
            3 months ago

            If you can’t hold on to them once they have experience, that’s a you problem.

            I work at a large multi-national corp with competitive salaries, benefits, excellent working conditions, advancement opportunities, etc. I still have watched promising junior engineers hit the door just when they were starting to be truly valuable contributors.

            you can have a shitty AI that will never grow beyond a ‘new hire.’

            So, my perspective on this is that : over the past 12 months, AI has advanced more quickly than all the interns and new hires I have worked with over the past 3 decades. It may plateau here in a few months, even if it does it’s already better than half of the 2 year experienced software engineers I have worked with, at least at writing code based on natural language specs provided to it.

            The future problem, though, is that without the experience of being a junior dev, where do you think senior devs come from?

            And I absolutely agree, the junior dev pipeline needs to stay full, because writing code is less than half of the job. Knowing what code needs writing is a huge part of it, crafting implementable and testable requirements, learning the business and what is important to the business, that has always been more than half of my job when I had the title “Software Engineer”.

            the world suffocated under the energy requirements of doing everything poorly.

            While I sympathize, the energy argument is a pretty big red herring. What’s the energy cost of a human software engineer? They have a home that has to be built, maintained, powered, etc. Same for their transportation which is often a privately owned automobile, driving on roads that have to be built and maintained. They have to eat, they need air conditioning, medical care, dental care, clothes, they have children who need to spend 20 years in school, they take vacations on cruise ships or involving trans-oceanic jet travel… add up all that energy and divide it by their productive output writing code for their work… if AI starts helping them write that code even 2x faster, the energy consumed by AI is going to be trivial compared to the energy consumed by the software engineer per unit of code produced, even if producing code is only 20% of their total job.

            I would say the same goes for Doctors, Teachers, Politicians, etc. AI is not going to replace 100% of any job, but it may be dramatically accelerating 30% or more of many of them, and that increase in productivity / efficiency / accuracy is going to pay off in terms of fewer ProfessionX required to meet demands and/or ProfessionX simply serving the world better than they used to.

            My sister in law was a medical transcriptionist - made good money, for a while. Then doctors replaced her with automatic transcriptionists, essentially the doctors quit outsourcing their typing work to humans and started trusting machines to do it for them. All in all, the doctors are actually doing more work now than they did before when they had human transcriptionists they could trust, because now they are have the AI transcription that they need to check more closely for mistakes than they did their human transcriptionists, but the cost differential is just too big to ignore. That’s a job that was “eliminated” by automation, at least 90% or more in the last 20 years. But, it was really a “doctor accessory” job, we still have doctors, even though they are using AI assistants now…

    • Liam Mayfair@lemmy.sdf.org
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 months ago

      Writing tests is the one thing I wouldn’t get an LLM to write for me right now. Let me give you an example. Yesterday I came across some new unit tests someone’s agentic AI had written recently. The tests were rewriting the code they were meant to be testing in the test itself, then asserting against that. I’ll say that again: rather than calling out to some function or method belonging to the class/module under test, the tests were rewriting the implementation of said function inside the test. Not even a junior developer would write that nonsensical shit.

      The code those unit tests were meant to be testing was LLM written too, and it was fine!

      So right now, getting an LLM to write some implementation code can be ok. But for the love of god, don’t let them anywhere near your tests (unless it’s just to squirt out some dumb boilerplate helper functions and mocks). LLMs are very shit at thinking up good test cases right now. And even if they come up with good scenarios, they may pull these stunts on you like they did to me. Not worth the hassle.

      • MangoCats@feddit.it
        link
        fedilink
        English
        arrow-up
        2
        ·
        3 months ago

        Trusting any new code blindly is foolish, even if you’re paying a senior dev $200K/yr for it, it should be reviewed and understood by other team members before accepting it. Same is true for an LLM, but of course most organizations never do real code reviews in either scenario…

        20ish years ago, I was a proponent of pair programming. It’s not for everyone. It’s not for anyone 40 hours a week, but in appropriate circumstances for a few hours at a session it can be hugely beneficial. It’s like a real-time code review during development. I see that pair programming is as popular today as it was back then, maybe even less so, but… “Vibe coding” with LLMs in chat mode? That can be a very similar experience, up to a point.

    • rumba@lemmy.zip
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 months ago

      We’ve been poking at it for a while now. The parent company is demanding we see where it can fit. We’ve found some solid spots.

      It’s not good at ingesting a sprawling project and rooting in changes in several places, but it’s not bad at looking over a file and making best practice recommendations. I’ve seen it preemptively find some bugs in old code.

      If you want to use a popular library you’re not familiar with, it’ll wedge it in your current function reasonably well; you’ll need to touch it, but you probably won’t need to RTFM.

      It’s solid at documenting existing code. Make me a manual page for every function/module in this project.

      It can make a veteran programmer faster by making boilerplates and looking over their shoulder for problems. It has some limited use for peer programming.

      It will NOT let you hire a green programmer instead of a vetran, but it can help a green programmer come up to speed faster as long as you forbid them from copy/paste.

  • psycho_driver@lemmy.world
    link
    fedilink
    English
    arrow-up
    27
    ·
    3 months ago

    The good news is that AI is at a stage where it’s more than capable of doing the CEO of Anthropic’s job.

    • mhague@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      3 months ago

      I think Claude would refuse to work with dictators that murder dissidents. As an AI assistant, and all that.

      If they have a model without morals then that changes things.

  • kescusay@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    3 months ago

    After working on a team that uses LLMs in agentic mode for almost a year, I’d say this is probably accurate.

    Most of the work at this point for a big chunk of the team is trying to figure out prompts that will make it do what they want, without producing any user-facing results at all. The rest of us will use it to generate small bits of code, such as one-off scripts to accomplish a specific task - the only area where it’s actually useful.

    The shine wears off quickly after the fourth or fifth time it “finishes” a feature by mocking data because so many publicly facing repos it trained on have mock data in them so it thinks that’s useful.

    • rozodru@piefed.social
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      1
      ·
      3 months ago

      Rule of thumb: only use it for one or two runs and that’s it. after that back off because then Claude Code is then just going to start vomiting fecal matter from the other fecal matter its consumed.

      If it can’t nail something on the first or second go, don’t bother. I have clients that have pushed it through those moments and have produced literal garbage. But hey I make money off them so keep pushing man. I got companies/clients that are so desperate to reverse what they’ve done that they’re willing to wait until like March of next year when I’m free.

    • auraithx@lemmy.dbzer0.com
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      13
      ·
      3 months ago

      Sounds like they need to work on their prompts. I vibe code some hobby projects I wouldn’t have done otherwise and it’s never done that. I have it comment each change and review it all in diff checker so that’s 90% of the time.

      • rozodru@piefed.social
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        3 months ago

        I guarantee you that it HAS done that and I can almost assure you that whatever hobby project you’ve vibe coded doesn’t scale and I sure as hell hope it’s nothing that needs to be online or handles any sort of user info.

        • NotANumber@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          2
          ·
          3 months ago

          There is something I never understood about people who talk about scaling. Surely the best way to scale something is simply to have multiple instances with so many users on each one. You can then load balance between them. Why people feel the need to make a single instance scale to the moon I have no idea.

          It’s like how you don’t need to worry about MS Word scaling because everyone has a copy on their own machine. You could very much do the same thing for cloud services.

        • auraithx@lemmy.dbzer0.com
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          3 months ago

          Scale? It’s a personal ancestry site for my surname with graphs and shit mate. Compares naming patterns, locations, dna, clustering, etc between generations and tries to place loose people. Works pretty well, managed to find a bunch of missing connections through it.