This is the technology worth trillions of dollars huh

  • Yaztromo@lemmy.world · 3 months ago

    GitLab Enterprise somewhat recently added support for Amazon Q (based on Claude) through an interface they call “GitLab Duo”. I needed to look up something in the GitLab docs, but thought I’d ask Duo/Q instead (the UI has a big button in the top left of every screen to bring up Duo to chat with Q):

    (Paraphrasing…)

    ME: How do I do X with Amazon Q in GitLab?

    Q: Open the Amazon Q menu in the GitLab UI and select the appropriate option.

    ME: [:looks for the non-existent menu:]

    ME: Where in the UI do I find this menu?

    Q: My last response was incorrect. There is no Amazon Q button in GitLab. In fact, there is no integration between GitLab and Amazon Q at all.

    ME: [:facepalm:]

  • krimson@lemmy.world · 3 months ago

    Seems it “thinks” a T is a D?

    Just needs a little more water and electricity and it will be fine.

    • sexybenfranklin@ttrpg.network · 3 months ago

      It’s more likely that Connecticut comes alphabetically after Colorado in the list of state names, and lists of states were probably over-represented in its training data, so the model has a higher statistical weight for putting Connecticut after Colorado when someone asks about a list of states.

  • ilinamorato@lemmy.world · 3 months ago

    ✅ Colorado

    ✅ Connedicut

    ✅ Delaware

    ❌ District of Columbia (on a technicality)

    ✅ Florida

    But not

    ❌ I’aho

    ❌ Iniana

    ❌ Marylan

    ❌ Nevaa

    ❌ North Akota

    ❌ Rhoe Islan

    ❌ South Akota

  • Djehngo@lemmy.world · 3 months ago

    The letters that make up words are a common blind spot for AIs: since they are trained on strings of tokens (roughly words), they don’t have a good concept of which letters are inside those words or what order they are in.
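
    You can see the split for yourself with a tokenizer library. A minimal sketch using OpenAI’s tiktoken package (assuming it’s installed; the exact split varies by tokenizer and model):

    ```python
    import tiktoken

    # cl100k_base is one of OpenAI's tokenizers; others split differently
    enc = tiktoken.get_encoding("cl100k_base")

    for word in ["Delaware", "Connecticut", "strawberry"]:
        token_ids = enc.encode(word)
        # Decode each token id back into the text fragment it stands for
        pieces = [enc.decode([t]) for t in token_ids]
        print(word, "->", pieces)
    ```

    The model only ever sees the ids of those fragments, never the letters inside them.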

    • NoiseColor @lemmy.world · 3 months ago

      I find it bizarre that people point to these obvious cases as proof the tech is worthless. Like saying cars are worthless because they can’t go underwater.

      • skisnow@lemmy.ca · 3 months ago

        Not bizarre at all.

        The point isn’t “they can’t do word games therefore they’re useless”, it’s “if this thing is so easily tripped up on the most trivial shit that a 6-year-old can figure out, don’t be going round claiming it has PhD level expertise”, or even “don’t be feeding its unreliable bullshit to me at the top of every search result”.

        • 1rre@discuss.tchncs.de · 3 months ago

          A six-year-old can read and write Arabic, Chinese, Ge’ez, etc., and yet most people with PhD-level expertise probably can’t, and it’s probably useless to them. LLMs can do this too. You can count the number of letters in a word, but so can a program written in a few hundred bytes of assembly. It’s completely pointless to make LLMs do that, as it’d just make them way less efficient than they need to be while adding nothing useful.
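
          Counting letters really is trivial for conventional code; a one-line sketch in Python, for comparison:

          ```python
          # No neural network needed: count letters directly
          print("strawberry".count("r"))  # prints 3
          ```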

          • Echo Dot@feddit.uk · 3 months ago

            So if the AI can’t do it, then that’s just proof that the AI is too smart to be able to do it? That’s your argument, is it? Nah, it’s just crap.

            You think that just because you attached it to an analogy, it makes sense. That’s not how it works. Look, I can do it too.

            My car is way too technologically sophisticated to be able to fly, therefore AI doesn’t need to be able to work out how many Rs are in “strawberry”.

            See how that made literally no sense whatsoever.

            • 1rre@discuss.tchncs.de · 3 months ago

              Except you’re expecting it to do everything. Your car is too “technically advanced” to walk on the sidewalk, but wait, you can do that anyway and don’t need to reinvent your legs.

          • skisnow@lemmy.ca · 3 months ago

            LOL, it seems like every time I get into a discussion with an AI evangelical, they invariably end up asking me to accept some really poor analogy that, much like an LLM’s output, looks superficially clever at first glance but doesn’t stand up to the slightest bit of scrutiny.

            • 1rre@discuss.tchncs.de · 3 months ago

              It’s more that the only way to get some anti-AI crusader to accept that there are some uses for it is to put it in an analogy that they have to actually process, rather than spitting out an “AI bad” kneejerk.

              I’m probably far more anti-AI than average; for 95% of what it’s pushed for, it’s completely useless, but that still leaves 5% that it’s genuinely useful for, which some people refuse to accept.

              • TempermentalAnomaly@lemmy.world · 3 months ago

                It’s amazing that if you acknowledge that:

                1. AI has some utility, and
                2. The (now tiresome and sloppy) tests they’re using don’t negate #1,

                you are now an AI evangelist. Just as importantly, #1 doesn’t justify the level of investment into AI. And when that realization hits business America, a correction will happen, and the people who will be affected aren’t the well-off but the average worker. The gains are for the few, the loss for the many.

              • Jomega@lemmy.world · 3 months ago

                it’s more that the only way to get some anti-AI crusader to accept that there are some uses for it

                Name three.

                • 1rre@discuss.tchncs.de · 3 months ago

                  I’m going to limit this to LLMs, as that’s the generally accepted term, and there are so many uses for AI in other fields that it’d be unfair.

                  1. Translation. LLMs are pretty much perfect for this.

                  2. Triaging issues for support. They’re useless for coming to solutions, but they’re as good as humans, without the wait, at sending people to the correct department to deal with their issues.

                  3. Finding and fixing issues with grammar. Spelling is something that can be caught by spell-checkers, but grammar is more context-dependent, another thing that LLMs are pretty much designed for, and useful for people writing in a second language.

                  4. Finding starting points to research deeper. LLMs have a lot of data about a lot of things, so they can be very useful for getting surface-level information, e.g. about areas in a city you’re visiting, explaining concepts in simple terms, etc.

                  5. Recipes. LLMs are great at saying what sounds right, so for cooking (not so much baking, but it may work) they’re great at spitting out recipes, including substitutions if needed, that go together without needing to read through how someone’s grandmother used to do xyz unrelated nonsense.

                  There’s a bunch more, but these were the first five that sprung to mind.

              • abir_v@lemmy.world · 3 months ago

                I feel this. In my line of work I really don’t like using them for much of anything (programming, ofc, like 80% of Lemmy users) because they get details wrong too often to be useful and I don’t like babysitting.

                But when I need a logging message, or to return an error, it’s genuinely a time saver. It’s good at pretty well 5%, as you say.

                But using it for art, math, problem solving, any of that kind of stuff that gets touted around by the business people? Useless, just fully fuckin useless.

                • 1rre@discuss.tchncs.de · 3 months ago

                  I don’t know about “art”. Part of AI image generation is replacing stock images and erotic photos, which frankly I don’t have a huge issue with, as they’re both at least semi-exploitative industries in many ways, and you just need something that’s good enough.

                  Obviously this doesn’t extend to things a reasonable person would consider art; that’s just business majors and tech bros rebranding something shitty to position it as a competitor to, or in the same class as, something it so obviously isn’t.

        • NoiseColor @lemmy.world · 3 months ago

          I don’t want to defend AI again, but it’s a technology: it can do some things and can’t do others. By now this should be obvious to everyone, except the people who believe everything commercials tell them.

          • kouichi@ani.social · 3 months ago

            How many people do you think know that AIs are “trained on tokens”, and understand what that means? It’s clearly not obvious to those who don’t, which is roughly everyone.

              • huppakee@feddit.nl · 3 months ago

                Go to an art museum and somebody will say “my 6-year-old could make this too”; in my view this is a similar fallacy.

                • NoiseColor @lemmy.world · 3 months ago

                  That makes no sense. That has nothing to do with it. What are you on about?

                  That’s like watching TV and not knowing how it works. You still know what to get out of it.

            • NoiseColor @lemmy.world · 3 months ago

              Ok? So what you are saying is that some lawyers are idiots. I could have told you that before AI existed.

              • Aceticon@lemmy.dbzer0.com · 3 months ago

                It’s not the AIs that are crap, it’s what they’ve been sold as capable of doing, and the reliability of their results, that’s massively disconnected from reality.

                The crap is what most of the Tech Investor class has pushed to the public about AI.

                It’s thus not at all surprising that many who work, or manage work, in areas where precision and correctness are essential have been deceived into thinking AI can do much of the work for them, only for it to turn out that AI can’t really do it, because of those precision and correctness requirements it simply cannot achieve.

                This will hit hardest those who are not Tech experts, such as Lawyers, but even some supposed Tech experts (such as some programmers) have been swindled in this way.

                There are many great uses for AI, especially stuff other than LLMs, in areas where false positives or false negatives are no big deal, but that’s not where the Make Money Fast slimy salesmen are pushing them.

                • NoiseColor @lemmy.world · 3 months ago

                  I think people today, after a year of experience with AI, know its capabilities reasonably well. My mother is 73, and it’s been a while since she stopped joking about the silly or wrong things AI wrote to her, so people using computers at their jobs should be much more aware.

                  I agree that LLMs are good at some things. They are great tools for what they can do. Let’s use them for those things! I mean, even programming has benefited a lot from this, especially in education, junior-level stuff, prototyping, …

                  When using any product, a certain responsibility falls on the user. You can’t blame technology for what stupid users do.

      • figjam@midwest.social · 3 months ago

        Understanding the bounds of tech makes it easier for people to gauge its utility. The only people who desire ignorance are those that profit from it.

        • NoiseColor @lemmy.world · 3 months ago

          Sure. But you can literally test almost all frontier models for free. It’s not like there is some conspiracy or secret. Even my 73-year-old mother uses it and knows its general limits.

        • FishFace@lemmy.world · 3 months ago

          Saying “it’s worth trillions of dollars huh” isn’t really promoting that attitude.

      • EnsignWashout@startrek.website · 3 months ago

        I find it bizarre that people point to these obvious cases as proof the tech is worthless. Like saying cars are worthless because they can’t go underwater.

        This reaction is because conmen are claiming that current generations of LLM technology are going to remove our need for experts and scientists.

        We’re not demanding submersible cars; we’re just laughing at the people paying top dollar for the latest electric car while planning an ocean cruise.

        I’m confident that there’s going to be a great deal of broken… everything… built with AI “assistance” during the next decade.

        • NoiseColor @lemmy.world · 3 months ago

          That’s not what you are doing at all. You are not laughing. Anti-AI people are outraged, full of hatred, and ready to pounce on anyone who isn’t as anti as they are. It’s a super emotional issue, especially on the fediverse.

          You may be confident because you probably don’t know how software is built. Nobody is going to just abandon all the experience they have, vibe code something, and release whatever. That’s not how it works.

      • knatschus@discuss.tchncs.de · 3 months ago

        Then why is Google using it for questions like that?

        Surely it should be advanced enough to realise its weakness with this kind of question and just not give an answer.

        • NoiseColor @lemmy.world · 3 months ago

          They are using it for every question. It’s pointless. The only reason they are doing it is to blow up their numbers.

          … they are trying to be in front, so that some future AI search wouldn’t capture their market share. It’s a safety thing, even if it’s not working for all types of questions.

          • TheGrandNagus@lemmy.world · 3 months ago

            The only reason they are doing it is to blow up their numbers.

            Ding ding ding.

            It’s so they can have impressive metrics for shareholders.

            “Our AI had n interactions this quarter! Look at that engagement!”, with no thought put into what user problems it actually solves.

            It’s the same as web results in the Windows start menu. “Hey shareholders, Bing received n interactions through the start menu, isn’t that great? Look at that engagement!”, completely obfuscating that most of the people who clicked are probably confused elderly users who clicked on a web result without realising.

            Line on chart must go up!

            • NoiseColor @lemmy.world · 3 months ago

              Yeah, but … they also can’t just do nothing and possibly miss out on something. Especially if they already invested a lot.

    • azertyfun@sh.itjust.works · 3 months ago

      It’s very funny that you can get ChatGPT to spell out the word (making each letter an individual token) and it will still be wrong.

      Of course it makes complete sense when you know how LLMs work, but this demo does a very concise job of short-circuiting the cognitive bias that talking machine == thinking machine.

  • hark@lemmy.world · 3 months ago

    With enough duct tape and chewed up bubble gum, surely this will lead to artificial general intelligence and the singularity! Any day now.

  • kreskin@lemmy.world · 3 months ago

    So this is the Terminator consciousness so many people are scared will kill us all…

  • Mrkawfee@lemmy.world · 3 months ago

    You don’t get it because you aren’t an AI genius. This chatbot has clearly turned sentient and is trolling you.

    • FauxLiving@lemmy.world · 3 months ago

      It doesn’t take an AI genius to understand that it is possible to use low-parameter models, which are cheaper to run but dumber.

      Considering Google serves billions of searches per day, they’re not using GPT-5 to generate the quick answers.

  • Deestan@lemmy.world · 3 months ago

    Hey hey hey hey don’t look at what it actually does.

    Look at what it feels like it almost can do and pretend it soon will!

  • dude@lemmings.world · 3 months ago

    Well, for anyone who knows a bit about how LLMs work, it’s pretty obvious why LLMs struggle to identify the letters in words.

      • JustTesting@lemmy.hogru.ch · 3 months ago

        They don’t look at it letter by letter but in tokens, which are automatically generated separately based on occurrence. So while ‘z’ could be its own token, ‘ne’ or even ‘the’ could be treated as a single token vector. Of course, ‘e’ would still be a separate token when it occurs in isolation. You could even have ‘le’ and ‘let’ as separate tokens, afaik. And each token is just a vector of numbers, like 300 or 1000 numbers that represent that token in a vector space. So ‘de’ and ‘e’ could be completely different and dissimilar vectors.

        So ‘delaware’ could look to an LLM more like de-la-w-are or similar.

        Of course, you could train it to figure out letter counts based on those tokens with a lot of training data, though that could lower performance on other tasks, and counting letters just isn’t that important compared to other stuff, I guess.
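
        As a toy sketch of that preprocessing step (the token split and dimensions here are made up for illustration):

        ```python
        import numpy as np

        # Toy embedding table: each token id maps to a dense vector.
        # Real models learn tables with hundreds or thousands of dimensions.
        vocab = {"de": 0, "la": 1, "w": 2, "are": 3}
        rng = np.random.default_rng(0)
        embeddings = rng.normal(size=(len(vocab), 8))  # 8 dims for illustration

        # "delaware" as the model might see it: a sequence of token vectors,
        # with nothing marking that "de" contains the letter "d".
        token_ids = [vocab[t] for t in ["de", "la", "w", "are"]]
        print(embeddings[token_ids].shape)  # (4, 8)
        ```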

        • MangoCats@feddit.it · 3 months ago

          Of course, when the question asks “contains the letter _”, you might think an intelligent algorithm would get off its tokens and do a little letter-by-letter analysis. Related: ChatGPT is really bad at chess, but there are plenty of algorithms that are superhumanly good at it.
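
          For comparison, a toy sketch of what “getting off its tokens” could look like, checking the thread’s question deterministically (state list truncated for brevity):

          ```python
          # Letter-by-letter check: which of these states contain a "d"?
          states = ["Colorado", "Connecticut", "Delaware", "Florida", "Idaho",
                    "Indiana", "Maryland", "Nevada", "North Dakota",
                    "Rhode Island", "South Dakota"]
          print([s for s in states if "d" in s.lower()])
          # Connecticut correctly drops out: there is no "d" in it.
          ```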

        • fading_person@lemmy.zip · 3 months ago

          Wouldn’t that only explain errors by omission? If you ask for a letter, let’s say D, it would omit words containing that letter when it’s inside a token together with other letters, like Da, De, etc., but how would it return something where the letter D isn’t even in the word?

          • JustTesting@lemmy.hogru.ch · 3 months ago

            Well, each token has a vector. So ‘co’ might be [0.8, 0.3, 0.7], just instead of 3 numbers it’s more like 100–1000 long. And each token has a different such vector. Initially, those are just randomly generated, but the training algorithm is allowed to slowly modify them during training, pulling them this way and that, whichever way yields better results. So while for us ‘th’ and ‘the’ are obviously related, for a model no such relation is given. It just sees random vectors, and the training slowly reorganizes them to have some structure. So who’s to say whether for the model ‘d’, ‘da’ and ‘co’ are in the same general area (similar vectors) whereas ‘de’ could be in the opposite direction. Here’s an example of what this actually looks like. Tokens can be quite long, depending on how common they are; here it’s ones related to disease-y terms ending up close together, as similar things tend to cluster at this step. You might have a place where it’s just common town-name suffixes clustered close to each other.

            And all of this is just what gets input into the LLM, essentially a preprocessing step. So imagine someone gave you a picture like the above, but instead of each dot having some label, it just had a unique color. And then they give you lists of different colored dots and ask you what color the next dot should be. You need to figure out the rules yourself, coming up with more and more intricate rules that are correct most often. That’s kinda what an LLM does. To it, ‘da’ and ‘de’ could be identical dots in the same location, or completely different.

            Plus, of course, that’s before the LLM not actually knowing what a letter or a word or counting is. But it does know that 5.6.1.5.4.3 is most likely followed by 7.7.2.9.7 (simplified representation), which, when translated back, maps to ‘there are 3 r’s in strawberry’. It’s actually quite amazing that they can get it even halfway right, given how they work, just based on ‘learning’ how text structure works.

            But so in this example, US-state-y tokens are probably close together, ‘d’ is somewhere else, the relation between ‘d’ and the different state-y tokens is not at all clear, plus other tokens making up the full state names could be who knows where. And then there’s whatever the model does on top of that with the data.

            For a human it’s easy: just split by letters and count. For an LLM, it’s trying to correlate lots of different and somewhat unrelated things to their ‘d-ness’, so to speak.
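
            To make the “random vectors” point concrete, here’s a small sketch (the vectors are invented, as untrained embeddings would be):

            ```python
            import numpy as np

            def cosine(a, b):
                # Cosine similarity: 1.0 = same direction, ~0 = unrelated
                return a @ b / (np.linalg.norm(a) * np.linalg.norm(b))

            # Before training, 'd', 'de' and 'da' get random vectors, so the
            # model starts with no hint that they share the letter "d".
            rng = np.random.default_rng(1)
            d, de, da = rng.normal(size=(3, 8))
            print(cosine(d, de), cosine(d, da))  # no systematic similarity
            ```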

            • fading_person@lemmy.zip · 3 months ago

              Thank you very much for taking the time to explain this. If you don’t mind, can you recommend some references for further reading on how LLMs work internally?