• zbyte64@awful.systems
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 days ago

    Using computers to search for a counter example to a conjecture isn’t exactly new ground and I suspect they did so with the aide of some harness tweaks like some numerical LSP. Like cool, it pushed the envelope but like what the parent said, they grafted on the ability to do a specific task.

      • zbyte64@awful.systems
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 day ago

        Aren’t you the least bit curious what tools they gave the LLM and how the LLM used those tools? It’s like back in math class you are asked to solve a quadratic formula but you forgot how. So you use the calculator to try different numbers and the calculator is telling you if you are getting closer. Sure I got the right answer, but it’s hardly a testament to my math skills.

        • Communist@lemmy.frozeninferno.xyz
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 day ago

          The calculator does not tell them if they’re getting closer? This isn’t how anything works. No I can’t say I’m very interested in whether or not the llm has access to python/a calculator as long as it completes the task, that doesn’t matter.

          • zbyte64@awful.systems
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 day ago

            If you are not interested in how it completes the task then you are not an authority on how it works.

            • Communist@lemmy.frozeninferno.xyz
              link
              fedilink
              English
              arrow-up
              1
              ·
              22 hours ago

              I’m academically interested, what I mean when I say I’m not interested is that I just don’t see the significance when we’re talking about if it’s capable of the task.

              • zbyte64@awful.systems
                link
                fedilink
                English
                arrow-up
                1
                ·
                21 hours ago

                How are you able to understand it’s capability without understanding what tools it is capable of manipulating to effect?

                • Communist@lemmy.frozeninferno.xyz
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  20 hours ago

                  You aren’t, and that’s exactly what I’m saying, it’s capable of doing these things with tools, therefore it’s capable of doing these things.

                  • zbyte64@awful.systems
                    link
                    fedilink
                    English
                    arrow-up
                    1
                    ·
                    16 hours ago

                    So why are you allergic to people talking about the quality of the tools in regards to capability?