speak for yourself

blibla@slrpnk.net · 6 days ago

speak for yourself

agamemnonymous@sh.itjust.works · 2 days ago

As I said elsewhere, the AI probably isn’t going to just be an LLM. It’s probably gonna be a complex model that uses modules like LLMs to fulfill a compound task. But the exact architecture doesn’t matter.

We know that it can output code, which means we have a quantifiable metric to make it better at coding, and thousands of people are certainly trying. AI video was hot garbage 18 months ago, now it’s basically perfect.

It’s not if we’re going to get a decent coding AI, it’s when.

merc@sh.itjust.works · 1 day ago

It’s probably gonna be a complex model that uses modules like LLMs to fulfill a compound task.

That sounds very hand-wavey. But, even the presence of LLMs in the mix suggests it isn’t going to be very good at whatever it does, because LLMs are designed to fool humans into thinking something is realistic rather than actually doing something useful.

We know that it can output code, which means we have a quantifiable metric to make it better at coding

How so? Project managers have been working for decades to quantify code, and haven’t managed to make any progress at it.

It’s not if we’re going to get a decent coding AI, it’s when.

The year 30,000 AD doesn’t count.

agamemnonymous@sh.itjust.works · edit-2 1 day ago

LLMs are designed to fool humans into thinking something is realistic rather than actually doing something useful.

So closer to average human intelligence than it would appear. I don’t know why people keep insisting that confidently making things up and repeating things blindly is somehow distinct from the average human intelligence.

But more seriously, this whole mindset is based on a stagnation in development that I’m just not seeing. I think it was Stanford recently released a paper on a new architecture they developed that has serious promise.

How so? Project managers have been working for decades to quantify code, and haven’t managed to make any progress at it.

I think you misunderstand me. The metric is the code. We can look at the code, see what kind of mistakes it’s making, and then alter the model to try to be better. That is an iterative process.

The year 30,000 AD doesn’t count.

Sure. Maybe it’s 30,000AD. Maybe it’s next month. We don’t know when the breakthrough that kicks off massive improvement is going to hit, or even what it will be. Every new development could be the big one.

merc@sh.itjust.works · 22 hours ago

So closer to average human intelligence than it would appear

No, zero intelligence.

It’s like how people are fooled by optical illusions. It doesn’t mean optical illusions are smart, it just means that they tickle a part of the brain that sees patterns.

a paper on a new architecture they developed that has serious promise

Oooh, a new architecture and serious promise? Wow! You should invest!

The metric is the code. We can look at the code, see what kind of mistakes it’s making

No, we can’t. That’s the whole point. If that were possible, then companies could objectively determine who their best programmers were, and that’s a holy grail they’ve been chasing for decades. It’s just not possible.

and then alter the model to try to be better

Nobody knows how to alter the model to try to be better. That’s why multi-billion dollar companies are releasing new models that are worse than their previous models.

Maybe it’s next month

It’s definitely not next month, or next year, or next century. Nobody has any idea how to get to actual intelligence, and despite the hype progress is as slow as ever.

Every new development could be the big one

Keep drinking that kool-aid.