> because they're not like alphazero, they just spit words This seems to be changing right now: here's a paper on recent, roughly speaking, AlphaZero-like research that specifically uses coding problems. They make it learning on experience rather than on traditional datasets.

Replies (1)

the axiom's avatar
the axiom 1 week ago
I don't know how you found this but it's definitely interesting, maybe this time human programmers will become unnecessary