I'm serious about the question. For all we know and think we know about how humans learn, there is one very concrete thing we know is for certain. Sleep is crucial to learning. So, here's the question again.

Are current language models more awake or asleep?

I would argue that when they are training they are asleep because they are revisiting prior experiences (training data) and making updates to their wiring (training). When they are inferring, they are awake because they are actively interacting with the world (e.g. chatting).

By this framework, language models (currently) are sleep-deprived zombies, who are operating on autopilot. They got one long sleep as an infant (pretraining), one long sleep as a high schooler (chat training), and maybe a short nap here and there (fine-tuning). But they are still zombies.

That seems... not good? I would hardly want an employee to be zombie on autopilot droning on and on. I mean I suppose it would be better if said zombies had pirated thousands of textbooks and had droned intelligibly about that. I suppose it could be better if I had hundreds of said zombies talking to each other in some infinite game of telephone. And maybe, maybe it could be better if these zombies responded super fast, and instead of droning it was more like a reflex, a knee-jerk response. But, I don't think so.

They would still be asleep.

Here is one proposal people have had for waking our zombies up. Some people suggest that we give our zombies a big notepad. But can you imagine? Talking to some sleep-deprived zombie and they are frantically scribbling down what you say, flipping through pages, running through a library, googling, and then knee-jerking a response over and over again. They are faking it. And yes I know, there are entire firms based off of this premise and good for them. But even then, at those firms, they don't expect those zombies to be god-forbid talking to their clients or making real decisions! Of course not, they are zombies.

That is not to say that an army of zombies is not useful. I could certainly use an army of zombies. I'm sure an army of zombies could be commercially viable too, and I'm sure there is an industry in making and maintaining an army of zombies as well. But, I think that that is not the future, it can't be the future.

It is like the Empire in Star Wars. The Sith have ensembled masses of droids and clones to overwhelm our public squares, art, culture, science, relationships, politics and craft. They can be useful and powerful, but they are asleep at the wheel. They come in hordes and those hordes are overwhelming. But, they can be Jedi mind tricked and they have horrible accuracy. They are pretty good at operating machinery or programming systems. But, they don't have direction and are fundamentally a static vision of the future, because, you know by now, they are asleep so they can't evolve a direction or a vision of the future.