(Untitled)

Here's an excellent video from @karpathy@sigmoid.social with some intriguing ways to think about Large Language Models:

  1. They are a lossy compression of the Internet.
  2. They could be seen as "... the kernel process of an emerging operating system". -- they coordinate a lot of resources (memory, computational tools) for problem solving. The internet is the "disk", the context window is "RAM" as working memory.

{{< yt "zjkBMFhNj_g" >}}