Old Notes
Large Language Models
August 1, 2023
I wrote a bunch of notes on LLMs back in 2023 that I will soon include on this section.
You train the model on sequence completion; and what you get after that is a system that can complete sequences by which it can perform (any?!) task, as long as it is structured in terms of human language.
The whole proccedure sounds to me as something absurd that should instinctively not have been possible at all.
For more details, do check out the book 'How to build a Large Language Model from Scratch' https://github.com/rasbt/LLMs-from-scratch
Continue reading:Parameter Size isn't Everything
