How large are large language models? (2025)
How large are large language models? (2025) This aims to be factual information about the size of large language models. None of this document was written by AI. I do not include any information from leaks or rumors. The focus of this document is on base models (the raw text continuation engines, not 'helpful chatbot/assistants'). This is a view from a few years ago to today of one very tiny fraction of the larger LLM story that's happening. History GPT-2,-medium,-large,-xl (2019): 137M, 380M