HyperAI

Large Language Model

Large Language Model (LLM), also known as Large Language Model, is an artificial intelligence algorithm. It applies neural network technology with a large number of parameters and uses self-supervised learning technology to process and understand human language or text. Tasks such as text generation, machine translation, summary writing, image generation from text, machine coding, chatbots or conversational AI are all applications of large language models. Examples of this LLM model include Chat GPT from Open AI, BERT (Bidirectional Encoder Representation of Transformer) from Google, etc.

LLMs are characterized by their large size, with billions of parameters helping them learn complex patterns in language data. These models are often based on deep learning architectures such as Transformers, which helps them achieve impressive performance on various NLP tasks.

The following figure shows the development timeline of large language models in recent years., the yellow highlighted ones are the large language models that have been open sourced.

References

https://www.geeksforgeeks.org/large-language-model-llm/?ref=lbp