The Meaning and Evolution of GPT
GPT stands for Generative Pre-trained Transformer, referring to a generative AI model pre-trained on large datasets, based on the Transformer model introduced by a Google research team in 2017.
Transformer: An AI model architecture that assigns weights to words based on their importance in a sentence, allowing for efficient parallel computation.
What does the name GPT really mean?
- Generative: The AI model can generate (or create) text.
- Pre-trained: It has learned from a vast amount of data in advance.
- Transformer: It uses the Transformer-based AI model architecture.
Background of GPT
Before the development of Transformer-based AI models, natural language processing (NLP) primarily relied on rule-based systems and deep learning approaches.
Rule-Based Approach
The rule-based approach involves defining specific rules in advance and processing data or drawing conclusions based on those rules. It produces predictable outputs for given inputs.
Example: identifying the subject and verb in a sentence.
Rules:
- In English, the first word in a sentence is likely to be the subject.
- The word following the subject is likely to be the verb.
Input sentence: "The cat sleeps."
Applied rules:
- "The cat" is identified as the subject.
- "sleeps" is identified as the verb.
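The rules above can be sketched directly in code. This is a minimal, illustrative implementation under the stated assumptions (the function name and the article-handling rule are our own, not part of any real NLP library):

```python
def parse_simple_sentence(sentence):
    """Apply the hand-written rules: leading words form the subject,
    the next word is the verb."""
    words = sentence.strip(".").split()
    # Assumed extra rule: if the sentence starts with an article,
    # include the following noun in the subject ("The cat").
    if words[0].lower() in ("the", "a", "an") and len(words) > 2:
        subject = " ".join(words[:2])
        verb = words[2]
    else:
        subject = words[0]
        verb = words[1]
    return subject, verb

print(parse_simple_sentence("The cat sleeps."))  # ('The cat', 'sleeps')
```

Note how brittle this is: a sentence like "Running late, the cat slept." already violates the rules, which is exactly the limitation described next.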
The rule-based approach did not work well for inputs outside its defined patterns and struggled to keep up with the ever-changing realities of natural language.
Deep Learning
Deep learning refers to using artificial neural networks to learn patterns in data and make predictions about new data based on what was learned.
Key terms related to deep learning are as follows:
Neural Networks
Neural networks are computer models that mimic the human brain, structured to process input data and produce an output. These networks are composed of multiple layers, with each layer processing the input data to extract higher-level information.
Components of these layers include:
- Input Layer: The layer that receives data
- Hidden Layers: Multiple intermediate layers that process data and learn patterns
- Output Layer: The layer that delivers the final output
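The three kinds of layers can be illustrated with a toy forward pass. This is a minimal sketch, not a real framework; the layer sizes and random weights are arbitrary assumptions chosen just to show data flowing input → hidden → output:

```python
import math
import random

random.seed(0)

def dense(inputs, weights, biases):
    # Each unit computes a weighted sum of its inputs plus a bias,
    # passed through a non-linearity (here, tanh).
    return [math.tanh(sum(w * x for w, x in zip(row, inputs)) + b)
            for row, b in zip(weights, biases)]

input_layer = [0.5, -0.2, 0.1]  # input layer: 3 features
w_hidden = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(4)]
b_hidden = [0.0] * 4
w_output = [[random.uniform(-1, 1) for _ in range(4)] for _ in range(2)]
b_output = [0.0] * 2

hidden = dense(input_layer, w_hidden, b_hidden)  # hidden layer: 4 units
output = dense(hidden, w_output, b_output)       # output layer: 2 units
print(len(hidden), len(output))  # 4 2
```

Each layer transforms the previous layer's output, which is how deeper layers come to represent higher-level information.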
Training
Training is the process wherein the neural network processes input data and learns patterns.
For example, by showing numerous images of cats and dogs, the network learns how to distinguish between cats and dogs in images.
Key terms include:
- Dataset: A collection of data used for training (e.g., thousands of images of cats and dogs)
- Label: Information indicating what each piece of data represents (e.g., cat images labeled as 'cat', dog images labeled as 'dog')
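The dataset/label idea can be shown with a deliberately tiny example. The numbers below are made-up 1-D "features" standing in for cat and dog images; the learning rule (splitting at the midpoint of the two class averages) is a simple stand-in for real training, not any specific algorithm from the text:

```python
# A labeled dataset: each entry is (feature, label).
dataset = [(-2.0, "cat"), (-1.5, "cat"), (1.2, "dog"), (2.3, "dog")]

def predict(x, threshold):
    return "dog" if x > threshold else "cat"

# "Training": compute a decision threshold from the labeled examples,
# here the midpoint between the average cat and average dog feature.
cats = [x for x, label in dataset if label == "cat"]
dogs = [x for x, label in dataset if label == "dog"]
threshold = (sum(cats) / len(cats) + sum(dogs) / len(dogs)) / 2

print(predict(-1.0, threshold), predict(2.0, threshold))  # cat dog
```

The labels are what make learning possible: without them, the model would have no way to know which side of the threshold should mean "cat".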
While deep learning has been highly successful, earlier architectures struggled with sequential data: when analyzing long texts, they often "forgot" earlier parts of a sentence, much like a reader forgetting the beginning of a book by the time they reach the end.
Emergence of the Transformer Model
The Transformer model was designed to reduce processing time by handling input data in parallel, and to understand context by considering the relationships between elements of the input.
With large-scale pre-training on data, GPT demonstrated outstanding performance in natural language processing and has rapidly evolved with version upgrades such as GPT-2, GPT-3, and GPT-4.