lesson1Title

lesson2Title

lesson3Title

lesson4Title

lesson5Title

lesson6Title

lesson7Title

lesson8Title

lesson9Title

lesson10Title

lesson11Title

lesson12Title

aiFineTuningApplicationChapter1Title

aiFineTuningApplicationChapter2Title

aiFineTuningApplicationChapter3Title

# Augmenting Data with AI

According to the [OpenAI official documentation](https://platform.openai.com/docs/guides/fine-tuning/example-count-recommendations), a JSONL dataset should contain at least `10 JSON objects`, and conducting fine-tuning with **only 50-100 high-quality data points** can yield good results.

However, manually creating a dataset is time-consuming and costly, which is why using `data augmentation techniques` to expand the dataset and then refining the augmented data is more efficient.

Data augmentation is a technique for generating new data based on existing data, allowing for an increased dataset size, preventing `overfitting`, and enhancing the generalization performance of the AI model.

In the past, implementing data augmentation required writing complex code to utilize a program. These days, data augmentation can be easily accomplished by instructing text-generating AI to create new data based on existing data.

Codefriends offers a feature that allows you to perform complex data augmentation with just **one click**.

 

## Augmenting Data with One Click

You can augment 10 lines of JSON data in the Codefriends fine-tuning practice environment through the 3 steps below.

 

### 1. Select Data

![thumbnail-600](https://academy.codefriends.net/assets/ai/fine-tuning/application/select-data-demo.png)

 

### 2. Create a New File

![thumbnail-600](https://academy.codefriends.net/assets/ai/fine-tuning/application/new-file-demo.png)

 

### 3. Automatically Add 10 Lines

![thumbnail-600](https://academy.codefriends.net/assets/ai/fine-tuning/application/data-augumented-demo.png)

 

When augmenting data, generative AI is used to create new training data based on the JSON data that has been generated so far.

Data augmentation utilizes existing data to generate new data, increasing the dataset size and preventing AI model overfitting to enhance the model's generalization performance.

### What is the most appropriate word to fill in the blank?

Augmenting Data with AI

Augmenting Data with One Click

1. Select Data

2. Create a New File

3. Automatically Add 10 Lines

What is the most appropriate word to fill in the blank?

Create a fine-tuned model

JSONL Data