# What Data Formats Do Different AI Models Use?

So far, we have explored the dataset format for fine-tuning on the OpenAI platform.

But what data formats do other AI models use?

Text processing AI models can use different variants of JSONL datasets, while AI models that take other input types, such as image processing models, can have their own data formats.

<br />

## Text Processing AI Models

For example, some text models are trained on JSONL datasets in which each line is a JSON object with two fields: `prompt`, the user input, and `completion`, the output the AI model should generate.

```json title="JSONL Data Format"
{"prompt": "What is the capital of France?", "completion": "The capital of France is Paris."}
{"prompt": "What is the smallest state in the US?", "completion": "The smallest state in the US is Rhode Island."}
```
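The JSONL layout above can be produced and read back with a few lines of Python. This is a minimal sketch using only the standard `json` module; the helper names `to_jsonl` and `from_jsonl` are illustrative and not part of any particular platform's tooling.

```python
import json

def to_jsonl(records):
    """Serialize a list of dicts to JSONL: one JSON object per line."""
    return "\n".join(json.dumps(record) for record in records)

def from_jsonl(text):
    """Parse JSONL text back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.split("\n") if line.strip()]

records = [
    {"prompt": "What is the capital of France?",
     "completion": "The capital of France is Paris."},
    {"prompt": "What is the smallest state in the US?",
     "completion": "The smallest state in the US is Rhode Island."},
]

jsonl_text = to_jsonl(records)   # two lines, one object per line
parsed = from_jsonl(jsonl_text)  # round-trips back to the original records
```

Keeping one complete JSON object per line means a dataset can be validated or streamed line by line without loading the whole file.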

<br />

## Image Processing AI Models

When training or fine-tuning image processing models, a common choice is a `CSV` (Comma-Separated Values) file that lists the path to each image file along with the label of that image.

```csv title="CSV Data Format"
imagePath,label
"/path/to/image1.jpg","cat"
"/path/to/image2.jpg","dog"
```
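A CSV file like this can be loaded into (path, label) pairs before the images themselves are read. The sketch below uses Python's standard `csv` module and an in-memory copy of the sample above; the function name `load_image_labels` is hypothetical, and a real training pipeline would open an actual file and then load each image from its path.

```python
import csv
import io

# In-memory copy of the sample CSV, so the sketch runs without a file on disk.
CSV_TEXT = '''imagePath,label
"/path/to/image1.jpg","cat"
"/path/to/image2.jpg","dog"
'''

def load_image_labels(csv_file):
    """Return a list of (image_path, label) tuples from a CSV file object."""
    reader = csv.DictReader(csv_file)  # uses the header row as field names
    return [(row["imagePath"], row["label"]) for row in reader]

samples = load_image_labels(io.StringIO(CSV_TEXT))
```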

Depending on the AI model, the image paths and labels can also be stored in other formats such as JSON or XML. For instance, another image processing AI model might expect a JSON dataset like the one shown below.

```json title="JSON Data Format"
{
  "images": [
    {"path": "/path/to/image1.jpg", "label": "cat"},
    {"path": "/path/to/image2.jpg", "label": "dog"}
  ]
}
```
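Because the same path and label information underlies both layouts, converting between them is mostly a matter of reshaping. As a rough sketch, assuming the `images`, `path`, and `label` keys from the sample above (a given model may require different key names), the conversion could look like this:

```python
import json

def pairs_to_json(pairs):
    """Wrap (path, label) pairs in the {"images": [...]} layout shown above."""
    payload = {"images": [{"path": path, "label": label} for path, label in pairs]}
    return json.dumps(payload, indent=2)

pairs = [
    ("/path/to/image1.jpg", "cat"),
    ("/path/to/image2.jpg", "dog"),
]

json_text = pairs_to_json(pairs)
```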

<br />

As you can see, data formats vary depending on the AI model: text processing AI models usually use the JSONL format, while image processing AI models can use CSV or JSON formats. Always structure your dataset according to the requirements of the model you are training.

### Various data formats, such as JSON and XML, can be used depending on the AI model.