lesson1Title

lesson2Title

lesson3Title

lesson4Title

lesson5Title

lesson6Title

lesson7Title

lesson8Title

lesson9Title

lesson10Title

lesson11Title

lesson12Title

lesson13Title

lesson14Title

lesson15Title

pythonDataAnalysisAdvancedChapter4Title

pythonDataAnalysisAdvancedChapter1Title

pythonDataAnalysisAdvancedChapter2Title

pythonDataAnalysisAdvancedChapter3Title

# Introduction to `Scikit-learn`

`Scikit-learn` (imported as `sklearn`) is a leading open-source Python library for **machine learning** and **data analysis**.

It provides efficient tools for:
- Classification
- Regression
- Clustering
- Dimensionality reduction
- Model selection
- Data preprocessing

Built on top of `NumPy`, `SciPy`, and `Matplotlib`, Scikit-learn is designed to be simple, efficient, and accessible for both beginners and professionals.

<br/>

## Why Use `Scikit-learn`?

Here are the main reasons why `Scikit-learn` is a **go-to library** for machine learning in Python:

- *Comprehensive algorithms* — includes a wide range of supervised and unsupervised learning models
- *Consistent API* — uniform interface for model training and evaluation
- *Data preprocessing* — built-in tools for scaling, encoding, and feature transformation
- *Model evaluation* — ready-to-use metrics and validation utilities
- *Seamless integration* — works natively with NumPy arrays and Pandas DataFrames

<br/>

## Example: Training a Simple Model

You can install Scikit-learn using the following command:

```bash
pip install scikit-learn
```

After installing Scikit-learn, you can import it using the following command:

```python
import sklearn
```

<br/>

## Example: Training a Simple ML Model

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Load dataset
iris = load_iris()
X_train, X_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.2, random_state=42
)

# Create and train model
model = KNeighborsClassifier(n_neighbors=3)
model.fit(X_train, y_train)

# Evaluate
accuracy = model.score(X_test, y_test)
print(f"Accuracy: {accuracy:.2f}")
```

This example shows how little code is needed to:

1. Load a dataset
2. Split it into training and testing sets
3. Train a machine learning model
4. Evaluate its performance

Scikit-learn is widely used for machine learning tasks such as classification, regression, clustering, and dimensionality reduction.

### `Scikit-learn` is a library for machine learning in Python.

Introduction to Scikit-learn

Why Use Scikit-learn?

Example: Training a Simple Model

Example: Training a Simple ML Model

Scikit-learn is a library for machine learning in Python.

Introduction to `Scikit-learn`

Why Use `Scikit-learn`?

`Scikit-learn` is a library for machine learning in Python.