🧠 AI with Python – 📈 Train a Support Vector Machine (SVM) Classifier
Posted On: October 6, 2025
Introduction
Support Vector Machines (SVMs) are powerful supervised learning algorithms widely used for classification tasks.
They work by finding the optimal hyperplane that separates data points of different classes.
With kernels, SVMs can handle non-linear decision boundaries, making them flexible and effective.
In this post, we’ll train an SVM classifier using scikit-learn on the Iris dataset, evaluate its performance, and optionally visualize the decision boundaries in 2D.
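For readers who want the math behind the "optimal hyperplane": with labels y_i ∈ {−1, +1}, the standard soft-margin SVM solves

\min_{w,\,b,\,\xi}\ \frac{1}{2}\lVert w \rVert^2 + C \sum_i \xi_i
\quad \text{subject to} \quad y_i\,(w^\top x_i + b) \ge 1 - \xi_i,\quad \xi_i \ge 0

and predicts sign(w^T x + b). The C here is the same regularization parameter we pass to SVC below, and kernels swap the inner product x_i^T x_j for k(x_i, x_j), which is what makes non-linear boundaries possible.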
Why SVM?
- Finds the maximum-margin hyperplane, which tends to generalize well.
- Works for linear and non-linear data (via kernels).
- Effective in high-dimensional spaces.
- Caveat: requires feature scaling for stable performance (a quick demo follows this list).
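That last caveat is easy to demonstrate. Below is a minimal, self-contained sketch (separate from the tutorial's main code) comparing an unscaled RBF SVM against a scaled one; on Iris the gap is usually small because the features share similar ranges, but on mixed-scale data it can be large:

# Minimal sketch: effect of feature scaling on an RBF SVM.
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

raw = SVC(kernel="rbf").fit(X_tr, y_tr)                                       # no scaling
scaled = make_pipeline(StandardScaler(), SVC(kernel="rbf")).fit(X_tr, y_tr)   # scaled
print("raw   :", raw.score(X_te, y_te))
print("scaled:", scaled.score(X_te, y_te))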
Loading the Dataset
We’ll use scikit-learn’s built-in Iris dataset, a classic classification dataset with four numeric features and three classes (setosa, versicolor, virginica).
from sklearn.datasets import load_iris
iris = load_iris()
X, y = iris.data, iris.target
feature_names = iris.feature_names
target_names = iris.target_names
Train/Test Split
We split the dataset into training and testing sets, stratifying by class to preserve balance.
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)
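To double-check that stratification preserved the class balance, we can count labels in each split with numpy's bincount:

import numpy as np
# Each class should appear in roughly the same proportion in both splits.
print("train counts:", np.bincount(y_train))
print("test counts :", np.bincount(y_test))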
Building the SVM Pipeline
SVMs are sensitive to feature scales, so we combine StandardScaler and SVC in a Pipeline.
This ensures scaling happens inside cross-validation and prevents leakage.
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
clf = Pipeline([
    ("scaler", StandardScaler()),
    ("svm", SVC(kernel="rbf", C=1.0, gamma="scale"))
])
clf.fit(X_train, y_train)
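Because the scaler is a step inside the pipeline, cross-validation refits it on every training fold, so no statistics from a held-out fold leak into the scaling. A quick sanity check (cross_val_score clones the pipeline for each fold):

from sklearn.model_selection import cross_val_score

# Each fold gets a freshly fitted scaler and SVM, so there is no leakage.
scores = cross_val_score(clf, X_train, y_train, cv=5)
print(f"5-fold CV accuracy: {scores.mean():.3f} +/- {scores.std():.3f}")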
Evaluating the Model
We measure accuracy and use a classification report for detailed metrics.
from sklearn.metrics import accuracy_score, classification_report
y_pred = clf.predict(X_test)
print("Accuracy:", accuracy_score(y_test, y_pred))
print("Report:\n", classification_report(y_test, y_pred, target_names=target_names))
Confusion Matrix
A confusion matrix gives a visual summary of predictions vs. true labels.
from sklearn.metrics import ConfusionMatrixDisplay
import matplotlib.pyplot as plt
ConfusionMatrixDisplay.from_predictions(y_test, y_pred, display_labels=target_names, cmap="Blues")
plt.show()
Optional: Visualizing Decision Boundaries
For intuition, we can project the dataset onto two features (petal length and width) and plot the decision regions of an SVM.
import numpy as np
# Use only two features
X2 = X[:, [2, 3]]
X2_train, X2_test, y2_train, y2_test = train_test_split(
    X2, y, test_size=0.25, stratify=y, random_state=42
)
viz_clf = Pipeline([
    ("scaler", StandardScaler()),
    ("svm", SVC(kernel="rbf", C=1.0, gamma="scale"))
]).fit(X2_train, y2_train)
# Create mesh grid
x_min, x_max = X2[:, 0].min() - 0.5, X2[:, 0].max() + 0.5
y_min, y_max = X2[:, 1].min() - 0.5, X2[:, 1].max() + 0.5
xx, yy = np.meshgrid(np.linspace(x_min, x_max, 300), np.linspace(y_min, y_max, 300))
Z = viz_clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)
plt.contourf(xx, yy, Z, alpha=0.25, cmap="Paired")
plt.scatter(X2_train[:, 0], X2_train[:, 1], c=y2_train, edgecolor="k", cmap="Paired")
plt.xlabel(feature_names[2])
plt.ylabel(feature_names[3])
plt.title("SVM (RBF) Decision Boundaries – Petal Features")
plt.show()
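As an aside: if you have scikit-learn 1.1 or newer, DecisionBoundaryDisplay builds this kind of plot without a manual meshgrid. A minimal sketch reusing the fitted viz_clf from above:

from sklearn.inspection import DecisionBoundaryDisplay

# from_estimator builds the grid internally and calls predict on the pipeline.
disp = DecisionBoundaryDisplay.from_estimator(
    viz_clf, X2_train, response_method="predict",
    alpha=0.25, cmap="Paired",
    xlabel=feature_names[2], ylabel=feature_names[3],
)
disp.ax_.scatter(X2_train[:, 0], X2_train[:, 1], c=y2_train, edgecolor="k", cmap="Paired")
plt.show()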
Sample Output
- Accuracy: typically around 97% on this split (the exact value depends on how the data is split).
- Classification Report: Strong precision and recall across all three classes.
- Confusion Matrix: Minimal misclassifications.
- Decision Boundary Plot: Clear separation of classes in 2D projection.
Key Takeaways
- SVMs are powerful classifiers that maximize the margin between classes.
- They handle non-linear boundaries via kernels like RBF; C and gamma control the fit (a tuning sketch follows this list).
- Always scale features before training an SVM.
- Best suited to small and medium datasets with clear class structure; SVC training time grows quickly with the number of samples.
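One natural next step: the C=1.0 and gamma="scale" settings used throughout are scikit-learn's defaults, not tuned values. Here is a sketch of tuning them with GridSearchCV over the same pipeline (the grid values are illustrative starting points, not recommendations):

from sklearn.model_selection import GridSearchCV

# Parameter names are prefixed with the pipeline step name "svm".
param_grid = {
    "svm__C": [0.1, 1, 10, 100],
    "svm__gamma": ["scale", 0.01, 0.1, 1],
}
search = GridSearchCV(clf, param_grid, cv=5)
search.fit(X_train, y_train)
print("Best params:", search.best_params_)
print(f"Best CV accuracy: {search.best_score_:.3f}")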
Code Snippet:
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import Pipeline
from sklearn.svm import SVC
from sklearn.metrics import accuracy_score, classification_report, ConfusionMatrixDisplay
import matplotlib.pyplot as plt
import numpy as np
iris = load_iris()
X, y = iris.data, iris.target
feature_names = iris.feature_names
target_names = iris.target_names
print("X shape:", X.shape, "| y shape:", y.shape)
print("Classes:", list(target_names))
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)
clf = Pipeline(steps=[
    ("scaler", StandardScaler()),
    ("svm", SVC(kernel="rbf", C=1.0, gamma="scale"))
])
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
acc = accuracy_score(y_test, y_pred)
print(f"Test Accuracy: {acc:.4f}\n")
print("Classification Report:\n", classification_report(y_test, y_pred, target_names=target_names))
ConfusionMatrixDisplay.from_predictions(y_test, y_pred, display_labels=target_names, cmap="Blues")
plt.title("SVM (RBF) — Confusion Matrix")
plt.tight_layout()
plt.show()
# Choose two informative features (indices for petal length, petal width)
f1, f2 = 2, 3
X2 = X[:, [f1, f2]]
# Re-split for this 2D visualization
X2_train, X2_test, y2_train, y2_test = train_test_split(
    X2, y, test_size=0.25, stratify=y, random_state=42
)
viz_clf = Pipeline(steps=[
    ("scaler", StandardScaler()),
    ("svm", SVC(kernel="rbf", C=1.0, gamma="scale"))
]).fit(X2_train, y2_train)
# Create a meshgrid
x_min, x_max = X2[:, 0].min() - 0.5, X2[:, 0].max() + 0.5
y_min, y_max = X2[:, 1].min() - 0.5, X2[:, 1].max() + 0.5
xx, yy = np.meshgrid(np.linspace(x_min, x_max, 300),
                     np.linspace(y_min, y_max, 300))
# Predict on grid
Z = viz_clf.predict(np.c_[xx.ravel(), yy.ravel()]).reshape(xx.shape)
# Plot regions + training points
plt.figure(figsize=(7, 5))
plt.contourf(xx, yy, Z, alpha=0.25, cmap="Paired")
scatter = plt.scatter(X2_train[:, 0], X2_train[:, 1], c=y2_train, s=40, edgecolor="k", cmap="Paired")
plt.xlabel(feature_names[f1])
plt.ylabel(feature_names[f2])
plt.title("SVM (RBF) Decision Boundaries — 2 Features")
plt.legend(handles=scatter.legend_elements()[0], labels=list(target_names), title="Class", loc="upper left")
plt.tight_layout()
plt.show()