Python AI 分类实战教程：scikit-learn 训练与评估完整代码

Q: 这篇文章适合谁读？

这篇文章适合想用 实战 难度理解“Python 人工智能小实战：用 scikit-learn 完成一个分类任务”的读者，预计阅读时间约 10 分钟，重点覆盖 Python, scikit-learn, Classification。

阅读信息

难度: 实战阅读时间: 10 分钟

Python
scikit-learn
Classification

打开知识图谱

中文

Python 人工智能小实战：用 scikit-learn 完成一个分类任务

前面几篇文章讲了人工智能概念、机器学习流程、模型训练评估和神经网络基础。这一篇用一个小实战把流程跑通：使用 Python 和 scikit-learn 完成一个二分类任务。

这个例子使用 scikit-learn 内置的 breast cancer 数据集，不需要额外下载文件。重点不是追求最高分，而是完整经历数据加载、拆分、标准化、训练、预测和评估。

注意：这个数据集只用于机器学习教学练习，不能用于医疗判断或现实诊断。本文关注的是分类流程，而不是医学结论。

一、准备环境

建议先创建虚拟环境，再安装依赖：

python3 -m venv .venv
source .venv/bin/activate
pip install scikit-learn

本文只使用 scikit-learn，不依赖深度学习框架。这样可以把注意力放在机器学习的基本流程上。

二、完整代码

下面是一份可以直接运行的代码：

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def main():
    dataset = load_breast_cancer()
    X = dataset.data
    y = dataset.target

    X_train, X_test, y_train, y_test = train_test_split(
        X,
        y,
        test_size=0.2,
        random_state=42,
        stratify=y,
    )

    model = Pipeline(
        steps=[
            ("scaler", StandardScaler()),
            ("classifier", LogisticRegression(max_iter=500)),
        ]
    )

    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    print("Accuracy:", accuracy_score(y_test, y_pred))
    print("Confusion matrix:")
    print(confusion_matrix(y_test, y_pred))
    print("Classification report:")
    print(classification_report(y_test, y_pred, target_names=dataset.target_names))


if __name__ == "__main__":
    main()

保存为 ai_classification_demo.py 后运行：

python ai_classification_demo.py

如果你第一次运行时下载或导入依赖较慢，可以先确认虚拟环境已经启用，并用 python -c "import sklearn; print(sklearn.__version__)" 检查 scikit-learn 是否安装成功。

三、数据集是什么

load_breast_cancer() 会返回一个二分类数据集。每条样本包含多个数值特征，标签表示样本属于哪一类。

在代码里：

X 是特征矩阵，每一行是一条样本
y 是标签数组，每个元素对应一条样本的类别
dataset.target_names 保存类别名称

这个数据集已经被整理成数值特征，适合用来练习基础分类流程。

四、为什么要拆分训练集和测试集

代码中使用了 train_test_split()：

X_train, X_test, y_train, y_test = train_test_split(
    X,
    y,
    test_size=0.2,
    random_state=42,
    stratify=y,
)

这里 test_size=0.2 表示 20% 数据用于测试。stratify=y 表示划分后尽量保持类别比例一致，这对分类任务很有用。

如果不拆分测试集，只在训练集上看结果，模型可能只是记住了训练样本，而不是真的具备泛化能力。

五、为什么使用 Pipeline

代码里没有单独先标准化再训练，而是使用了 Pipeline：

model = Pipeline(
    steps=[
        ("scaler", StandardScaler()),
        ("classifier", LogisticRegression(max_iter=500)),
    ]
)

这样做有两个好处：

标准化和模型训练被放在同一个流程里，不容易漏步骤
测试集会使用训练集上学到的标准化参数，避免数据泄漏

数据泄漏是初学者常见错误。如果你先对完整数据做标准化，再拆分训练集和测试集，测试集的信息就已经提前影响了训练过程。

六、模型为什么选逻辑回归

逻辑回归是分类任务里非常经典的基线模型。它训练速度快、结果稳定、容易解释，适合作为入门模型。

这里没有直接使用神经网络，是因为入门时先跑通完整流程更重要。等你能解释这段代码的每一步，再换成随机森林、支持向量机或神经网络会更自然。

七、怎么看评估结果

代码会输出三类结果：

Accuracy：整体预测正确比例
confusion_matrix：模型把哪些类别预测错了
classification_report：precision、recall、F1-score 等指标

如果准确率很高，也不要马上结束。你还应该看混淆矩阵，确认模型主要错在哪一类；再看 recall 和 precision，判断错误类型是否符合业务要求。

八、一次可复查的预期输出

不同 scikit-learn 版本可能让最后几位小数略有变化，但运行同一份代码时，输出结构应该类似下面这样。重点不是记住具体数字，而是确认你能解释每个指标来自哪里。

Accuracy: 0.97...
Confusion matrix:
[[40  3]
 [ 1 70]]
Classification report:
              precision    recall  f1-score   support
   malignant       ...       ...       ...        43
      benign       ...       ...       ...        71

如果你的准确率明显低很多，先检查三件事：是否使用了 stratify=y，是否把 StandardScaler 放进了 Pipeline，以及 LogisticRegression(max_iter=500) 是否正常收敛。

九、可以继续尝试什么

跑通代码后，可以做几个小实验：

把 test_size 改成 0.3，观察结果是否稳定
去掉 StandardScaler，比较指标变化
把 LogisticRegression 换成 RandomForestClassifier
打印 dataset.feature_names，理解每个特征的含义
尝试找出预测错误的样本索引，看看它们有什么特点

人工智能基础学习的关键，是把每个例子都拆成可解释的步骤。你不只是运行了一个分类模型，而是完整走过了一次机器学习工作流。

十、对照实验怎么设计

小实战最有价值的部分是对照实验。不要一次改很多东西，否则不知道结果变化来自哪里。建议一次只改一个变量，并记录指标变化。

实验	只改变什么	观察重点
去掉标准化	删除 `StandardScaler`	逻辑回归是否更难收敛，指标是否下降
改变测试集比例	`test_size` 从 0.2 改成 0.3	指标波动是否仍在可接受范围
更换模型	换成随机森林	是否真正超过基线，错误类型是否改变
去掉分层抽样	删除 `stratify=y`	类别比例是否变化，少数类 recall 是否不稳定

这样做能帮助你从“跑通代码”过渡到“理解实验”。一个结果是否可靠，通常不是看一次分数，而是看在合理扰动下是否仍然稳定。

十一、把这个练习写进学习笔记

建议你运行完代码后，记录下面几项：

数据集有多少样本、多少特征、几个类别
训练集和测试集分别有多少样本
准确率、precision、recall 和 F1-score 分别是多少
混淆矩阵里哪一类错误更多
去掉标准化或换模型后，结果有什么变化

这些记录比单纯截图一个准确率更有价值，因为它们能帮助你解释实验，而不是只保存结果。

十二、系列回顾

这篇文章把前面的内容落到了代码上。需要回看概念时，可以从人工智能基础学习路线重新开始，也可以回到博客页查看完整系列。

英文

Python AI Mini Practice: A Classification Task with scikit-learn

在独立页面打开

The previous articles covered AI concepts, the machine learning workflow, model training and evaluation, and neural network basics. This article runs a small end-to-end practice project: a binary classification task with Python and scikit-learn.

The example uses the breast cancer dataset built into scikit-learn, so no external data file is required. The goal is not to chase the highest score. The goal is to walk through loading data, splitting data, standardizing features, training, predicting, and evaluating.

Note: this dataset is used here only for machine learning practice. It should not be used for medical decisions or real diagnosis. The article focuses on the classification workflow, not medical conclusions.

1. Prepare the Environment

Create a virtual environment and install the dependency:

python3 -m venv .venv
source .venv/bin/activate
pip install scikit-learn

This example uses only scikit-learn, not a deep learning framework. That keeps the focus on the basic machine learning workflow.

2. Complete Code

The following script can be run directly:

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def main():
    dataset = load_breast_cancer()
    X = dataset.data
    y = dataset.target

    X_train, X_test, y_train, y_test = train_test_split(
        X,
        y,
        test_size=0.2,
        random_state=42,
        stratify=y,
    )

    model = Pipeline(
        steps=[
            ("scaler", StandardScaler()),
            ("classifier", LogisticRegression(max_iter=500)),
        ]
    )

    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    print("Accuracy:", accuracy_score(y_test, y_pred))
    print("Confusion matrix:")
    print(confusion_matrix(y_test, y_pred))
    print("Classification report:")
    print(classification_report(y_test, y_pred, target_names=dataset.target_names))


if __name__ == "__main__":
    main()

Save it as ai_classification_demo.py and run:

python ai_classification_demo.py

If dependency import feels slow the first time, confirm that the virtual environment is active and run python -c "import sklearn; print(sklearn.__version__)" to check that scikit-learn is installed.

3. What the Dataset Contains

load_breast_cancer() returns a binary classification dataset. Each sample contains numeric features, and the label indicates which class the sample belongs to.

In the script:

X is the feature matrix, with one row per sample
y is the label array, with one label per sample
dataset.target_names contains the class names

The dataset is already prepared as numeric features, which makes it useful for practicing classification basics.

4. Why Split Training and Test Data?

The script uses train_test_split():

X_train, X_test, y_train, y_test = train_test_split(
    X,
    y,
    test_size=0.2,
    random_state=42,
    stratify=y,
)

test_size=0.2 means 20% of the data is reserved for testing. stratify=y tries to preserve the class ratio after the split, which is useful for classification.

If you evaluate only on training data, the model may have memorized training examples instead of learning a pattern that generalizes.

5. Why Use Pipeline?

The code uses Pipeline instead of manually standardizing first and training later:

model = Pipeline(
    steps=[
        ("scaler", StandardScaler()),
        ("classifier", LogisticRegression(max_iter=500)),
    ]
)

This has two benefits:

Standardization and classification stay in one reproducible workflow
The test set uses scaling parameters learned only from the training set, which avoids data leakage

Data leakage is a common beginner mistake. If you standardize the full dataset before splitting, information from the test set has already influenced training.

6. Why Logistic Regression?

Logistic regression is a classic baseline for classification. It is fast, stable, and easier to explain than many more complex models.

This example does not start with a neural network because running the full workflow is more important at this stage. Once every line in this script is clear, replacing the classifier with a random forest, support vector machine, or neural network becomes more meaningful.

7. How to Read the Evaluation

The script prints three kinds of results:

Accuracy: the overall proportion of correct predictions
confusion_matrix: which classes were predicted incorrectly
classification_report: precision, recall, F1-score, and related metrics

Even if accuracy is high, do not stop there. Check the confusion matrix to see which class causes mistakes, then compare precision and recall to the requirements of the problem.

8. What to Try Next

After the script runs, try a few small experiments:

Change test_size to 0.3 and see whether results stay stable
Remove StandardScaler and compare the metrics
Replace LogisticRegression with RandomForestClassifier
Print dataset.feature_names and read what each feature means
Find the indexes of wrong predictions and inspect those samples

The key to learning AI foundations is to make each example explainable. In this practice project, you did not just run a classifier. You walked through a complete machine learning workflow.

9. Practice Run Audit Table

After running the script, use the table below to turn a one-time execution into a reproducible learning record.

Audit item	What to capture	Why it matters	What to try next
Environment	Python version, scikit-learn version, and command used	Different library versions can change defaults and warnings	Re-run after upgrading dependencies and compare output
Dataset shape	Sample count, feature count, class names, and class balance	Metrics are easier to interpret when the class distribution is known	Print feature names and inspect several rows
Pipeline behavior	Whether scaling is inside the pipeline and fitted only on training data	This prevents leakage from the test set into preprocessing	Remove the scaler and compare convergence and metrics
Error pattern	Confusion matrix, wrong sample indexes, and class-level recall	Wrong examples explain model limitations better than accuracy alone	Compare logistic regression with a tree-based baseline

10. Add This Practice to Your Notes

After running the code, record these details:

How many samples, features, and classes the dataset contains
How many samples are in the training and test sets
The accuracy, precision, recall, and F1-score
Which type of mistake appears more often in the confusion matrix
What changes when you remove standardization or switch models

These notes are more useful than saving only one accuracy value because they help you explain the experiment, not just preserve the result.

11. Series Review

This article turns the previous concepts into code. To revisit the foundations, start again from the AI Basics Learning Roadmap, or return to the Blog page for the full series.

代码运行说明

环境: Python 3 + scikit-learn

安装

python3 -m venv .venv
source .venv/bin/activate
pip install scikit-learn

运行

python ai_classification_demo.py

输入文件: scikit-learn 内置 breast cancer 教学数据集
预期输出: 输出 accuracy、confusion matrix 和 classification report。

安装 python3 -m venv .venv
安装 source .venv/bin/activate
安装 pip install scikit-learn
运行 python ai_classification_demo.py

注意：这个数据集只用于机器学习教学练习，不能用于医疗判断或现实诊断。本文关注的是分类流程，而不是医学结论。

一、准备环境

建议先创建虚拟环境，再安装依赖：

python3 -m venv .venv
source .venv/bin/activate
pip install scikit-learn

本文只使用 scikit-learn，不依赖深度学习框架。这样可以把注意力放在机器学习的基本流程上。

二、完整代码

下面是一份可以直接运行的代码：

from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


def main():
    dataset = load_breast_cancer()
    X = dataset.data
    y = dataset.target

    X_train, X_test, y_train, y_test = train_test_split(
        X,
        y,
        test_size=0.2,
        random_state=42,
        stratify=y,
    )

    model = Pipeline(
        steps=[
            ("scaler", StandardScaler()),
            ("classifier", LogisticRegression(max_iter=500)),
        ]
    )

    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)

    print("Accuracy:", accuracy_score(y_test, y_pred))
    print("Confusion matrix:")
    print(confusion_matrix(y_test, y_pred))
    print("Classification report:")
    print(classification_report(y_test, y_pred, target_names=dataset.target_names))


if __name__ == "__main__":
    main()

保存为 ai_classification_demo.py 后运行：

python ai_classification_demo.py

三、数据集是什么

load_breast_cancer() 会返回一个二分类数据集。每条样本包含多个数值特征，标签表示样本属于哪一类。

在代码里：

X 是特征矩阵，每一行是一条样本
y 是标签数组，每个元素对应一条样本的类别
dataset.target_names 保存类别名称

这个数据集已经被整理成数值特征，适合用来练习基础分类流程。

四、为什么要拆分训练集和测试集

代码中使用了 train_test_split()：

X_train, X_test, y_train, y_test = train_test_split(
    X,
    y,
    test_size=0.2,
    random_state=42,
    stratify=y,
)

这里 test_size=0.2 表示 20% 数据用于测试。stratify=y 表示划分后尽量保持类别比例一致，这对分类任务很有用。

如果不拆分测试集，只在训练集上看结果，模型可能只是记住了训练样本，而不是真的具备泛化能力。

五、为什么使用 Pipeline

代码里没有单独先标准化再训练，而是使用了 Pipeline：

model = Pipeline(
    steps=[
        ("scaler", StandardScaler()),
        ("classifier", LogisticRegression(max_iter=500)),
    ]
)

这样做有两个好处：

标准化和模型训练被放在同一个流程里，不容易漏步骤
测试集会使用训练集上学到的标准化参数，避免数据泄漏

数据泄漏是初学者常见错误。如果你先对完整数据做标准化，再拆分训练集和测试集，测试集的信息就已经提前影响了训练过程。

六、模型为什么选逻辑回归

逻辑回归是分类任务里非常经典的基线模型。它训练速度快、结果稳定、容易解释，适合作为入门模型。

七、怎么看评估结果

代码会输出三类结果：

Accuracy：整体预测正确比例
confusion_matrix：模型把哪些类别预测错了
classification_report：precision、recall、F1-score 等指标

如果准确率很高，也不要马上结束。你还应该看混淆矩阵，确认模型主要错在哪一类；再看 recall 和 precision，判断错误类型是否符合业务要求。

八、一次可复查的预期输出

Accuracy: 0.97...
Confusion matrix:
[[40  3]
 [ 1 70]]
Classification report:
              precision    recall  f1-score   support
   malignant       ...       ...       ...        43
      benign       ...       ...       ...        71

九、可以继续尝试什么

跑通代码后，可以做几个小实验：

把 test_size 改成 0.3，观察结果是否稳定
去掉 StandardScaler，比较指标变化
把 LogisticRegression 换成 RandomForestClassifier
打印 dataset.feature_names，理解每个特征的含义
尝试找出预测错误的样本索引，看看它们有什么特点

人工智能基础学习的关键，是把每个例子都拆成可解释的步骤。你不只是运行了一个分类模型，而是完整走过了一次机器学习工作流。

十、对照实验怎么设计

小实战最有价值的部分是对照实验。不要一次改很多东西，否则不知道结果变化来自哪里。建议一次只改一个变量，并记录指标变化。

实验	只改变什么	观察重点
去掉标准化	删除 `StandardScaler`	逻辑回归是否更难收敛，指标是否下降
改变测试集比例	`test_size` 从 0.2 改成 0.3	指标波动是否仍在可接受范围
更换模型	换成随机森林	是否真正超过基线，错误类型是否改变
去掉分层抽样	删除 `stratify=y`	类别比例是否变化，少数类 recall 是否不稳定

这样做能帮助你从“跑通代码”过渡到“理解实验”。一个结果是否可靠，通常不是看一次分数，而是看在合理扰动下是否仍然稳定。

十一、把这个练习写进学习笔记

建议你运行完代码后，记录下面几项：

数据集有多少样本、多少特征、几个类别
训练集和测试集分别有多少样本
准确率、precision、recall 和 F1-score 分别是多少
混淆矩阵里哪一类错误更多
去掉标准化或换模型后，结果有什么变化

这些记录比单纯截图一个准确率更有价值，因为它们能帮助你解释实验，而不是只保存结果。

十二、系列回顾

这篇文章把前面的内容落到了代码上。需要回看概念时，可以从人工智能基础学习路线重新开始，也可以回到博客页查看完整系列。

搜索问题

常见问题

这篇文章适合谁读？

这篇文章适合想用实战难度理解“Python 人工智能小实战：用 scikit-learn 完成一个分类任务”的读者，预计阅读时间约 10 分钟，重点覆盖 Python, scikit-learn, Classification。

读完后下一步应该看什么？

可以从文末相关阅读、项目页和知识图谱继续进入相邻主题。

这篇文章有没有可运行代码或配套资源？

有。页面里的运行说明、资源卡片和下载入口会指向复现实验所需的命令、数据、代码或说明文件。

这篇文章和整个网站的学习路线有什么关系？

它会通过文章上下文、学习路线、资源库和项目时间线连接到同一主题下的其他内容。

文章上下文

人工智能项目

从 AI、机器学习、训练评估、神经网络到 Python 小实战、手写数字识别、CIFAR-10 CNN、对抗性流量防御和 AI 安全攻防，按顺序建立基础。

难度: 实战阅读时间: 10 分钟

Python
scikit-learn
Classification

继续下一步

继续：手写数字数据结构入门

先补基础打开资源

对应语言版本 Python AI Mini Practice: A Classification Task with scikit-learn

可分享摘要 Python 人工智能小实战：用 scikit-learn 完成一个分类任务

使用 scikit-learn 内置教学数据集跑通一个分类任务，覆盖数据加载、拆分、标准化、训练、预测、评估和实验记录。

下载分享图打开分享中心

配套资源

文章内包含可直接复制运行的 scikit-learn 分类脚本。

打开资源关联文章

发表回复取消回复

要发表评论，您必须先登录。

项目时间线

已发布文章

人工智能基础学习路线：先理解什么是 AI、机器学习和深度学习面向有编程基础的读者，梳理 AI、机器学习、深度学习的关系，并给出可执行的人工智能基础学习路线。
机器学习完整流程：从数据、特征到模型预测从工程视角拆解机器学习完整流程：定义问题、理解数据、处理特征、训练模型、预测和评估。
机器学习算法怎么选：分类、回归、聚类和推荐场景对照表用任务类型、数据规模、解释性和部署成本选择机器学习算法，覆盖逻辑回归、决策树、随机森林、K-means 和表格数据基线模型。
特征工程入门实战：用 scikit-learn 处理缺失值、类别变量和数值标准化用 scikit-learn Pipeline 和 ColumnTransformer 完成特征工程，处理缺失值、类别变量、数值标准化，并避免数据泄漏。
模型训练与评估入门：损失函数、过拟合和准确率怎么理解讲清楚模型训练中的参数、损失函数、梯度下降、过拟合，以及准确率、召回率、F1 等分类评估指标。
过拟合和欠拟合怎么解决：机器学习模型调优实战指南用训练分数和验证分数判断过拟合与欠拟合，并通过模型复杂度、正则化、交叉验证和特征工程调整机器学习模型。
神经网络基础：从感知机到多层网络从一个神经元讲起，解释权重、偏置、激活函数、前向传播、反向传播和典型神经网络训练循环。
神经网络矩阵微积分：从 y = Wx + b 推导 MSE 梯度用手算、矩阵形状图、NumPy 代码和梯度检查解释 y = Wx + b 下 dL/dW = (ŷ - y)x^T 的来源。
反向传播计算图：两层 MLP 的前向、局部梯度和反向传播把两层 MLP 拆成计算图，手算 ReLU、softmax cross-entropy、dW2、dW1，并用 NumPy 复现实验结果。
梯度下降与优化器几何：Momentum、Adam 和 loss surface 轨迹在二维二次函数上手算梯度下降前几步，比较 Momentum 和 Adam 的轨迹，并用代码生成 loss contour。
卷积与感受野数学：5×5 输入、3×3 kernel、padding 和 im2col 手算一次 5x5 输入与 3x3 kernel 的离散卷积，解释输出尺寸、padding、stride、感受野和 im2col。
Transformer Attention 数学：Q/K/V、Softmax 权重、Mask 与 KV Cache 用 3 个 token 手算 scaled dot-product attention，解释 Q/K/V、softmax、mask、多头注意力和 KV cache。
Python 人工智能小实战：用 scikit-learn 完成一个分类任务使用 scikit-learn 内置教学数据集跑通一个分类任务，覆盖数据加载、拆分、标准化、训练、预测、评估和实验记录。
手写数字识别项目入门：先读懂 train.csv、test.csv 和标签结构从项目文件结构入手，读懂手写数字训练集、测试集、标签列和 784 维像素输入，为后续 C 分类器和实验台打基础。
用 C 实现手写数字 Softmax 分类器：从 784 维像素到 submission.csv 结合当前项目源码，讲清楚 softmax 多分类、损失函数、梯度更新、混淆矩阵输出，以及 submission.csv 的生成过程。
手写数字实验记录：怎么把离线分类项目接进浏览器实验台解释浏览器实验台为什么采用轻量预训练模型、它和离线 C 项目的关系，以及如何用样本浏览和手绘输入理解预测结果。
CIFAR-10 Tiny CNN 教程：用 C 语言实现小型卷积神经网络图像分类用单文件 C 程序完成 CIFAR-10 小型 CNN 图像分类，讲解数据格式、网络结构、训练命令、loss、accuracy、常见错误和改进方向。
构建高熵流量防御：基于 Python 的连接层白噪声混淆与对抗性机器学习实践以 mld_chaffing_v2.py 虚幻镜项目为例，讲解加密元数据泄漏、信息熵、分布距离、混淆矩阵、空闲窗口微脉冲和性能测试取舍。
AI 安全威胁建模：用 NIST AML、MITRE ATLAS 和 OWASP 建立攻防地图用 NIST Adversarial ML、MITRE ATLAS 和 OWASP LLM Top 10 建立 AI 安全威胁模型，覆盖资产、攻击面、证据和剩余风险。
对抗样本与鲁棒评估：从 FGSM 公式到 scikit-learn 数字分类实验从 FGSM 公式解释对抗样本，用 scikit-learn digits toy 实验评估 clean accuracy、perturbed accuracy 和扰动预算。
数据投毒与后门攻击防御：污染率、触发器和训练管线隔离用 toy digits 实验解释数据投毒、后门触发器、attack success rate、数据来源审计和训练管线隔离。
模型隐私与模型窃取风险：成员推断、模型抽取和输出接口防护用本地 toy 实验解释成员推断、模型抽取、membership AUC、surrogate fidelity、输出最小化和查询治理。
LLM/RAG/Agent 安全：Prompt Injection、工具权限和边界感知防护从 RAG 和 Agent 架构解释 prompt injection、外部数据降权、工具 allowlist、人工审批和边界感知防护。

已公开资源

Python AI 小实战代码说明文章内包含可直接复制运行的 scikit-learn 分类脚本。
digit_softmax_classifier.c 手写数字 softmax 分类器的 C 语言源码。
train.csv.zip 手写数字训练集压缩包，包含 42000 条带标签样本。
test.csv.zip 手写数字测试集压缩包，包含 28000 条待预测样本。
sample_submission.csv 官方提交格式示例，可直接对照最终输出字段。
submission.csv 当前 C 项目跑出的预测结果文件。
digit-playground-model.json 浏览器实验台使用的轻量 softmax 演示模型与样本。
digit-sample-grid.svg 从训练集中抽取的小型手写数字预览网格。
手写数字项目打包下载包含源码、压缩数据、提交文件、浏览器模型和样本预览图。
cifar10_tiny_cnn.c 源码单文件 C 语言 tiny CNN，包含 CIFAR-10 读取、卷积、池化、softmax 和反向传播。
model_weights.bin 样例权重一次本地小样本运行生成的模型权重文件。
test_predictions.csv 预测样例 CIFAR-10 tiny CNN 输出的测试预测样例。
CNN 项目说明 PDF 配套 CNN 项目说明材料。
虚幻镜脱敏代码骨架去除控制口令、真实节点和目标列表后的 mld_chaffing_v2.py 控制流程说明。
虚幻镜压力测试记录模板用于记录 CPU、内存、线程峰值、微脉冲速率、延迟和错误数的脱敏 CSV 模板。
虚幻镜分类器评估模板用于记录 TP、FN、FP、TN、accuracy、precision、recall、F1、ROC-AUC、熵和 JS 散度的 CSV 模板。
虚幻镜资源说明说明公开资源为何只提供脱敏代码、测试模板和架构笔记。
AI Security Lab 说明说明 AI 安全攻防系列的安全边界、安装命令和 quick-run 实验。
AI Security Lab 完整实验包包含安全 toy scripts、结果 CSV、风险登记表、攻防矩阵和架构图。
AI 安全风险登记表面向 AI 威胁建模和上线评审的 CSV 风险登记模板。
AI 攻防矩阵把攻击面、toy demo、指标和防护控制映射到一张 CSV 表。
AI Security Lab 架构图展示威胁建模、鲁棒评估、数据完整性、模型隐私和 RAG 防护之间的关系。
FGSM digits 鲁棒评估脚本本地 digits 分类器的 FGSM-style 扰动和准确率下降实验。
数据投毒与后门 toy 脚本用 digits 数据演示污染率、触发器和 attack success rate。
模型隐私与抽取 toy 脚本输出 membership AUC、target accuracy、surrogate fidelity 和 surrogate accuracy。
RAG prompt injection guard toy 脚本用确定性 toy agent 演示外部数据降权和工具权限阻断。
Deep Learning Math Lab 说明包含安装命令、脚本入口、输出结果和文章图示生成说明。
深度学习数学完整实验包打包 NumPy 脚本、CSV 结果、公式图、loss contour、卷积图和 attention 热图。
梯度检查结果 CSV 保存 MSE 梯度解析值、数值差分值和误差范数。
优化器轨迹 CSV 记录梯度下降、Momentum 和 Adam 在二维二次函数上的逐步坐标与 loss。
Attention 权重 CSV 三 token scaled dot-product attention 的 scores、softmax weights 和 context 输出。
深度学习数学图示目录包含矩阵形状、计算图、loss contour、卷积扫描和 attention heatmap。
深度学习数学交互演示在浏览器里调梯度检查、优化轨迹、卷积输出尺寸和 attention 权重热图。
深度学习专题分享图用于分享深度学习 / CNN 专题页的 1200x630 SVG 图。
从零实现机器学习分享图用于分享 K-means、Iris 和机器学习流程专题页的 1200x630 SVG 图。
学生 AI 项目分享图用于分享手写数字、C 分类器和浏览器实验台专题页的 1200x630 SVG 图。
CNN 卷积扫描动画 Remotion 生成的 8 秒短动画，展示 3x3 卷积核如何扫描输入并形成特征图。

当前学习路线

人工智能基础学习路线学习路线节点
机器学习完整流程学习路线节点
机器学习算法怎么选学习路线节点
特征工程入门实战学习路线节点
模型训练与评估入门学习路线节点
过拟合和欠拟合怎么解决学习路线节点
神经网络基础学习路线节点
神经网络矩阵微积分学习路线节点
反向传播计算图学习路线节点
梯度下降与优化器几何学习路线节点
卷积与感受野数学学习路线节点
Transformer Attention 数学学习路线节点
LLM 可视化教学台学习路线节点
Python 人工智能小实战学习路线节点
手写数字数据结构入门学习路线节点
用 C 实现手写数字 Softmax 分类器学习路线节点
手写数字实验台说明学习路线节点
CIFAR-10 Tiny CNN 教程学习路线节点
高熵流量防御实验学习路线节点
AI 安全威胁建模学习路线节点
对抗样本与鲁棒评估学习路线节点
数据投毒与后门防御学习路线节点
模型隐私与模型抽取防护学习路线节点
LLM/RAG/Agent 安全学习路线节点

下一步计划

补充更多图像分类和误差分析案例
把常见指标整理成速查表
继续补充 AI 安全防御实验记录

一、准备环境

二、完整代码

三、数据集是什么

四、为什么要拆分训练集和测试集

五、为什么使用 Pipeline

六、模型为什么选逻辑回归

七、怎么看评估结果

八、一次可复查的预期输出

九、可以继续尝试什么

十、对照实验怎么设计

十一、把这个练习写进学习笔记

十二、系列回顾

1. Prepare the Environment

2. Complete Code

3. What the Dataset Contains

4. Why Split Training and Test Data?

5. Why Use Pipeline?

6. Why Logistic Regression?

7. How to Read the Evaluation

8. What to Try Next

9. Practice Run Audit Table

10. Add This Practice to Your Notes

11. Series Review

一、准备环境

二、完整代码

三、数据集是什么

四、为什么要拆分训练集和测试集

五、为什么使用 Pipeline

六、模型为什么选逻辑回归

七、怎么看评估结果

八、一次可复查的预期输出

九、可以继续尝试什么

十、对照实验怎么设计

十一、把这个练习写进学习笔记

十二、系列回顾

这篇文章适合谁读？

读完后下一步应该看什么？

这篇文章有没有可运行代码或配套资源？

这篇文章和整个网站的学习路线有什么关系？

配套资源

Python AI 小实战代码说明

发表回复 取消回复

项目时间线

发表回复取消回复