English
Handwritten Digit Playground Notes: Bringing an Offline Classifier into the Browser
After adding the handwritten digit project to the site, I did not want the playground to retrain all 42000 samples inside the browser. That would be too heavy, especially on phones. The current setup is more practical: keep the full offline C project as a download, and add a lightweight interactive browser model for immediate experimentation.
That split keeps the system honest. Visitors do not need a local compiler just to understand the input and prediction flow, and the offline project can stay focused on the actual implementation instead of being distorted to fit a front-end demo.
1. What the handwritten digit module does
The new playground tab supports three main actions:
- Browse real samples loaded from the training set into a 28 by 28 grid
- Draw your own input directly in the browser and run a prediction
- Inspect class probabilities for all ten digits instead of only showing one final label
This is useful for learning because you can see not only what the model predicts, but also which competing digits remain plausible.
2. How the browser model relates to the C project
The page does not load the full training loop. It loads a compact set of pre-trained softmax weights. That keeps the browser demo in the same model family as the C project while making the interaction fast enough for normal page use.
The current browser demo weights come from a smaller training split:
- training samples: 10000
- validation samples: 2000
- epochs: 18
- learning rate: 0.35
- validation accuracy: about 90.65%
The goal is not to chase the strongest possible score. The goal is to make the model response immediate and interpretable in the browser while preserving the full downloadable project separately.
3. Why the browser does not retrain the whole project
The reasons are straightforward:
- the full dataset is too large to reprocess on every page load
- repeated gradient updates in the browser would hurt interaction quality
- mobile users need a stable experience more than a heavy in-page training loop
So the playground is better used as a visualization and interaction surface, not as a replacement for the offline training environment.
4. The best way to use the module
A good sequence is:
- load a real sample and confirm that the grid and prediction agree
- edit a few strokes and watch the probability distribution change
- clear the board and draw a digit yourself to see where the model hesitates
If you draw a 9 and the model also gives meaningful probability to 4 or 7, that is not a failure of the UI. It is a useful view into the model boundary.
5. How it fits with the posts and downloads
The handwritten digit content is now structured as one connected set:
- post one explains the dataset files and the 784-dimensional input format
- post two explains the C softmax classifier, training loop, and exported predictions
- the downloads page collects the source, compressed datasets, sample submission, final submission, and browser model file
- the playground provides the interactive lightweight version
That way, readers can understand the logic in the posts, grab the files from downloads, and try the behavior immediately in the playground.
6. What is worth improving next
This version is already useful for teaching, but there are clear next steps:
- add input helpers such as brush-size controls or centering logic
- collect misclassified examples into a browsable error set
- compare the browser demo outputs against the offline C project in more detail
For a learning-oriented technical site, those improvements matter more than extra decoration. The point is to make the relationship between input, model, and output easier to study.
7. Where to continue
If you have not read the implementation post yet, continue with the C classifier article. If you want to run the full project locally, go straight to the downloads page. If you just want to experiment right away, open the playground and start drawing digits.
Chinese
手写数字实验记录:怎么把离线分类项目接进浏览器实验台
Open as a full page手写数字项目接进站点之后,我没有把实验台做成“在浏览器里重新训练完整 42000 条数据”的版本。那样会太重,也不适合手机端。现在的做法更实用:保留离线 C 项目的完整下载版本,再给实验台加一个轻量、可交互、能立刻看到预测结果的浏览器模型。
这样分层有两个好处。一是网站访问者不需要本地编译环境,也能直接理解输入、概率和预测之间的关系。二是离线项目和网页演示各自边界清楚,不会为了前端演示把原始实现改得面目全非。
一、实验台里的手写数字模块做了什么
新标签页现在支持三件事:
- 浏览样本:从训练集抽取一批真实数字样本,直接加载到 28 x 28 网格里
- 手绘输入:在浏览器里直接画数字,立即查看模型预测
- 概率面板:同时显示 0 到 9 十个类别的概率分布,而不是只给一个结果
这类界面对学习很有帮助,因为你不只是看到“答案是什么”,还能看到模型为什么更偏向某几个数字。
二、浏览器版模型和离线 C 项目有什么关系
网页里加载的不是完整训练过程,而是一份预训练好的轻量 softmax 权重文件。它和 C 项目属于同一类多分类线性模型,只是为了网页体积和响应速度,改成了预先训练后再加载。
当前浏览器演示权重来自一份较小的训练切片:
- 训练样本:10000 条
- 验证样本:2000 条
- 训练轮次:18
- 学习率:0.35
- 验证准确率:约 90.65%
这不是为了追求最强分数,而是为了让实验台在浏览器里也能保持即时反馈。真正完整的数据和源码仍然保留在下载包里。
三、为什么不直接在前端重跑完整训练
原因很现实:
- 完整训练集太大,不适合每次页面加载都重新处理
- 浏览器端反复做大规模梯度更新,会显著拖慢页面交互
- 手机端更需要稳定的体验,而不是把 CPU 全部占满
所以实验台更适合承担“可视化理解”和“交互试验”的角色,而不是替代离线训练环境。
四、这个模块最适合怎么用
建议按这个顺序体验:
- 先加载一个真实样本,确认数字网格和预测结果是否一致
- 再手动改几笔,观察概率怎么变化
- 最后清空画板,自己画一个数字,看模型最容易在哪些数字之间犹豫
如果你画了一个 9,但模型同时给 4 和 7 很高概率,这通常不是坏事,反而说明你已经看到模型的边界了。
五、这个实验台和文章、下载页怎么配合
现在这组内容已经形成了比较完整的结构:
- 第一篇 先讲数据文件和 784 维输入格式
- 第二篇 再讲 C 语言 softmax 分类器如何训练和导出结果
- 下载页 集中放源码、压缩数据、示例提交文件和浏览器模型文件
- 实验台 提供直接可交互的轻量版本
这样安排之后,读者可以在文章里理解原理,在下载区拿到材料,在实验台里马上试。
六、下一步还能继续怎么增强
这一版已经能承担教学演示,但后面还可以继续做三类增强:
- 增加“笔刷粗细”和“自动居中”之类的输入处理
- 把错误样本单独整理成可切换的案例列表
- 把浏览器端模型和离线 C 项目的输出做更细的对照说明
对学习型网站来说,这类增强比堆更多装饰更有意义。重点不是页面看起来像 AI 产品,而是让读者真的能比较输入、模型和输出。
七、直接去哪里继续
如果你还没看源码文章,建议先回到 C 语言实现说明。如果你已经准备自己跑一次完整项目,就直接打开 下载页 拿材料;如果只是想快速试一下,就去 实验台 直接画几个数字。
After adding the handwritten digit project to the site, I did not want the playground to retrain all 42000 samples inside the browser. That would be too heavy, especially on phones. The current setup is more practical: keep the full offline C project as a download, and add a lightweight interactive browser model for immediate experimentation.
That split keeps the system honest. Visitors do not need a local compiler just to understand the input and prediction flow, and the offline project can stay focused on the actual implementation instead of being distorted to fit a front-end demo.
1. What the handwritten digit module does
The new playground tab supports three main actions:
- Browse real samples loaded from the training set into a 28 by 28 grid
- Draw your own input directly in the browser and run a prediction
- Inspect class probabilities for all ten digits instead of only showing one final label
This is useful for learning because you can see not only what the model predicts, but also which competing digits remain plausible.
2. How the browser model relates to the C project
The page does not load the full training loop. It loads a compact set of pre-trained softmax weights. That keeps the browser demo in the same model family as the C project while making the interaction fast enough for normal page use.
The current browser demo weights come from a smaller training split:
- training samples: 10000
- validation samples: 2000
- epochs: 18
- learning rate: 0.35
- validation accuracy: about 90.65%
The goal is not to chase the strongest possible score. The goal is to make the model response immediate and interpretable in the browser while preserving the full downloadable project separately.
3. Why the browser does not retrain the whole project
The reasons are straightforward:
- the full dataset is too large to reprocess on every page load
- repeated gradient updates in the browser would hurt interaction quality
- mobile users need a stable experience more than a heavy in-page training loop
So the playground is better used as a visualization and interaction surface, not as a replacement for the offline training environment.
4. The best way to use the module
A good sequence is:
- load a real sample and confirm that the grid and prediction agree
- edit a few strokes and watch the probability distribution change
- clear the board and draw a digit yourself to see where the model hesitates
If you draw a 9 and the model also gives meaningful probability to 4 or 7, that is not a failure of the UI. It is a useful view into the model boundary.
5. How it fits with the posts and downloads
The handwritten digit content is now structured as one connected set:
- post one explains the dataset files and the 784-dimensional input format
- post two explains the C softmax classifier, training loop, and exported predictions
- the downloads page collects the source, compressed datasets, sample submission, final submission, and browser model file
- the playground provides the interactive lightweight version
That way, readers can understand the logic in the posts, grab the files from downloads, and try the behavior immediately in the playground.
6. What is worth improving next
This version is already useful for teaching, but there are clear next steps:
- add input helpers such as brush-size controls or centering logic
- collect misclassified examples into a browsable error set
- compare the browser demo outputs against the offline C project in more detail
For a learning-oriented technical site, those improvements matter more than extra decoration. The point is to make the relationship between input, model, and output easier to study.
7. Where to continue
If you have not read the implementation post yet, continue with the C classifier article. If you want to run the full project locally, go straight to the downloads page. If you just want to experiment right away, open the playground and start drawing digits.
Search questions
FAQ
Who is this article for?
This article is for readers who want a practice-level guide to Handwritten Digit Playground Notes. It takes about 8 min and focuses on Browser Demo, Softmax, Visualization.
What should I read next?
The recommended next step is CIFAR-10 Tiny CNN Tutorial in C, so the article connects into a longer learning route instead of ending as an isolated note.
Does this article include runnable code or companion resources?
Yes. Use the run notes, resource cards, and download links on the page to reproduce the example or inspect the companion files.
How does this article fit into the larger site?
It is connected to the article context block, learning routes, resources, and project timeline so readers can move from concept to implementation.
Article context
AI Learning Project
A practical route from AI concepts to machine learning workflow, evaluation, neural networks, Python practice, handwritten digits, a CIFAR-10 CNN, adversarial traffic-defense notes, and AI security.
See how the offline classifier was adapted into a browser demo with drawing input and probability output.
Download share card Open share centerCompanion resources
AI Learning Project / DATASET
digit-playground-model.json
The compact softmax demo model and sample set used by the browser playground.
AI Learning Project / DIAGRAM
digit-sample-grid.svg
A small handwritten digit preview grid extracted from the training set.
AI Learning Project / ARCHIVE
Handwritten digit project bundle
Contains the source file, compressed datasets, submission files, browser model, and preview grid.
Project timeline
Published posts
- AI Basics Learning Roadmap Separate AI, machine learning, and deep learning before going into implementation details.
- Machine Learning Workflow Follow the practical path from data and features to training, prediction, and evaluation.
- Model Training and Evaluation Understand loss, overfitting, train/test splits, accuracy, recall, and F1.
- Neural Network Basics Move from perceptrons to activation, forward propagation, backpropagation, and training loops.
- NLP Basics: Understanding Bag of Words and TF-IDF An introduction to the most fundamental text representation methods in NLP: Bag of Words (BoW) and TF-IDF.
- RNN Basics: Handling Sequential Data with Memory Understand the core concepts of Recurrent Neural Networks (RNN), the role of hidden states, and their application in NLP.
- Transformer Self-Attention Read Q/K/V, scaled dot-product attention, multi-head attention, and positional encoding before exploring LLM internals.
- Python AI Mini Practice Run a small scikit-learn classification task and read the experiment output.
- Handwritten Digit Dataset Basics Read train.csv, test.csv, labels, and the flattened 28 by 28 pixel layout before training the classifier.
- Handwritten Digit Softmax in C Follow the C implementation from logits and softmax probabilities to confusion matrices and submission export.
- Handwritten Digit Playground Notes See how the offline classifier was adapted into a browser demo with drawing input and probability output.
- CIFAR-10 Tiny CNN Tutorial in C Build and train a small convolutional neural network for CIFAR-10 image classification, then read its loss and accuracy output.
- Building a Tiny CIFAR-10 CNN in C: Convolution, Pooling, and Backpropagation A source-based walkthrough of cifar10_tiny_cnn.c, covering CIFAR-10 binary input, 3x3 convolution, ReLU, max pooling, fully connected logits, softmax, backpropagation, and local commands.
- High-Entropy Traffic Defense Notes Study encrypted metadata leaks, entropy, traffic classifiers, and a defensive Python chaffing prototype.
- AI Security Threat Modeling Build a defense map with NIST adversarial ML, MITRE ATLAS, and OWASP LLM risks.
- Adversarial Examples and Robust Evaluation Evaluate clean and perturbed accuracy with an FGSM-style digits experiment.
- Data Poisoning and Backdoor Defense Study poison rate, trigger behavior, attack success rate, and training pipeline controls.
- Model Privacy and Extraction Defense Measure membership inference signal and surrogate fidelity against a local toy model.
- LLM, RAG, and Agent Security Separate instructions from data and enforce tool permissions against indirect prompt injection.
Published resources
- Python AI practice code guide The article includes a runnable scikit-learn classification script.
- digit_softmax_classifier.c The C source for the handwritten digit softmax classifier.
- train.csv.zip Compressed handwritten digit training set with 42000 labeled samples.
- test.csv.zip Compressed handwritten digit test set with 28000 unlabeled samples.
- sample_submission.csv The official submission format example for checking the final output columns.
- submission.csv The prediction file generated by the current C project.
- digit-playground-model.json The compact softmax demo model and sample set used by the browser playground.
- digit-sample-grid.svg A small handwritten digit preview grid extracted from the training set.
- Handwritten digit project bundle Contains the source file, compressed datasets, submission files, browser model, and preview grid.
- cifar10_tiny_cnn.c source Single-file C tiny CNN with CIFAR-10 loading, convolution, pooling, softmax, and backpropagation.
- model_weights.bin sample weights Model weights generated by one local small-sample run.
- test_predictions.csv sample predictions Sample test prediction output from the CIFAR-10 tiny CNN.
- CNN project explanation PDF Companion explanation material for the CNN project.
- Virtual Mirror redacted code skeleton A redacted mld_chaffing_v2.py control-flow skeleton with secrets, node topology, and target lists removed.
- Virtual Mirror stress-test template A redacted CSV template for CPU, memory, peak threads, pulse rate, latency, and error measurements.
- Virtual Mirror classifier-evaluation template A CSV template for TP, FN, FP, TN, accuracy, precision, recall, F1, ROC-AUC, entropy, and JS divergence.
- Virtual Mirror resource notes Notes explaining why the public resources include only redacted code, test templates, and architecture context.
- AI Security Lab README Setup, safety boundaries, and quick-run commands for the AI Security series.
- AI Security Lab full bundle Includes safe toy scripts, result CSVs, risk register, attack-defense matrix, and architecture diagram.
- AI security risk register CSV risk register template for AI threat modeling and release review.
- AI attack-defense matrix Maps attack surface, toy demo, metric, and defensive control into one CSV table.
- AI Security Lab architecture diagram Shows threat modeling, robustness, data integrity, model privacy, and RAG guardrails.
- FGSM digits robustness script FGSM-style perturbation and accuracy-drop experiment for a local digits classifier.
- Data poisoning and backdoor toy script Demonstrates poison rate, trigger behavior, and attack success rate on digits.
- Model privacy and extraction toy script Outputs membership AUC, target accuracy, surrogate fidelity, and surrogate accuracy.
- RAG prompt injection guard toy script Uses a deterministic toy agent to demonstrate external-data demotion and tool-policy blocking.
- Deep Learning topic share card A 1200x630 SVG card for sharing the Deep Learning / CNN topic hub.
- Machine Learning From Scratch share card A 1200x630 SVG card for the K-means, Iris, and ML workflow topic hub.
- Student AI Projects share card A 1200x630 SVG card for handwritten digits, C classifiers, and browser demos.
- CNN convolution scan animation An 8-second Remotion animation showing how a 3x3 convolution kernel scans an input and builds a feature map.
Current route
- AI Basics Learning Roadmap Learning path step
- Machine Learning Workflow Learning path step
- Model Training and Evaluation Learning path step
- Neural Network Basics Learning path step
- Transformer Self-Attention Learning path step
- LLM Visualizer Learning path step
- Python AI Mini Practice Learning path step
- Handwritten Digit Dataset Basics Learning path step
- Handwritten Digit Softmax in C Learning path step
- Handwritten Digit Playground Notes Learning path step
- CIFAR-10 Tiny CNN Tutorial in C Learning path step
- High-Entropy Traffic Defense Notes Learning path step
- AI Security Threat Modeling Learning path step
- Adversarial Examples and Robust Evaluation Learning path step
- Data Poisoning and Backdoor Defense Learning path step
- Model Privacy and Extraction Defense Learning path step
- LLM, RAG, and Agent Security Learning path step
Next notes
- Add more image-classification and error-analysis cases
- Turn common metrics into a quick reference
- Add more AI security defense experiment notes
