Project hub
Learning material by project
Each project collects its goal, route, timeline, published posts, resources, and next planned notes.
AI Learning Project
Help readers with programming basics complete a coherent first pass through machine learning and then move into convolutional networks, robust evaluation, model privacy, and LLM/RAG/Agent security.
A practical route from AI concepts to machine learning workflow, evaluation, neural networks, Python practice, handwritten digits, a CIFAR-10 CNN, adversarial traffic-defense notes, and AI security.
Next notes
- Add more image-classification and error-analysis cases
- Turn common metrics into a quick reference
- Add more AI security defense experiment notes
Published posts
AI Basics Learning Roadmap
Separate AI, machine learning, and deep learning before going into implementation details.
Machine Learning Workflow
Follow the practical path from data and features to training, prediction, and evaluation.
Model Training and Evaluation
Understand loss, overfitting, train/test splits, accuracy, recall, and F1.
Neural Network Basics
Move from perceptrons to activation, forward propagation, backpropagation, and training loops.
Matrix Calculus for Neural Networks
Derive dL/dW for y = Wx + b and verify it with finite differences.
Backpropagation as a Computation Graph
Trace local gradients through ReLU and softmax cross-entropy in a two-layer MLP.
Gradient Descent and Optimizer Geometry
Compare gradient descent, momentum, and Adam on a visible quadratic loss surface.
Convolution and Receptive Field Math
Compute convolution output size, receptive fields, channel mixing, and im2col layout.
Transformer Attention Math
Hand-calculate Q/K/V scores, softmax weights, masks, multi-head structure, and KV cache.
NLP Basics: Understanding Bag of Words and TF-IDF
An introduction to the most fundamental text representation methods in NLP: Bag of Words (BoW) and TF-IDF.
RNN Basics: Handling Sequential Data with Memory
Understand the core concepts of Recurrent Neural Networks (RNN), the role of hidden states, and their application in NLP.
Transformer Self-Attention
Read Q/K/V, scaled dot-product attention, multi-head attention, and positional encoding before exploring LLM internals.
Python AI Mini Practice
Run a small scikit-learn classification task and read the experiment output.
Handwritten Digit Dataset Basics
Read train.csv, test.csv, labels, and the flattened 28 by 28 pixel layout before training the classifier.
Handwritten Digit Softmax in C
Follow the C implementation from logits and softmax probabilities to confusion matrices and submission export.
Handwritten Digit Playground Notes
See how the offline classifier was adapted into a browser demo with drawing input and probability output.
CIFAR-10 Tiny CNN Tutorial in C
Build and train a small convolutional neural network for CIFAR-10 image classification, then read its loss and accuracy output.
Building a Tiny CIFAR-10 CNN in C: Convolution, Pooling, and Backpropagation
A source-based walkthrough of cifar10_tiny_cnn.c, covering CIFAR-10 binary input, 3x3 convolution, ReLU, max pooling, fully connected logits, softmax, backpropagation, and local commands.
High-Entropy Traffic Defense Notes
Study encrypted metadata leaks, entropy, traffic classifiers, and a defensive Python chaffing prototype.
AI Security Threat Modeling
Build a defense map with NIST adversarial ML, MITRE ATLAS, and OWASP LLM risks.
Adversarial Examples and Robust Evaluation
Evaluate clean and perturbed accuracy with an FGSM-style digits experiment.
Data Poisoning and Backdoor Defense
Study poison rate, trigger behavior, attack success rate, and training pipeline controls.
Model Privacy and Extraction Defense
Measure membership inference signal and surrogate fidelity against a local toy model.
LLM, RAG, and Agent Security
Separate instructions from data and enforce tool permissions against indirect prompt injection.
Project timeline
Published posts
- AI Basics Learning Roadmap Separate AI, machine learning, and deep learning before going into implementation details.
- Machine Learning Workflow Follow the practical path from data and features to training, prediction, and evaluation.
- Model Training and Evaluation Understand loss, overfitting, train/test splits, accuracy, recall, and F1.
- Neural Network Basics Move from perceptrons to activation, forward propagation, backpropagation, and training loops.
- Matrix Calculus for Neural Networks Derive dL/dW for y = Wx + b and verify it with finite differences.
- Backpropagation as a Computation Graph Trace local gradients through ReLU and softmax cross-entropy in a two-layer MLP.
- Gradient Descent and Optimizer Geometry Compare gradient descent, momentum, and Adam on a visible quadratic loss surface.
- Convolution and Receptive Field Math Compute convolution output size, receptive fields, channel mixing, and im2col layout.
- Transformer Attention Math Hand-calculate Q/K/V scores, softmax weights, masks, multi-head structure, and KV cache.
- NLP Basics: Understanding Bag of Words and TF-IDF An introduction to the most fundamental text representation methods in NLP: Bag of Words (BoW) and TF-IDF.
- RNN Basics: Handling Sequential Data with Memory Understand the core concepts of Recurrent Neural Networks (RNN), the role of hidden states, and their application in NLP.
- Transformer Self-Attention Read Q/K/V, scaled dot-product attention, multi-head attention, and positional encoding before exploring LLM internals.
- Python AI Mini Practice Run a small scikit-learn classification task and read the experiment output.
- Handwritten Digit Dataset Basics Read train.csv, test.csv, labels, and the flattened 28 by 28 pixel layout before training the classifier.
- Handwritten Digit Softmax in C Follow the C implementation from logits and softmax probabilities to confusion matrices and submission export.
- Handwritten Digit Playground Notes See how the offline classifier was adapted into a browser demo with drawing input and probability output.
- CIFAR-10 Tiny CNN Tutorial in C Build and train a small convolutional neural network for CIFAR-10 image classification, then read its loss and accuracy output.
- Building a Tiny CIFAR-10 CNN in C: Convolution, Pooling, and Backpropagation A source-based walkthrough of cifar10_tiny_cnn.c, covering CIFAR-10 binary input, 3x3 convolution, ReLU, max pooling, fully connected logits, softmax, backpropagation, and local commands.
- High-Entropy Traffic Defense Notes Study encrypted metadata leaks, entropy, traffic classifiers, and a defensive Python chaffing prototype.
- AI Security Threat Modeling Build a defense map with NIST adversarial ML, MITRE ATLAS, and OWASP LLM risks.
- Adversarial Examples and Robust Evaluation Evaluate clean and perturbed accuracy with an FGSM-style digits experiment.
- Data Poisoning and Backdoor Defense Study poison rate, trigger behavior, attack success rate, and training pipeline controls.
- Model Privacy and Extraction Defense Measure membership inference signal and surrogate fidelity against a local toy model.
- LLM, RAG, and Agent Security Separate instructions from data and enforce tool permissions against indirect prompt injection.
Published resources
- Python AI practice code guide The article includes a runnable scikit-learn classification script.
- digit_softmax_classifier.c The C source for the handwritten digit softmax classifier.
- train.csv.zip Compressed handwritten digit training set with 42000 labeled samples.
- test.csv.zip Compressed handwritten digit test set with 28000 unlabeled samples.
- sample_submission.csv The official submission format example for checking the final output columns.
- submission.csv The prediction file generated by the current C project.
- digit-playground-model.json The compact softmax demo model and sample set used by the browser playground.
- digit-sample-grid.svg A small handwritten digit preview grid extracted from the training set.
- Handwritten digit project bundle Contains the source file, compressed datasets, submission files, browser model, and preview grid.
- cifar10_tiny_cnn.c source Single-file C tiny CNN with CIFAR-10 loading, convolution, pooling, softmax, and backpropagation.
- model_weights.bin sample weights Model weights generated by one local small-sample run.
- test_predictions.csv sample predictions Sample test prediction output from the CIFAR-10 tiny CNN.
- CNN project explanation PDF Companion explanation material for the CNN project.
- Virtual Mirror redacted code skeleton A redacted mld_chaffing_v2.py control-flow skeleton with secrets, node topology, and target lists removed.
- Virtual Mirror stress-test template A redacted CSV template for CPU, memory, peak threads, pulse rate, latency, and error measurements.
- Virtual Mirror classifier-evaluation template A CSV template for TP, FN, FP, TN, accuracy, precision, recall, F1, ROC-AUC, entropy, and JS divergence.
- Virtual Mirror resource notes Notes explaining why the public resources include only redacted code, test templates, and architecture context.
- AI Security Lab README Setup, safety boundaries, and quick-run commands for the AI Security series.
- AI Security Lab full bundle Includes safe toy scripts, result CSVs, risk register, attack-defense matrix, and architecture diagram.
- AI security risk register CSV risk register template for AI threat modeling and release review.
- AI attack-defense matrix Maps attack surface, toy demo, metric, and defensive control into one CSV table.
- AI Security Lab architecture diagram Shows threat modeling, robustness, data integrity, model privacy, and RAG guardrails.
- FGSM digits robustness script FGSM-style perturbation and accuracy-drop experiment for a local digits classifier.
- Data poisoning and backdoor toy script Demonstrates poison rate, trigger behavior, and attack success rate on digits.
- Model privacy and extraction toy script Outputs membership AUC, target accuracy, surrogate fidelity, and surrogate accuracy.
- RAG prompt injection guard toy script Uses a deterministic toy agent to demonstrate external-data demotion and tool-policy blocking.
- Deep Learning Math Lab README Setup commands, script entry points, generated outputs, and figure notes for the math series.
- Deep learning math full lab bundle Bundles NumPy scripts, CSV outputs, formula diagrams, loss contours, convolution figures, and attention heatmaps.
- Gradient check results CSV Stores MSE analytic gradients, finite-difference gradients, and error norms.
- Optimizer path CSV Step-by-step coordinates and loss for gradient descent, momentum, and Adam on a 2D quadratic.
- Attention weights CSV Scores, softmax weights, and context vectors for a three-token scaled dot-product attention example.
- Deep learning math figure set Includes matrix shapes, computation graphs, loss contours, convolution scans, and attention heatmaps.
- Deep learning math interactive visualizer Browser modules for gradient checking, optimizer paths, convolution output size, and attention heatmaps.
- Deep Learning topic share card A 1200x630 SVG card for sharing the Deep Learning / CNN topic hub.
- Machine Learning From Scratch share card A 1200x630 SVG card for the K-means, Iris, and ML workflow topic hub.
- Student AI Projects share card A 1200x630 SVG card for handwritten digits, C classifiers, and browser demos.
- CNN convolution scan animation An 8-second Remotion animation showing how a 3x3 convolution kernel scans an input and builds a feature map.
Current route
- AI Basics Learning Roadmap Learning path step
- Machine Learning Workflow Learning path step
- Model Training and Evaluation Learning path step
- Neural Network Basics Learning path step
- Matrix Calculus for Neural Networks Learning path step
- Backpropagation as a Computation Graph Learning path step
- Gradient Descent and Optimizer Geometry Learning path step
- Convolution and Receptive Field Math Learning path step
- Transformer Attention Math Learning path step
- Transformer Self-Attention Learning path step
- LLM Visualizer Learning path step
- Python AI Mini Practice Learning path step
- Handwritten Digit Dataset Basics Learning path step
- Handwritten Digit Softmax in C Learning path step
- Handwritten Digit Playground Notes Learning path step
- CIFAR-10 Tiny CNN Tutorial in C Learning path step
- High-Entropy Traffic Defense Notes Learning path step
- AI Security Threat Modeling Learning path step
- Adversarial Examples and Robust Evaluation Learning path step
- Data Poisoning and Backdoor Defense Learning path step
- Model Privacy and Extraction Defense Learning path step
- LLM, RAG, and Agent Security Learning path step
Next notes
- Add more image-classification and error-analysis cases
- Turn common metrics into a quick reference
- Add more AI security defense experiment notes
Resources
AI Learning Project / CODE
Python AI practice code guide
The article includes a runnable scikit-learn classification script.
AI Learning Project / CODE
digit_softmax_classifier.c
The C source for the handwritten digit softmax classifier.
AI Learning Project / DATASET
train.csv.zip
Compressed handwritten digit training set with 42000 labeled samples.
AI Learning Project / DATASET
test.csv.zip
Compressed handwritten digit test set with 28000 unlabeled samples.
AI Learning Project / DATASET
sample_submission.csv
The official submission format example for checking the final output columns.
AI Learning Project / DATASET
submission.csv
The prediction file generated by the current C project.
AI Learning Project / DATASET
digit-playground-model.json
The compact softmax demo model and sample set used by the browser playground.
AI Learning Project / DIAGRAM
digit-sample-grid.svg
A small handwritten digit preview grid extracted from the training set.
AI Learning Project / ARCHIVE
Handwritten digit project bundle
Contains the source file, compressed datasets, submission files, browser model, and preview grid.
AI Learning Project / CODE
cifar10_tiny_cnn.c source
Single-file C tiny CNN with CIFAR-10 loading, convolution, pooling, softmax, and backpropagation.
AI Learning Project / DATASET
model_weights.bin sample weights
Model weights generated by one local small-sample run.
AI Learning Project / DATASET
test_predictions.csv sample predictions
Sample test prediction output from the CIFAR-10 tiny CNN.
AI Learning Project / DIAGRAM
CNN project explanation PDF
Companion explanation material for the CNN project.
AI Learning Project / CODE
Virtual Mirror redacted code skeleton
A redacted mld_chaffing_v2.py control-flow skeleton with secrets, node topology, and target lists removed.
AI Learning Project / DATASET
Virtual Mirror stress-test template
A redacted CSV template for CPU, memory, peak threads, pulse rate, latency, and error measurements.
AI Learning Project / DATASET
Virtual Mirror classifier-evaluation template
A CSV template for TP, FN, FP, TN, accuracy, precision, recall, F1, ROC-AUC, entropy, and JS divergence.
AI Learning Project / GUIDE
Virtual Mirror resource notes
Notes explaining why the public resources include only redacted code, test templates, and architecture context.
AI Learning Project / GUIDE
AI Security Lab README
Setup, safety boundaries, and quick-run commands for the AI Security series.
AI Learning Project / ARCHIVE
AI Security Lab full bundle
Includes safe toy scripts, result CSVs, risk register, attack-defense matrix, and architecture diagram.
AI Learning Project / DATASET
AI security risk register
CSV risk register template for AI threat modeling and release review.
AI Learning Project / DATASET
AI attack-defense matrix
Maps attack surface, toy demo, metric, and defensive control into one CSV table.
AI Learning Project / DIAGRAM
AI Security Lab architecture diagram
Shows threat modeling, robustness, data integrity, model privacy, and RAG guardrails.
AI Learning Project / CODE
FGSM digits robustness script
FGSM-style perturbation and accuracy-drop experiment for a local digits classifier.
AI Learning Project / CODE
Data poisoning and backdoor toy script
Demonstrates poison rate, trigger behavior, and attack success rate on digits.
AI Learning Project / CODE
Model privacy and extraction toy script
Outputs membership AUC, target accuracy, surrogate fidelity, and surrogate accuracy.
AI Learning Project / CODE
RAG prompt injection guard toy script
Uses a deterministic toy agent to demonstrate external-data demotion and tool-policy blocking.
AI Learning Project / GUIDE
Deep Learning Math Lab README
Setup commands, script entry points, generated outputs, and figure notes for the math series.
AI Learning Project / ARCHIVE
Deep learning math full lab bundle
Bundles NumPy scripts, CSV outputs, formula diagrams, loss contours, convolution figures, and attention heatmaps.
AI Learning Project / DATASET
Gradient check results CSV
Stores MSE analytic gradients, finite-difference gradients, and error norms.
AI Learning Project / DATASET
Optimizer path CSV
Step-by-step coordinates and loss for gradient descent, momentum, and Adam on a 2D quadratic.
AI Learning Project / DATASET
Attention weights CSV
Scores, softmax weights, and context vectors for a three-token scaled dot-product attention example.
AI Learning Project / DIAGRAM
Deep learning math figure set
Includes matrix shapes, computation graphs, loss contours, convolution scans, and attention heatmaps.
AI Learning Project / TOOL
Deep learning math interactive visualizer
Browser modules for gradient checking, optimizer paths, convolution output size, and attention heatmaps.
AI Learning Project / SOCIAL
Deep Learning topic share card
A 1200x630 SVG card for sharing the Deep Learning / CNN topic hub.
AI Learning Project / SOCIAL
Machine Learning From Scratch share card
A 1200x630 SVG card for the K-means, Iris, and ML workflow topic hub.
AI Learning Project / SOCIAL
Student AI Projects share card
A 1200x630 SVG card for handwritten digits, C classifiers, and browser demos.
AI Learning Project / VIDEO
CNN convolution scan animation
An 8-second Remotion animation showing how a 3x3 convolution kernel scans an input and builds a feature map.
Algorithm Implementation Project
Keep algorithm explanations and runnable implementations together as durable references.
Implementation-focused notes around backtracking, bit operations, clustering, code, diagrams, and downloads.
Next notes
- Add more runnable algorithm examples
- Expand downloadable example inputs
Published posts
Eight queens with classic backtracking
Start with state representation, conflict checks, recursive search, and the full C/Python solving flow.
Bitwise optimization for the eight queens problem
Move from array-based checks to compact bit masks so the same search becomes faster and easier to reason about.
K-means clustering on the Iris dataset in C
Study standardization, K-means++ initialization, restart selection, SSE, and the final clustering result.
Project timeline
Published posts
- Eight queens with classic backtracking Start with state representation, conflict checks, recursive search, and the full C/Python solving flow.
- Bitwise optimization for the eight queens problem Move from array-based checks to compact bit masks so the same search becomes faster and easier to reason about.
- K-means clustering on the Iris dataset in C Study standardization, K-means++ initialization, restart selection, SSE, and the final clustering result.
Published resources
- Iris.csv dataset The 150-sample Iris dataset used by the K-means article.
- Iris_sort_K_mean.c source Includes standardization, K-means++ initialization, restarts, and SSE selection.
- K-means flowchart SVG flowchart for the C program execution path.
- Cluster visualization A 2D scatter projection using petal length and petal width.
- K-means zip package Contains dataset, source code, flowchart, and visualization.
- Gaoshu Lianxi PDF A public advanced calculus practice PDF for review or printing.
- Algorithm Visualization share card A 1200x630 SVG card for eight queens, backtracking, bitmasks, and the playground.
- K-means iteration animation A Remotion clip showing sample assignment, centroid updates, and SSE reduction.
- Eight queens backtracking animation A Remotion clip showing row-by-row search, conflict pruning, and backtracking.
Current route
- Eight queens with classic backtracking Learning path step
- Bitwise optimization for the eight queens problem Learning path step
- K-means clustering on the Iris dataset in C Learning path step
- K-means companion downloads Learning path step
Next notes
- Add more runnable algorithm examples
- Expand downloadable example inputs
Resources
Algorithm Implementation Project / DATASET
Iris.csv dataset
The 150-sample Iris dataset used by the K-means article.
Algorithm Implementation Project / CODE
Iris_sort_K_mean.c source
Includes standardization, K-means++ initialization, restarts, and SSE selection.
Algorithm Implementation Project / DIAGRAM
K-means flowchart
SVG flowchart for the C program execution path.
Algorithm Implementation Project / DIAGRAM
Cluster visualization
A 2D scatter projection using petal length and petal width.
Algorithm Implementation Project / ARCHIVE
K-means zip package
Contains dataset, source code, flowchart, and visualization.
Algorithm Implementation Project / GUIDE
Gaoshu Lianxi PDF
A public advanced calculus practice PDF for review or printing.
Algorithm Implementation Project / SOCIAL
Algorithm Visualization share card
A 1200x630 SVG card for eight queens, backtracking, bitmasks, and the playground.
Algorithm Implementation Project / VIDEO
K-means iteration animation
A Remotion clip showing sample assignment, centroid updates, and SSE reduction.
Algorithm Implementation Project / VIDEO
Eight queens backtracking animation
A Remotion clip showing row-by-row search, conflict pruning, and backtracking.
Site Building Project
Keep the site-building process maintainable, reversible, and ready to extend.
Notes on the bilingual site structure, content sync, categories, comments, and deployment workflow.
Next notes
- Keep documenting deployment and maintenance notes
- Clarify the content sync workflow
Published posts
Welcome to haotianblog: what this bilingual technical site covers
An introduction to the bilingual site, its technical focus, and the kinds of articles and resources it will publish.
Project timeline
Published posts
- Welcome to haotianblog: what this bilingual technical site covers An introduction to the bilingual site, its technical focus, and the kinds of articles and resources it will publish.
Published resources
- SEO distribution short-video storyboards Four 45-60 second storyboard scripts ready for later Remotion production.
Next notes
- Keep documenting deployment and maintenance notes
- Clarify the content sync workflow
Resources
Site Building Project / VIDEO
SEO distribution short-video storyboards
Four 45-60 second storyboard scripts ready for later Remotion production.
