James Han
Sidekick + CX R&D @ Shopify
prev. ML Research at Mozilla
jameshan.cs@gmail.com

Feature Engineering and Embeddings: Index

From Foundations
Feature Engineering and Embeddings
Page metadata
First created Apr 5, 2026
Last edited May 31, 2026

Index

  • Feature Engineering and Embeddings