Apr 15, 2026 · 14 min read · ML / Math

Introduction to Machine Learning

In this series (5 parts)

Introduction to Machine Learning
Supervised Learning - Learning from Labeled Data
Regression - Predicting Continuous Values
Classification - Predicting Categories
Unsupervised Learning - Finding Hidden Patterns

You’ve probably heard the term machine learning thrown around - in tech blogs, product launches, even coffee shop conversations. But what does it actually mean? And why does it matter?

This is the first post in the ML from Scratch series. We’ll build up from zero - no math prerequisites, no prior ML experience. By the end of this series, you’ll understand the core algorithms, the math behind them, and how to apply them to real problems.

What is Machine Learning?

At its core, machine learning is a way of teaching computers to learn from data instead of being explicitly programmed with rules.

In traditional programming, you write rules:

flowchart LR
  A["Input Data"] --> B["Hand-Written Rules"]
  B --> C["Output"]
  style B fill:#fdf3e8,stroke:#7a4a1a,color:#7a4a1a

In machine learning, you flip this:

flowchart LR
  A["Input Data"] --> B["ML Algorithm"]
  C["Expected Output"] --> B
  B --> D["Learned Model"]
  D --> E["Predictions on New Data"]
  style B fill:#e8f8f0,stroke:#1a5c38,color:#1a5c38
  style D fill:#e8f8f0,stroke:#1a5c38,color:#1a5c38

Instead of telling the computer how to solve a problem, you give it examples and let it figure out the patterns on its own.

Arthur Samuel (1959): “Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed.”

Traditional Programming vs Machine Learning

Aspect	Traditional Programming	Machine Learning
Input	Data + Rules	Data + Expected Output
Output	Results	Learned Rules (Model)
Approach	Explicit logic	Pattern discovery
Adaptability	Manual updates needed	Improves with more data
Best for	Well-defined problems	Complex, fuzzy problems
Example	Sorting algorithm	Spam detection

How Does a Machine “Learn”?

The learning process follows a consistent cycle:

flowchart TD
  A["1. Collect Data"] --> B["2. Prepare & Clean Data"]
  B --> C["3. Choose a Model"]
  C --> D["4. Train the Model"]
  D --> E["5. Evaluate Performance"]
  E -->|"Not good enough"| F["6. Tune & Improve"]
  F --> D
  E -->|"Satisfactory"| G["7. Deploy & Monitor"]
  style A fill:#e8f4fd,stroke:#1a5276,color:#1a5276
  style D fill:#e8f8f0,stroke:#1a5c38,color:#1a5c38
  style G fill:#f3e8fd,stroke:#5b1a7a,color:#5b1a7a

Collect data - gather labeled or unlabeled examples
Prepare data - clean, normalize, split into train/test sets
Choose a model - pick an algorithm suited to the task
Train - feed data through the model, adjust internal parameters
Evaluate - measure how well it performs on unseen data
Tune - adjust hyperparameters, add features, try different models
Deploy - put the trained model into production

Types of Machine Learning

Machine learning is broadly categorized into three paradigms:

flowchart TD
  ML["Machine Learning"] --> SL["Supervised Learning"]
  ML --> UL["Unsupervised Learning"]
  ML --> RL["Reinforcement Learning"]
  SL --> REG["Regression"]
  SL --> CLS["Classification"]
  UL --> CLUST["Clustering"]
  UL --> DIM["Dimensionality Reduction"]
  RL --> GAME["Game Playing"]
  RL --> ROBOT["Robotics"]
  style ML fill:#f3e8fd,stroke:#5b1a7a,color:#5b1a7a
  style SL fill:#e8f8f0,stroke:#1a5c38,color:#1a5c38
  style UL fill:#e8f4fd,stroke:#1a5276,color:#1a5276
  style RL fill:#fdf3e8,stroke:#7a4a1a,color:#7a4a1a

Type	Data	Goal	Example
Supervised	Labeled (input + correct answer)	Predict outcomes	Email → spam or not
Unsupervised	Unlabeled (input only)	Find hidden patterns	Customer segmentation
Reinforcement	Reward signals	Maximize cumulative reward	Game-playing AI

We’ll cover supervised and unsupervised learning in depth in the next posts. Reinforcement learning is a topic for a future series.

Real-Life Examples of ML

Machine learning isn’t just academic - it’s embedded in products you use every day.

Netflix Recommendations

Netflix uses collaborative filtering (a form of ML) to analyze your viewing history alongside millions of other users. If users similar to you loved a show, it’ll recommend it to you.

Gmail Spam Filter

Gmail’s spam filter is a classification model trained on billions of emails. It looks at features like sender reputation, keyword patterns, link analysis, and user feedback to decide: spam or not spam.

Credit Card Fraud Detection

Banks use anomaly detection models that learn your normal spending patterns. A purchase at 3 AM in a foreign country for an unusual amount? The model flags it instantly.

Voice Assistants (Siri, Alexa)

Speech recognition is powered by deep learning models trained on millions of hours of human speech. The model converts audio waveforms into text, then another model interprets the intent.

Self-Driving Cars

Autonomous vehicles combine computer vision (identifying objects in camera feeds), sensor fusion (combining lidar, radar, camera data), and reinforcement learning (learning to navigate) - all ML techniques working together.

Medical Diagnosis

ML models can analyze X-rays, MRIs, and CT scans to detect tumors, fractures, and diseases - sometimes matching or exceeding radiologist accuracy.

Domain	ML Application	Technique
Entertainment	Content recommendations	Collaborative filtering
Email	Spam detection	Text classification
Finance	Fraud detection	Anomaly detection
Healthcare	Disease diagnosis	Image classification
Transport	Self-driving cars	Deep learning + RL
Retail	Demand forecasting	Time series regression
Agriculture	Crop disease detection	Computer vision

Key Terminology

Before we dive deeper in the next posts, here are the terms you’ll see everywhere:

Feature: An input variable (e.g., house size, number of bedrooms)
Label / Target: The output we want to predict (e.g., house price)
Training set: Data used to train the model
Test set: Data held back to evaluate model performance
Model: The learned function mapping inputs to outputs
Overfitting: Model memorizes training data, fails on new data
Underfitting: Model is too simple to capture patterns
Hyperparameter: A setting you choose before training (e.g., learning rate)

What’s Next?

In the next post, we’ll dive into Supervised Learning - the most common and practical form of ML. We’ll work with real tabular data, understand the training process, and see how a model makes predictions.

The journey from here:

flowchart LR
  A["✅ Intro to ML"] --> B["Supervised Learning"]
  B --> C["Regression"]
  C --> D["Classification"]
  D --> E["Unsupervised Learning"]
  style A fill:#e8f8f0,stroke:#1a5c38,color:#1a5c38
  style B fill:#e8f4fd,stroke:#1a5276,color:#1a5276

See you in Part 2.

← Back to all series