Building digital experiences with passion

"Simplicity is the soul of efficiency." - Austin Freeman

Connect

Github iconGitHubGmail iconEmailLinkedIn iconLinkedIn

© 2026 Abel Sintaro. All rights reserved

Back

The Fundamentals of Machine Learning

Chapter 1

The Machine Learning Landscape

Intro1 of 9Next
End-to-End Machine Learning Project

Chapter 1 of Hands-On Machine Learning with Scikit-Learn, Keras & TensorFlow introduces the foundational concepts and big-picture view of machine learning (ML). Aurélien Géron explains what machine learning is, when to use it, and how different types of learning problems are structured. The chapter sets the conceptual groundwork for the practical work that follows in later chapters.

What Machine Learning Is

Géron defines machine learning as the field of study that gives computers the ability to learn from data without being explicitly programmed. Instead of writing fixed rules, developers train models that discover patterns in data and make predictions or decisions. ML is especially useful when:

  • Rules are too complex to code manually
  • The environment changes frequently
  • Large amounts of data are available
  • Pattern discovery is valuable

Why Use Machine Learning

The chapter highlights key advantages of ML systems:

  • They can handle complex, high-dimensional problems
  • They improve with more data
  • They can uncover hidden patterns
  • They adapt to changing conditions

However, Géron also notes that ML is not always the right solution — simple rule-based systems can sometimes be more efficient and interpretable.

Types of Machine Learning Systems

The chapter categorizes ML systems along several dimensions:

  1. Supervised vs. Unsupervised Learning
  • Supervised learning: models learn from labeled data (e.g., classification, regression)
  • Unsupervised learning: models find structure in unlabeled data (e.g., clustering, dimensionality reduction)
  • Géron briefly introduces semi-supervised and reinforcement learning as well.
  1. Batch vs. Online Learning
  • Batch learning: the model is trained once on the full dataset.
  • Online learning: the model learns incrementally from data streams.
  • Online learning is useful for large-scale or continuously evolving data.
  1. Instance-Based vs. Model-Based Learning

Instance-based: compares new data to stored examples (e.g., k-nearest neighbors).

Model-based: builds a predictive model and generalizes from it.

Key Challenges in Machine Learning

Géron outlines common obstacles that affect model performance:

  • Insufficient training data
  • Poor-quality data
  • Irrelevant features
  • Overfitting (model too complex)
  • Underfitting (model too simple)

He emphasizes that data quality and proper evaluation are often more important than algorithm choice.

Testing and Validation

The chapter introduces the critical practice of splitting data into training and test sets. Proper evaluation ensures that models generalize to new data rather than memorizing the training set. Concepts like generalization error and performance metrics are introduced at a high level.

Real-World Workflow Preview

Finally, Géron provides a preview of a typical ML project pipeline:

  1. Look at the big picture
  2. Get the data
  3. Prepare the data
  4. Select and train a model
  5. Fine-tune the model
  6. Present the solution
  7. Launch and monitor

This roadmap becomes the backbone of the rest of the book.

Key Takeaway:

Chapter 1 establishes that successful machine learning is not just about algorithms — it is about understanding the problem type, preparing quality data, choosing the right learning approach, and rigorously evaluating models within a complete workflow.