Expected Value: Summarizes Average Behavior#
In data science, we often care less about a single outcome and more about the long-run average behavior of a process. This idea is captured by expected value.

Mathematically: Expected Value
For a discrete random variable \(X\):
\(\mathbb{E}[X] = \sum_x x \, P(X = x)\)
For a continuous random variable \(X\) with density \(f(x)\):
\(\mathbb{E}[X] = \int_{-\infty}^{\infty} x f(x)\,dx\)
Intuition:
Expected value is not a guaranteed outcome
It represents the average result over many repetitions
It can be positive, zero, or negative
A negative expected value means a process loses value on average
(e.g., average loss, cost, or error).
In data science, expected value connects directly to: -average error -average loss -average reward
This is why it plays a central role in optimization, decision-making, and model evaluation.
#Small Python Simulation
#Bernoulli example (click or no click):
import numpy as np
np.random.seed(1)
p = 0.2
samples = np.random.binomial(n=1, p=p, size=50_000)
print("Empirical mean:", samples.mean())
print("Expected value:", p)
Empirical mean: 0.1994
Expected value: 0.2