InterviewSkill

Probability Interview Questions

Probability fundamentals for data, ML, statistics, experimentation, and analytical interview rounds.

8 questions
Probability

What is conditional probability?easy

Type
conceptual
Topic
conditional-probability
Frequency
common
Tags
conditional-probability, events
Answer

Conditional probability is the probability of an event given that another event has already occurred.

Explanation

It is written as P(A | B) and calculated as P(A and B) / P(B), when P(B) is not zero. It helps reason about updated information in experiments, diagnostics, and ML classification.

Follow-upHow is conditional probability used in Bayes' theorem?

What is Bayes' theorem?medium

Type
conceptual
Topic
bayes-theorem
Frequency
common
Tags
bayes, prior, posterior
Answer

Bayes' theorem updates the probability of a hypothesis using prior belief and observed evidence.

Explanation

It is commonly written as P(A | B) = P(B | A)P(A) / P(B). In interviews, it is useful for medical tests, spam detection, fraud detection, and model reasoning under uncertainty.

Follow-upWhy can a high-accuracy test still produce many false positives?

What does it mean for two events to be independent?easy

Type
conceptual
Topic
independent-events
Frequency
common
Tags
independence, events
Answer

Two events are independent if knowing one happened does not change the probability of the other.

Explanation

For independent events A and B, P(A and B) = P(A)P(B), and P(A | B) = P(A). Independence is stronger than simply being different events.

Follow-upCan two mutually exclusive events be independent?

What is a random variable?easy

Type
conceptual
Topic
random-variable
Frequency
common
Tags
random-variable, distribution
Answer

A random variable maps outcomes of a random process to numeric values.

Explanation

Random variables can be discrete, like the number of clicks, or continuous, like response time. They let us describe uncertainty with distributions, expectation, and variance.

Follow-upWhat is the difference between discrete and continuous random variables?

What is expected value?medium

Type
conceptual
Topic
expected-value
Frequency
common
Tags
expectation, decision-making
Answer

Expected value is the long-run average value of a random variable.

Explanation

For a discrete random variable, it is the sum of each value times its probability. It is useful for decision-making, risk analysis, ranking, and evaluating bets or product outcomes.

Follow-upWhy can expected value be misleading for high-risk decisions?

What does variance measure?medium

Type
conceptual
Topic
variance
Frequency
common
Tags
variance, standard-deviation
Answer

Variance measures how spread out values are around the mean.

Explanation

A higher variance means outcomes are more dispersed and less predictable. Standard deviation is the square root of variance and is easier to interpret because it uses the original units.

Follow-upWhy is standard deviation often easier to explain than variance?

When would you use Bernoulli, binomial, and normal distributions?medium

Type
conceptual
Topic
common-distributions
Frequency
common
Tags
bernoulli, binomial, normal-distribution
Answer

Use Bernoulli for one binary trial, binomial for the number of successes across fixed trials, and normal for many continuous measurements or approximations.

Explanation

These distributions model different kinds of uncertainty. Interviewers often expect you to connect the distribution to the data-generating process rather than memorize formulas.

Follow-upWhy does the normal distribution appear often in practice?

What is the law of large numbers?medium

Type
conceptual
Topic
law-of-large-numbers
Frequency
common
Tags
law-of-large-numbers, sampling
Answer

The law of large numbers says that sample averages tend to get closer to the expected value as sample size grows.

Explanation

It explains why more data usually gives more stable estimates. It does not guarantee every small sample is accurate, and it does not remove bias from a bad sampling process.

Follow-upHow is this different from the central limit theorem?