InterviewRole

Data Analyst Interview Questions

A practical path for SQL, metrics, statistics, dashboards, and business analysis interviews.

Questions

How do you find weekly active users?medium

Answer

Group activity by week and count distinct users.

Explanation

A strong answer defines the activity event, handles timezone and deduplication, then uses a weekly date bucket with COUNT(DISTINCT user_id).

Follow-upHow would you calculate week-over-week growth?

When would you use a window function?medium

Answer

Use it when you need row-level output plus aggregate context.

Explanation

Window functions are useful for ranking, running totals, cohort calculations, lag comparisons, and moving averages without collapsing rows.

Follow-upHow is ROW_NUMBER different from RANK?

How do you debug a slow query?medium

Answer

Inspect the plan, filter early, index join keys, and reduce scanned data.

Explanation

Start with EXPLAIN, check joins, cardinality, missing indexes, large sorts, and whether predicates can use partitions or indexes.

Follow-upWhat makes an index ineffective?

What is statistical significance?medium

Answer

It means the observed result is unlikely under the null hypothesis.

Explanation

Interviewers expect you to separate p-value, confidence, sample size, effect size, and practical business importance.

Follow-upCan a result be significant but not useful?

How do you explain confidence intervals?medium

Answer

They show a plausible range for an estimated value.

Explanation

A good explanation avoids saying there is a fixed probability the true value is inside one computed interval; instead, describe repeated sampling behavior.

Follow-upHow does sample size affect the interval?

What is sampling bias?medium

Answer

It happens when sampled data does not represent the target population.

Explanation

Bias can come from selection, survivorship, non-response, instrumentation, or product exposure differences.

Follow-upHow would you reduce bias in an experiment?

How do you investigate a metric drop?medium

Answer

Validate the data, segment the metric, and compare against recent changes.

Explanation

Check pipeline health first, then slice by time, geography, platform, cohort, channel, and product release to isolate the driver.

Follow-upHow do you avoid false conclusions?

What makes an insight actionable?medium

Answer

It connects a finding to a decision or next step.

Explanation

Actionable analysis includes context, magnitude, confidence, tradeoffs, and a recommendation the team can evaluate.

Follow-upHow would you communicate uncertainty?

How do you handle missing data?medium

Answer

Measure it, understand why it is missing, then choose a transparent treatment.

Explanation

Depending on the cause, you may filter, impute, label missingness, or redesign collection. The key is documenting impact.

Follow-upWhen is imputation risky?

How do you choose the right chart?medium

Answer

Match the chart to the comparison you need to show.

Explanation

Use lines for trends, bars for category comparison, scatter plots for relationships, and tables for precise lookup.

Follow-upWhen should you avoid a pie chart?

What makes a dashboard useful?medium

Answer

It answers a recurring decision with clear metrics and context.

Explanation

A strong dashboard has an audience, a refresh cadence, definitions, targets, owners, and minimal clutter.

Follow-upWhat belongs above the fold?

How do you prevent misleading visuals?medium

Answer

Use honest scales, labels, baselines, and context.

Explanation

Avoid truncated axes, overloaded colors, hidden denominators, and charts that emphasize noise over signal.

Follow-upHow would you show uncertainty visually?

What is normalization?medium

Answer

Normalization organizes tables to reduce redundancy and update anomalies.

Explanation

Explain entities, keys, relationships, and tradeoffs. In analytics, denormalization may be acceptable for performance and usability.

Follow-upWhen would you denormalize?

What is a transaction?medium

Answer

A transaction is a unit of work that should satisfy ACID properties.

Explanation

Good answers cover atomicity, consistency, isolation, durability, plus examples like money transfer or inventory update.

Follow-upWhat problem does isolation solve?

How do indexes speed reads?medium

Answer

Indexes let the database find rows without scanning the whole table.

Explanation

They improve lookup and join performance but add write overhead and storage cost, so they should match query patterns.

Follow-upWhat is a composite index?
Back to Home