Uncategorized

Outlier Detection & Winsorising

This quiz probes robust techniques to identify and handle extreme values in MMM datasets.

IQR method flags outliers if a data point lies beyond 1.5×IQR above Q3 or below ______.

mean

Q1

mode

median

Lower threshold identifies extreme low values outside normal spread. It’s robust to non‑normal distributions.

Winsorising at 99th percentile replaces extreme highs with the value at ______ percentile.

95th

99th

90th

50th

This caps influence of outliers while preserving rank ordering. It reduces variance without deleting data.

Cook’s distance highlights influential observations combining leverage and ______.

residual size

standard error

cookie

CTR

Large residuals at high leverage disproportionately affect coefficient estimates. Monitoring Cook’s D safeguards stability.

2025 Robyn defaults to winsorise spends at +/- ______ SD in log space.

1

0.5

2

5

Clamping extreme log spends prevents single week spikes from skewing adstock parameter fit.

Before winsorising, analysts should understand if spikes are real events or ______ errors.

data

DNS

cookie

creative

Legitimate promo bursts carry signal; blindly trimming could erase true effects. Investigate cause first.

MAD (median absolute deviation) thresholding is preferred over z‑score when data are ______.

uniform

non‑normal

censored

perfect

MAD uses median, resisting distortion from heavy tails more than mean‑SD z‑score.

Winsorising revenue but not spend can bias ROI because variance reduction is applied ______.

equally

never

asymmetrically

randomly

Applying to one side only distorts ratio metrics; symmetric treatment maintains consistency.

Outlier removal logs should capture threshold, count, and ______ of modified points.

dns

pixel

font

IDs

Documentation enables audit and potential rollback if decisions are questioned later.

When outliers are frequent, switch to robust regression such as ______ loss.

MAE

MSE

cross‑entropy

Huber

Huber loss blends L1 and L2, reducing sensitivity to large residuals without manual trimming.

Extreme value theory suggests setting cutoff where tail distribution fits a ______ model.

Gaussian

GPD (generalized pareto)

Poisson

Gamma

Fitting heavy‑tail allows statistically justified threshold selection rather than arbitrary percentile.

Starter

Review the basics.

Solid

Nice work—refine the details.

Expert!

Exceptional command of the topic.

What's your reaction?

Related Quizzes

1 of 7

Leave A Reply

Your email address will not be published. Required fields are marked *