Question Bank
#856

The Dummy Variable Trap

Easy · Machine Learning
Reported at: DRW, SIG

Problem

You one-hot encode a categorical feature with 12 levels (say, the month of the year) for a linear regression that includes an intercept. How many indicator columns should you create, and what exactly goes wrong if you create one per level?


Hints
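
One way to explore the question is to build both design matrices and compare the number of columns with the matrix rank. The sketch below is only an illustration: it assumes NumPy, uses synthetic month labels, and the names `X_full` and `X_drop` are made up for this example rather than taken from the question.

```python
import numpy as np

rng = np.random.default_rng(0)
n_levels = 12

# Synthetic month labels (0..11); the first 12 rows guarantee every level appears.
months = np.concatenate([np.arange(n_levels), rng.integers(0, n_levels, size=188)])

intercept = np.ones((months.size, 1))
indicators = np.eye(n_levels)[months]  # one 0/1 column per level

# Design matrix with an intercept and one indicator per level.
X_full = np.hstack([intercept, indicators])
# Design matrix with an intercept and the first level dropped as the baseline.
X_drop = np.hstack([intercept, indicators[:, 1:]])

# In every row exactly one indicator is 1, so the indicators sum to the intercept column.
rows_sum_to_one = np.allclose(indicators.sum(axis=1), 1.0)

rank_full = np.linalg.matrix_rank(X_full)
rank_drop = np.linalg.matrix_rank(X_drop)

print(X_full.shape[1], rank_full)  # columns vs. rank with all levels encoded
print(X_drop.shape[1], rank_drop)  # columns vs. rank with one level dropped
```

If the rank is smaller than the number of columns, the columns are linearly dependent, so X'X is singular and the ordinary least squares coefficients are not uniquely determined.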