Question Bank
#856
The Dummy Variable Trap
Easy · Machine Learning
Problem
You one-hot encode a categorical feature with 12 levels (say, the month of the year) for a linear regression that includes an intercept. How many indicator columns should you create, and what exactly goes wrong if you create one per level?
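One way to explore the question empirically is to build both design matrices and compare their column counts with their ranks. The sketch below is only illustrative: the data (120 rows cycling through the 12 months) and the variable names are hypothetical, and NumPy is an assumed tool rather than anything the problem prescribes.

```python
import numpy as np

# Hypothetical data: 120 observations, 10 for each of the 12 months (coded 0..11).
months = np.arange(120) % 12

# One indicator column per level: row i has a 1 in column months[i].
dummies = np.eye(12)[months]                 # shape (120, 12)
intercept = np.ones((len(months), 1))        # column of ones for the intercept

# Design matrix with the intercept and all 12 indicators (13 columns).
X_full = np.hstack([intercept, dummies])
# Design matrix with the intercept and 11 indicators (first level dropped).
X_drop = np.hstack([intercept, dummies[:, 1:]])

rank_full = np.linalg.matrix_rank(X_full)
rank_drop = np.linalg.matrix_rank(X_drop)

print(X_full.shape[1], rank_full)   # 13 columns, rank 12
print(X_drop.shape[1], rank_drop)   # 12 columns, rank 12

# The 12 indicators sum to the intercept column in every row,
# which is the exact linear dependence behind the rank deficit.
print(np.allclose(dummies.sum(axis=1, keepdims=True), intercept))  # True
```

With all 12 indicators the matrix has 13 columns but rank 12, so X'X is singular and the ordinary least squares coefficients are not uniquely determined. Dropping one level gives a full-column-rank matrix, and each remaining coefficient is then read as a difference from the omitted baseline month.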