Question 1

What can be an indicator of the usefulness of mean encodings?

Correct answers:

Incorrect answers:

Question 2

What is the purpose of regularization in the case of mean encodings?

Correct answers:

Incorrect answers:

Question 3

What is the correct way to do validation when using mean encodings?

Correct answers:

Incorrect answers:

Question 4

Suppose we have a data frame 'df' with a categorical variable 'item_id' and a target variable 'target'. We create two different mean encodings:

1) via df['item_id_encoded1'] = df.groupby('item_id')['target'].transform('mean')

2) by one-hot encoding item_id, fitting a linear regression on the one-hot-encoded version of item_id, and then calculating 'item_id_encoded2' as the prediction of this linear regression on the same data.
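
The two constructions can be sketched on a toy data frame so the resulting columns can be compared side by side. This is only a minimal sketch under stated assumptions: the data values are invented for illustration, and the linear regression is stood in for by ordinary least squares without an intercept via np.linalg.lstsq, which may differ from the setup the question has in mind (for example, a regression with an intercept or with regularization).

```python
import numpy as np
import pandas as pd

# Toy data, invented for illustration only.
df = pd.DataFrame({
    "item_id": ["a", "a", "b", "b", "b", "c"],
    "target": [1.0, 0.0, 1.0, 1.0, 0.0, 1.0],
})

# Encoding 1: per-category mean of the target, broadcast back to every row.
df["item_id_encoded1"] = df.groupby("item_id")["target"].transform("mean")

# Encoding 2: one-hot encode item_id, fit least squares on the one-hot
# columns (assumption: no intercept, no regularization), then predict
# on the same rows that were used for fitting.
X = pd.get_dummies(df["item_id"]).to_numpy(dtype=float)
y = df["target"].to_numpy(dtype=float)
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
df["item_id_encoded2"] = X @ coef

print(df)
```

Printing df shows both encoded columns next to the original item_id and target values, which makes it easy to check how the two encodings relate on data of your own.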

Correct answers:

Incorrect answers: