Talks and presentations

Quantile Encoder: Tackling High Cardinality Categorical Features

October 24, 2020

European Central Bank, ECB, Machine Learning Community Meet Up, Frankfurt, Germany

In this paper, we provide an in-depth analysis of how to tackle high cardinality categorical features with the quantile. Our proposal outperforms state-of-the-art encoders, including the traditional statistical mean target encoder, when considering the Mean Absolute Error, especially in the presence of long-tailed or skewed distributions