What pre-processing is done to my data prior to training a model?

Question

G2

Verified User

What pre-processing is done to my data prior to training a model?

Asked almost 6 years ago

What is done with my data to prepare it for machine learning?

Machine Learning Software

Comment

1 comment

1

Looks like you’re not logged in.

Users need to be logged in to answer questions

Log In

David C. · Answer 1 · 2019-07-19T17:20:44-05:00

Official Response

Qlik AutoML

Read reviews

DC

David C.

Director of Product Marketing

0

Answered almost 6 years ago

Kraken requires a dataset that is mostly ready for machine learning. However, we do apply some basic pre-processing steps to the data before building models. 1. Imputation of nulls 2. Encoding categorical features (also known as creating "dummy variables") 3. Feature scaling, or normalization 4. Handling high correlation of a Driver to the predicted Metric or correlation between Drivers 5. Take random samples of the data and perform five-fold cross-validation All of these pre-processing steps are performed given different thresholds set in our pipeline. The thresholds can be changed by us as we learn more about how accurate the models are that Kraken creates.

What pre-processing is done to my data prior to training a model?

About Qlik AutoML