Introducing G2.ai, the future of software buying.Try now

What pre-processing is done to my data prior to training a model?

What is done with my data to prepare it for machine learning?
1 comment
Looks like you’re not logged in.
Users need to be logged in to answer questions
Log In
Qlik AutoML
Official Response
Qlik AutoML
David C.
DC
Director of Product Marketing
0
Kraken requires a dataset that is mostly ready for machine learning. However, we do apply some basic pre-processing steps to the data before building models. 1. Imputation of nulls 2. Encoding categorical features (also known as creating "dummy variables") 3. Feature scaling, or normalization 4. Handling high correlation of a Driver to the predicted Metric or correlation between Drivers 5. Take random samples of the data and perform five-fold cross-validation All of these pre-processing steps are performed given different thresholds set in our pipeline. The thresholds can be changed by us as we learn more about how accurate the models are that Kraken creates.
Looks like you’re not logged in.
Users need to be logged in to write comments
Log In
Reply