How is the model trained and tested? Why doesn’t my model completely match the historical data?

Question

G2

Verified User


Asked almost 6 years ago

Tell me a little more about how models are trained and tested.

Machine Learning Software


David C. · Answer 1 · 2019-07-19T17:22:05-05:00

The model is built, or “trained”, on a training dataset, which is a subset of the original dataset you select. Kraken automatically splits your dataset at random and performs five-fold cross-validation: the rows are divided into five parts, and each part in turn is held out as the test dataset while the model is trained on the other four. A prediction is made for each row in the test dataset and compared with the actual result, which produces the accuracy measures used to score the models.

It may seem counterintuitive that the model can’t get everything right against the historical data; after all, those events already happened. All it means is that the model does not predict with 100% accuracy, so some of its “predictions” on historical data points don’t match what actually happened. This is not necessarily a bad thing. In fact, any model that predicts with 100% accuracy against the test dataset should be scrutinized further to see whether overfitting or some other error is occurring.
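To make the answer above concrete, here is a minimal sketch of five-fold cross-validation in plain Python. It is not Kraken's actual implementation, which isn't described here. The synthetic data, the 1-nearest-neighbour stand-in model, and all function names (`k_fold_indices`, `predict_1nn`, `cross_validate`, and so on) are assumptions made purely for illustration.

```python
import random

def k_fold_indices(n_rows, k=5, seed=0):
    """Shuffle the row indices and deal them into k roughly equal folds."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def predict_1nn(train_rows, x):
    """Stand-in model: copy the label of the training row closest to x."""
    nearest = min(train_rows, key=lambda row: abs(row[0] - x))
    return nearest[1]

def accuracy(train_rows, eval_rows):
    """Fraction of eval_rows whose predicted label matches the actual label."""
    hits = sum(predict_1nn(train_rows, x) == y for x, y in eval_rows)
    return hits / len(eval_rows)

def make_data(n_rows=200, noise=0.2, seed=1):
    """Synthetic 'historical' rows (x, label).

    The label is normally x > 0.5, but is flipped with probability `noise`,
    so no model can explain every row perfectly.
    """
    rng = random.Random(seed)
    rows = []
    for _ in range(n_rows):
        x = rng.random()
        label = x > 0.5
        if rng.random() < noise:
            label = not label
        rows.append((x, label))
    return rows

def cross_validate(rows, k=5, seed=0):
    """Hold out each fold in turn, train on the rest, score on the held-out fold."""
    scores = []
    for held_out in k_fold_indices(len(rows), k, seed):
        held = set(held_out)
        train = [rows[i] for i in range(len(rows)) if i not in held]
        test = [rows[i] for i in held_out]
        scores.append(accuracy(train, test))
    return scores

rows = make_data()
fold_scores = cross_validate(rows)
train_score = accuracy(rows, rows)  # scored on the very rows it memorized

print("held-out accuracy per fold:", [round(s, 2) for s in fold_scores])
print("mean held-out accuracy:", round(sum(fold_scores) / len(fold_scores), 2))
print("accuracy on its own training rows:", train_score)
```

The stand-in model simply memorizes its training rows, so it scores perfectly on them, yet it misses some of the held-out rows because part of the historical data is noise. That gap is why accuracy is measured on rows the model did not see during training, and why a perfect score on the test dataset deserves a second look.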

