Regression Model in SAP Analytics Cloud Smart Predict

SAP Analytics Cloud offers the Smart Predict feature, enabling business users to uncover trends and make accurate forecasts using automated machine learning. Its predictive modeling capabilities, including Regression Models, help organizations forecast continuous outcomes such as sales, costs, or demand levels, turning data into actionable insights from the very first analysis. Using SAP’s cloud analytics platform, users can make smarter, data-driven decisions without needing coding expertise.

Building a Regression Model

To begin, navigate to the Side Menu → Predictive Scenarios → Regression. Give your scenario a name and description that reflects your business objective

InsightCubes_PredictiveScenario_SAC

The training dataset contains past observations where the target variable is already known. The Smart Predict feature of SAP Analytics Cloud analyzes this dataset to generate a predictive model that explains and forecasts the target variable based on influencers – the factors that affect it.

InsightCubes_Regression_Model_DataSource_SAC
Training Data Source

When you later apply the model to an application dataset, ensure it has:

  • The same structure and number of variables as the training dataset
  • Identical variable names (extra columns will be ignored)

Understanding Variable Roles

In SAP’s Analytics Cloud platform, every column in your dataset represents a variable, while each row corresponds to an individual observation.

  • Target Variable: The variable you aim to predict (e.g., number of complaints or home price).
  • Influencers: Variables that describe and explain the target (e.g., age, income, location).

Excluding Variables

By default, all columns are considered potential influencers. However, you can exclude variables that directly relate to the target or leak information about it (known as leakers). Including such variables can artificially inflate model accuracy.

Saving and Training the Model

Once the predictive model setup is complete:

  1. Save the model – it will appear in your predictive scenario with the status Not Trained.
  2. Train the model – Smart Predict will analyze variable relationships to find the best combination for prediction

After successful training, the model status updates to Trained. If training fails, check the Status Panel for diagnostic messages

Analyzing Regression Model Results

After training, SAC Smart Predict provides several reports to evaluate model performance and interpret results. Root Mean Squared Error (RMSE)

RMSE measures the average difference between predicted and actual values.

  • A smaller RMSE indicates a more accurate model.
  • RMSE uses the same units as the target variable, making it easy to interpret (e.g., a $200 RMSE means predictions are, on average, $200 off).

You can often reduce RMSE by adding more influencer variables.

Prediction Confidence measures how well your model is expected to perform on new data.

  • It ranges from 0% to 100%, and should be as close to 100% as possible.
  • You can improve confidence by adding more data rows or refining influencers.
Insightubes_Regression_Model_Indicators_SAC

Smart Predict highlights the top five influencers that most impact the target variable.
Influencers are ranked by their contribution, which represents how much each variable explains the target.
The sum of all contributions equals 100%.

InsightCubes_Regression_Model_TargetStatistics_SAC
Target Statistics

Understanding the Predicted vs Actual Chart

The Predicted vs Actual chart compares your model’s predictions to the actual target values using 20 data segments (bins). You’ll see:

  • Validation – Actual curve: Actual target values vs predictions
  • Perfect Model curve: Ideal case where predictions match actuals
  • Error Min/Max curves: The expected deviation range

How to Interpret the Chart

  1. Curves don’t match: Model is inaccurate – consider adding more data or influencers.
  2. Curves match closely: Model is accurate and reliable.
  3. Curves match except one segment: Model is mostly accurate – retrain with more data for that specific segment.
InsightCubes_Regression_Model_Example_SAC
example

Applying a Regression Model

Once satisfied with your model’s performance, you can apply it to a new dataset to generate predictions. Steps to Apply the Model:

  1. Open your trained predictive model.
  2. Choose Apply Predictive Model.
  3. In the Data Source section, select your new dataset.
  4. Under Generated Dataset, choose which columns to include in the output (e.g., predictions, statistics).
  5. Name the output table and click Apply.

The predicted output dataset will appear under Browse → Files. You can open it directly or visualize the results in a BI story.

Additional Output Options

When applying a regression model, you can include extra contextual information in the output dataset.

FieldDescription
Apply DateStart date of the model application (TIMESTAMP)
Train DateStart date of model training (TIMESTAMP)
Assigned BinIndicates population segment (useful for monitoring data distribution)
Outlier Indicator“1” if the observation is an outlier; “0” otherwise
Predicted ValueThe predicted numerical outcome
Prediction ExplanationsDetails of variable contributions behind each prediction
InsightCubes_Regression_Model_Apply_SAC
Statistics and Predictions

Using Assigned Bins

Bins divide data into approximately 10% segments. If distribution changes significantly (e.g., one bin rises to 14%), it may signal a population shift – time to retrain your model.

Outlier Indicator

Smart Predict flags outliers automatically – cases where prediction error exceeds 3 times the average error.

Prediction Explanations and Strength

Smart Predict can generate up to 10 explanations per prediction, showing the main variables influencing the result. Each explanation includes:

  • Variable and Value (e.g., Age = 35)
  • Strength: Indicates the magnitude and direction of impact:
  1. Positive strength → increases predicted value
  2. Negative strength → decreases predicted value

If more than 10 influencers exist, Smart Predict aggregates the least significant ones into:

  • Positive Others
  • Negative Others

To visualize regression explanations (e.g., in waterfall charts), SAP Analytics Cloud introduces a baseline pseudo influencer – the mean predicted target. Each final prediction, generated using the Smart Predict feature in SAP Analytics Cloud, equals the baseline plus contributions from all other influencers, making regression explanations clear and easy to interpret within SAP’s cloud analytics platform.

Improving and Managing Models

After evaluating results, you can:

  • Use the model if performance is satisfactory.
  • Improve it by duplicating and retraining with new settings or more data.

To duplicate:

  • Open the predictive scenario
  • Go to the model list → Click … → Duplicate
  • Retrain or modify the copy to compare results.

Conclusion

Regression modeling with Smart Predict in SAC turns historical data into actionable insights without coding. By analyzing key variables, evaluating model accuracy, and interpreting Influencer Contributions, organizations can make smarter, data-driven decisions and unlock real business value using this cloud-based analytics solution.

Key Takeaways

  • RMSE measures accuracy – lower is better.
  • Prediction Confidence shows reliability – closer to 100% is ideal.
  • Influencer analysis reveals what drives outcomes.
  • Assigned bins and outlier indicators help monitor data shifts.
  • Prediction explanations make models transparent and explainable.

Recent Posts

Get in Touch

Learn more and ask us About Our Cloud Consolidation Solution

Share This Post

Share this Page!

Share this with your network.

Want to Know More?

Get In Touch

Something isn’t Clear?


Feel free to contact us, and we will be more than happy to answer all of your questions.