Gaio DataOS
Gaio DataOS
Gaio DataOS
  • 👋 Welcome to Gaio DataOS
  • GETTING STARTED
    • Gaio DataOS Console
    • Quickstart
  • FUNDAMENTALS
    • Data Projects
    • Studio
    • Database
    • Workflow
  • Data Sources
  • TASKS
    • ETL
      • Builder
      • SQL
      • Source SQL
      • Insert Table
      • Insert Row
      • Update
      • Delete
      • Create Table
      • Quick Table
      • Quick Upload
      • Pivot Table
      • Unpivot Table
      • REST
      • Parameters to Table
      • Table to Parameters
      • Define parameter value
      • Users
      • CSV Web
      • CSV Local
      • Google Spreadsheet
    • Analytics
      • Sample
      • Cluster
      • Principal Components
      • Association Rules
      • Forecast
      • Python
    • Delivery
      • Content
      • Form Card
      • Export CSV
    • Map Editor
Powered by GitBook
On this page
  • How to Configure the Forecast Task
  • 1. Open the Forecast Task
  • 2. Fill in the Required Fields
  • 3. Understanding the Output Table
  • 4. Visualization
  1. TASKS
  2. Analytics

Forecast

PreviousAssociation RulesNextPython

Last updated 2 days ago

Predict sales behavior in the coming months. This is an application of Time Series. In essence, the technique relies on a date column and a metric, to analyze three factors:

  1. Seasonality: peaks at certain cyclical moments (e.g. Christmas).

  2. Trend: analyzes the speed of growth, decline or stagnation.

  3. Amplitude: if there are rises and falls over time, it measures whether the distances between peaks and valleys have intensified or not.

Gaio can project the next months or days. Gaio uses the Facebook algorithm, Prophet, to perform calculations.

Various techniques and parameterizations are applied to the data, generating several projections and their errors are calculated. The model with the lowest error is chosen and the projections are saved in a table in Gaio.

How to Configure the Forecast Task


1. Open the Forecast Task

  • In the Studio, go to the Tasks panel.

  • Under the Analytics section, select on Forecast.


2. Fill in the Required Fields

  • Task label: (optional) Name for identifying this step in your flow.

  • Result table: Name of the output table that will store the forecast results (e.g., forecast_times_series).

  • Source table: Automatically populated with the selected table (e.g., times_series).

  • Metrics table: auxiliary metrics reference table.

  • Date: Column that represents the date or time.

  • Measure: Column that contains the values to be predicted (e.g., sales, revenue).

  • Frequency: Time frequency of the data (day, month).

  • Periods: Number of future periods to forecast.

  • Classification (optional): Enables separate forecasts for each group (e.g., per product, region, customer).


3. Understanding the Output Table

Once the task runs, the generated table will include the following columns:

Column
Description

category

The category for group-based forecasts. If classification is not used, this will show nd (not defined)

dt

Date of the record (historical or forecasted)

real

Historical value from the time series, if available

type

Value type: real (observed) or forecast (predicted)

forecast

This column has values for all lines, historical and forecast . This is because the model makes projections for the entire past, where it can calculate how far from reality the projections are.

lower

Lower bound of the confidence interval

upper

Upper bound of the confidence interval


4. Visualization

To create a chart when only one series was projected (without a column in the Classification ), go to Reports, drag all the fields except the category and sort in ascending order by the dt column , the date. Then choose the graphic forecast.

The result should be something similar to the graph below. The blue balls are the actual historical results. The green line is the model, which is applied to the past and future, generating projections.


Best Practices

  • Ensure clean time series: The date column must be continuous and consistent (no missing or duplicate dates).

  • Use classifications to forecast multiple categories independently (e.g., one model per store).

  • Visualize results in charts (line, area) to easily compare historical data with predictions.

It is generally desired to present the series and its projection graphically. In the task, there is an exclusive graph for Time Series .

Report