Files Ingest

The File Ingest Task allows you to ingest files from remote storage systems into Gaio DataOS. It is commonly used to automate data ingestion from external servers such as SFTP or FTP, enabling repeatable and reliable file-based pipelines. This task focuses on connecting to a remote source, selecting files using flexible patterns, and storing them locally for downstream processing.

The File Ingest Task is designed to:

  • Connect to external file servers.

  • Download one or multiple files.

  • Support dynamic file selection patterns.

  • Store files locally for further processing.

  • Enable automated and scheduled ingestion workflows.

How to configure Files Ingest

  1. Provide a descriptive name for the task.

  2. Select the protocol used to access the remote server.

  3. Select the authentication method used to connect.

    1. Password-based authentication (SFTP and FTP).

    2. Key-based authentication (SFTP and FTP).

    3. Anonymous (without password).

  4. Enter the hostname or IP address of the remote server.

  5. Specify the port used by the selected protocol.

  6. Enter the username used for authentication.

  7. Imports a specific file, all files with the given extension or all files from the folder.

circle-exclamation
  1. Specify the path to the file or folder on the remote server.

  2. Define where the files will be stored locally.

  3. Save and execute.

Best Practices (File Ingest)

  • Use explicit filenames when possible

  • Use wildcards only when needed

  • Always validate remote paths

  • Keep local paths organized by pipeline

  • Parameterize filenames for dynamic ingestion

  • Test connectivity before production use

Last updated