Pipeline

What is a pipeline?

The pipeline shows all the steps of your data processing, from import to final output. Each step of the process is represented by a node.

In pipeline screen is where you set how you want your data to be combined, transformed, etc.

Node in a pipeline

Node is a representation of each data processing step.

There are different types of Node depending on the stage of data processing and each one is represented differently in the pipeline:

  1. Connection Node: A connection node represents a link to an external data source
  2. File Node: A file represent the actual data table which is uploaded from a file.
  3. Dataset Node: A dataset represents a data state which can be processed. Only in this type of node you are able to do transformation on the data.

Where to start?

Next: Create connection node and import data