Datasets

Datasets allow for the creation of a dataframe, a data structure that captures the logic that organizes data into a 2-dimensional table of rows and columns. Datasets allow for the reuse of these dataframes across models, as well as the features derived from the data source(s).

The Dataset allows you to manage the entire operation related to data set. You can perform the following two things using the Dataset window:

  • Define a metadata on how you want to create a dataset.
  • In addition, a mechanism where you can take a snapshot of real time data and store it. So, it can be used later in the pipeline.