The Diff operation compares two data sets. One data set is the comparison source (the reference, e.g. an older data set); the other is the main source (the current or new data). The resulting data records are made available on four outputs:

  • Unchanged: Data for which no change has been detected.
  • Changed: Data that is present in both data sets but has been changed.
  • Added: Data that was not yet present in the reference data set.
  • Deleted: Data that was present in the reference data set and is now missing from the main source.

Please note: The two data sets used must contain the same columns/attributes.
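
The four outputs can be illustrated with a small Python sketch. This is not the product's implementation, only an approximation of the behaviour described above: the function name diff_records, the dictionary-based records and the sample data are all assumptions made for illustration. It also assumes that both data sets share the same columns and that the key values are unique.

```python
def diff_records(reference, current, key):
    """Classify the records of `current` against `reference` using `key`.

    Hypothetical helper: records are plain dicts with identical columns,
    and `key` is assumed to be unique within each data set.
    """
    ref_by_key = {row[key]: row for row in reference}
    cur_by_key = {row[key]: row for row in current}

    outputs = {"unchanged": [], "changed": [], "added": [], "deleted": []}

    # Walk the main source: each record is unchanged, changed or added.
    for k, row in cur_by_key.items():
        if k not in ref_by_key:
            outputs["added"].append(row)
        elif row == ref_by_key[k]:
            outputs["unchanged"].append(row)
        else:
            outputs["changed"].append(row)

    # Walk the reference: records whose key no longer exists were deleted.
    for k, row in ref_by_key.items():
        if k not in cur_by_key:
            outputs["deleted"].append(row)

    return outputs


# Sample data (invented for this example).
reference = [
    {"id": 1, "name": "Alice"},
    {"id": 2, "name": "Bob"},
    {"id": 3, "name": "Carol"},
]
current = [
    {"id": 1, "name": "Alice"},
    {"id": 2, "name": "Robert"},
    {"id": 4, "name": "Dave"},
]

result = diff_records(reference, current, key="id")
```

In this sample, record 1 is unchanged, record 2 is changed, record 4 is added and record 3 is deleted. The sketch places the new version of a changed record on the changed output; which version the Diff operation itself emits is not stated here.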

The configuration:

  1. Assign a name for the diff.
  2. Select the key attribute (key column). It must be present in both data sets, and its values must be unique.
  3. Deleted records are not determined by default, so no data appears on the Deleted output. This switch activates that output.
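
The key requirement and the switch for deleted records can be sketched in Python as follows. The names validate_key, deleted_records and determine_deleted are hypothetical and do not correspond to actual settings or functions of the product; the sketch only mirrors the rules listed above and can be used to check data before it is fed into a diff.

```python
from collections import Counter


def validate_key(rows, key):
    """Return a list of problems with `key` in `rows` (empty list means OK).

    Hypothetical pre-check: the key must exist in every record and its
    values must be unique within the data set.
    """
    problems = []
    missing = [i for i, row in enumerate(rows) if key not in row]
    if missing:
        problems.append(f"key '{key}' missing in rows {missing}")
    counts = Counter(row[key] for row in rows if key in row)
    duplicates = sorted(value for value, n in counts.items() if n > 1)
    if duplicates:
        problems.append(f"duplicate key values: {duplicates}")
    return problems


def deleted_records(reference, current, key, determine_deleted=False):
    """Return reference records missing from `current`.

    Mirrors the default described above: nothing is returned unless the
    (hypothetical) `determine_deleted` switch is turned on.
    """
    if not determine_deleted:
        return []
    current_keys = {row[key] for row in current}
    return [row for row in reference if row[key] not in current_keys]


# Sample data (invented for this example).
reference = [{"id": 1}, {"id": 2}, {"id": 3}]
current = [{"id": 1}, {"id": 2}]

key_problems = validate_key(reference, "id") + validate_key(current, "id")
off = deleted_records(reference, current, "id")
on = deleted_records(reference, current, "id", determine_deleted=True)
```

With the switch off, the deleted output stays empty even though record 3 is missing from the main source; with it on, record 3 is reported.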