Talend tNormalize

Talend tNormalize helps to normalize the denormalized data so that we can use that cleanse data for further action. This Talend tNormalize component to normalize the source data as we do in the database normalization.

To demonstrate the Talend tNormalize, we used the below-shown text file. As you can notice, the Sales column has multiple entries that are divided by semicolon.

Talend tNormalize 1

Talend tNormalize Example

Before we start creating a job, we created the File delimited Metadata for the Normalize Source text file. We use this Metadata as the source.

Talend tNormalize 2

Let me use the same File delimited Metadata as the source of the Talend normalize component.

Talend tNormalize 3

Next, drag and drop the tNormalize into the job window.

Talend tNormalize 4

As you see from the below tNormalize component tab, it has two options

  • Column to normalize: Please select the column that you want to flat or normalize.
  • Item separator: Please type the item that separates the content inside that normalized column. 
Talend tNormalize 5

Please click on the tNormalize Edit Schema button to check the schema or the input and output columns.

Talend tNormalize 6

Here, we are selecting the Sales column as the normalized column.

Talend tNormalize 7

Next, we used the semicolon as the item separator.

Talend tNormalize 8

To display the result of the normalized data, we used the tLogRow.

Talend tNormalize 9

Let me run the Talend tNormalize job and see the result.

Talend tNormalize 10

We want to store the tNormalize output in the text file to use it as the tDenormalize source. Here, we are saving the result in a text file, including the header, and the columns separated by a comma. Next, run the Talend tNormalize job.

Talend tNormalize 11

Within the file system, let me open the text file to show you the result.

Talend tNormalize 12