Talend tNormalize

The Talend tNormalize component normalizes denormalized data so that the cleansed data can be used in later steps. It splits a multi-valued column into separate rows, much as we do in database normalization.

To demonstrate the Talend tNormalize component, we use the text file shown below. Notice that the Sales column holds multiple entries separated by semicolons.

Source Table

Talend tNormalize Example

Before creating the job, we created File Delimited metadata for the Normalize Source text file. We use this metadata as the source.

File Delimiter Metadata for source text file

Let me use this File Delimited metadata as the source for the tNormalize component.
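For readers who want to see what such a source looks like outside Talend, here is a minimal Python sketch of reading a delimited file with a header row. It is only an illustration: the comma field delimiter, the Name column, and the sample values are assumptions, and only the Sales column with its semicolon-separated entries comes from the example above.

```python
import csv
import io

# Hypothetical file contents; assumes comma-separated fields and a header row.
# Only the Sales column name is taken from the article.
sample_text = "Name,Sales\nA,100;200;300\nB,50\n"

# DictReader uses the header row as the schema, roughly like delimited metadata.
reader = csv.DictReader(io.StringIO(sample_text), delimiter=",")
rows = list(reader)

print(reader.fieldnames)   # column names taken from the header row
print(rows[0]["Sales"])    # the multi-valued field is still a single string
```

At this stage the Sales field is still one string per row, which is the shape tNormalize is meant to break apart.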

Talend tNormalize 3

Next, drag and drop the tNormalize component into the job window. As you can see from the tNormalize Component tab below, it has two options:

  • Column to normalize: Select the column that you want to flatten or normalize.
  • Item separator: Type the character that separates the items inside the column to normalize.
Talend tNormalize 5

Click the tNormalize Edit Schema button to check the schema, that is, the input and output columns.

Edit Schema

Here, we select the Sales column from the drop-down list as the column to normalize and use the semicolon as the item separator.
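As a rough illustration outside Talend, the effect of these two settings can be sketched in Python. This is not Talend's generated code. The function name normalize, the Name column, and the sample values are assumptions made for the example; only the Sales column and the semicolon separator come from the job above.

```python
def normalize(rows, column, separator=";"):
    """Yield one output row per item found in `column` (hypothetical helper)."""
    for row in rows:
        for item in row[column].split(separator):
            new_row = dict(row)      # copy the other columns unchanged
            new_row[column] = item   # keep a single item in the normalized column
            yield new_row


# Hypothetical sample data; only the Sales column name comes from the article.
source = [
    {"Name": "A", "Sales": "100;200;300"},
    {"Name": "B", "Sales": "50"},
]

normalized = list(normalize(source, column="Sales", separator=";"))
for row in normalized:
    print(row)
```

Each input row produces one output row per item, so the first sample row becomes three rows and the second stays a single row.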

Talend tNormalize 8

To display the normalized data, we use the tLogRow component.

Talend tNormalize 9

Let me run the Talend tNormalize job and see the result.

Talend tNormalize 10

We want to store the tNormalize output in a text file so that we can use it as the tDenormalize source. Here, we save the result in a text file that includes the header, with the columns separated by commas. Next, run the Talend tNormalize job.
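The same output settings (a header row and comma-separated columns) can be sketched in Python as follows. This is only an analogy for the file output step, not what Talend runs internally. The file name normalized_output.txt, the temporary directory, the Name column, and the sample rows are all assumptions.

```python
import csv
import os
import tempfile

# Hypothetical normalized rows; only the Sales column name comes from the article.
rows = [
    {"Name": "A", "Sales": "100"},
    {"Name": "A", "Sales": "200"},
    {"Name": "A", "Sales": "300"},
    {"Name": "B", "Sales": "50"},
]

# Hypothetical output location, chosen so the sketch runs on any machine.
path = os.path.join(tempfile.gettempdir(), "normalized_output.txt")

with open(path, "w", newline="") as handle:
    writer = csv.DictWriter(
        handle, fieldnames=["Name", "Sales"], delimiter=",", lineterminator="\n"
    )
    writer.writeheader()    # include the header row
    writer.writerows(rows)  # one comma-separated line per normalized row

# Read the file back to check what was written.
with open(path, newline="") as handle:
    content = handle.read()
print(content)
```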

Talend tNormalize 11

Let me open the text file from the file system to show you the result.

Denorm Table