In this article We are going to show you, How to Extract Noun Phrases Using Term Extraction Transformation in SSIS with example. Before reading this article, Please refer Term Extraction in SSIS article for the definition, properties and functionality of this Transformation.
Below screenshot shows our source data
Extract Noun Phrases Using Term Extraction Transformation in SSIS
STEP 1: Open BIDS and Drag and drop the data flow task from the toolbox to control flow and rename it as Extracting Noun Phrases Using Term Extraction Transformation in SSIS.
Double click on it and it will open the data flow tab.
STEP 2: Drag and drop OLE DB Source, Term Extraction Transformation and OLE DB Destination from toolbox to data flow region
STEP 3: Double click on OLE DB source in the data flow region will open the connection manager settings and provides space to write our SQL statement.
Here we selected the [SSIS Tutorials] Database as our source database and SQL Command we used in the above screenshot is:
USE [SSIS Tutorials]
SELECT [Player Information]
FROM [Term Extraction Transformation Source]
STEP 4: Click on columns tab to verify the columns. In this tab we can uncheck the unwanted columns also.
TIP: If we don’t want any column then there is no point to add it in to your SQL command.
Drag the OLE DB source output arrow on to the Term Extraction Transformation to perform transformation on the source Data.
STEP 5: Double click on the Term Extraction Transformation will open the Term Extraction Editor to configure it. Within the Term Extraction tab, You simply need to choose the column you want to use for the Term Extraction from the available input columns. We left the output column names to default Term and Score.
Exclusion Tab: If you want to exclude specific terms during term extraction then, configure this Tab by specifying a column that contains exclusion terms.
In this example let us leave this because we want to extract all the Noun Phrases from source data.
STEP 6: Advanced tab of the Term Extraction Transformation Editor Dialog box is very important to select Term Type, Source Type and Frequency Threshold. In this example we are extracting Noun Phrases only so, we selected Noun Phrases as term type and selecting Frequency Threshold as 1. Please refer Extract Nouns Using Term Extraction Transformation in SSIS article to understand, How to Extract Nouns from the Source Data.
From the below screenshot you can see, there is a warning symbol on the Term Extraction Transformation and it is telling that error output is not connected. You can remove the warning symbol by configuring the error output of Term Extraction Transformation. So double-click on the Configure Error Output button will open new window to configure the error output.
The default configuration of a Term Extraction Transformation is to redirect error rows. You can get rid of this warning by connecting the error output or by changing the default behavior to Ignore Failure or Fail Component. Let’s change to Ignore Failure
Click ok to finish configuring the Term Extraction Transformation.
STEP 7: Now we have to provide Server, database and table details of the destination. So double-click on the OLE DB Destination and provide the required information.
Here we selected [SSIS Tutorials] database as destination data source (localhost as server instance) and [Extracting Noun Phrases using Term Extraction] table as our destination table
STEP 8: Click on Mappings tab to check whether the source columns are exactly mapped to the destination columns. If not, Please assign them to appropriate destination column
Click ok to finish designing our Extracting Noun Phrases using Term Extraction Transformation package. Let us run the package
Let’s open the SQL Server Management Studio and check the results
Thank you for Visiting Our Blog