Fuzzy Lookup Element
X-Ref:
- Ast Fuzzy Lookup Node
- Ast Single In Transformation Node
- Ast Transformation Node
- Ast Scope Boundary Node
Attributes
Attribute | API Type | Default | Description |
---|---|---|---|
ConnectionName | AstOleDbConnectionNode | This values specifies the OLE DB connection and database in which the fuzzy lookup reference table is located and where the fuzzy match index table is stored and accessed. This is a required reference to an existing definiton. | |
Name | String | Specifies the name of the object. This name can be used to reference this object from anywhere else in the program. This is a required property | |
AutoPassThroughInputColumns | Boolean | True | Specifies whether or not input columns to the Fuzzy Lookup node that lack an input mapping should be automatically set to passthrough. |
CopyReferenceTable | Boolean | True | This value specifies whether a copy of the table specified in the ReferenceTable should be made for the fuzzy lookup. |
Delimiters | String | " \t\r\n,.;:-\"'&/\\@!?()<>[]{}|#*^%" | This value specifies which set of white-space and punctuation characters is used to separate fuzzy match fields into the words used for matching and scoring a fuzzy lookup. |
DropExistingIndex | Boolean | True | This value specifies whether any existing indexes having the name specified in the MatchIndexName property should removed. The default is True. |
Exhaustive | Boolean | False | This value specifies whether each input record is directly compared against every record in the reference table. This option is primarily intended for debugging purposes and should be used with care. This method is extremely slow unless the number of records in the reference table is very small. Note also that the entire reference table is loaded into main memory if this option is enabled. |
LocaleId | Language | Inherited | This value specifies which locale is used by the dataflow task. |
MatchIndexName | String | This value specifies the name of the fuzzy-match index table to be reused or the table name in which a newly generated fuzzy-match index table will be written. | |
MatchIndexOptions | FuzzyLookupTransformMatchIndexOption | CreateNewIndex | This value indicates whether an existing fuzzy-match index table is reused or a new one is built. The new index table can be temporary, persisted, or persisted and maintained. |
MaxMemoryUsage | Int32 | 0 | This value specifies the maximum amount of memory (MB) that the current instance of the Fuzzy Lookup transformation can use. If 0 is given, then the transform dynamically adjusts memory usage based upon its requirements and available system memory. |
MaxOutputMatchesPerInput | Int32 | 1 | This value specifies which set of white-space and punctuation characters is used to separate fuzzy match fields into the words used for matching and scoring a fuzzy lookup. |
MinSimilarity | Int32 | 0 | This value specifies the minimum similarity score that is required in order for the transformation to return a match. |
ValidateExternalMetadata | Boolean | true | This value specifies whether the data flow transformation is validated against columns that originated in external data sources. When server assets such as tables and stored procedures are created during processing, ValidateExternalMetadata is normally set to False, which prevents validation from completing at compile time. |
WarmCaches | Boolean | True | This value specifies whether the error-tolerant index and the reference table are partially loaded into main memory before the Fuzzy Lookup transformation begins. This can greatly enhance performance if many input rows are to be processed. |
Singleton Children
Child | API Type | Description |
---|---|---|
<ReferenceTableInput /> | AstTableResourceNode | The AstTableResourceNode type is used for creating references to a table that has been modeled in the Biml project. This is a required property |
<ExternalReferenceTableInput /> | AstUnmanagedTableResourceNode | The AstUnmanagedTableResourceNode type is used for creating references to a table stored in an existing relational database that has not been modeled in the Biml project. This is a required property |
<ErrorHandling /> | AstComponentErrorHandlingNode | Specifies the error handling defaults to apply to an entire component |
<InputPath /> | AstDataflowInputPathNode | AstDataflowInputPathNode is used to model input paths in a SQL Server Integration Services dataflow task. Connections are made by referencing an appropriate output path. |
Collection Children
Child | API Type | Description |
---|---|---|
<Annotations> <Annotation /> </Annotations> | AstAnnotationNode | This is a collection of annotation items that can be used to specify documentation, tags, or other information. Annotations are particularly useful for storing information about nodes that can be used by BimlScript code. |
<DataflowOverrides> Multiple Choices... </DataflowOverrides> | AstDataflowOverrideNode | Provides a collection of objects to override properties of the component, its input paths, its output paths, and its consituent dataflow columns. |
<Inputs> <Column /> </Inputs> | AstFuzzyLookupColumnMappingNode | This is a collection of input column mapping definitions that determine the behavior of specific columns and their comparisons. |
<Outputs> <Column /> </Outputs> | AstDataflowColumnMappingNode | This is the collection of lookup output column mapping definitions. |