The “Address Validation” job … boilerplate goes here [ssu test edit]
The current scope of the pilot focuses solely on domestic addresses, so it is worth keeping in mind that additional work is required to bring this to full production mode with validation/standardization options for both domestic and international addresses.
Name: addressvalidation.ddf WSDL: http://54.244.240.85:21036/datasvc/AddressValidation.ddf?wsdl |
The DataFlux job has four nodes flowing from left to right.
External Data Input Node
This node provides a landing point for source data that is external to the current job. It receives raw address data from the CommIT Registration web form via a SOAP WS. The current attributes include:
Future iterations should probably include county, perhaps as iso_country_code, but for the scope of this pilot all addresses are assumed to be from the US.
Address Validation
This node verifies addresses from the US via the USPS Database. It maps the raw address data into fields which are consumed by the USPS address validation process. The outputs of this node include what is commonly referred to as the “cleansed” address, as well as the “original” address from the web form. This node also includes USPS metadata, of which we are utilizing the USPS Result Code and USPS Numeric Result Code.
At this stage, the cleansed address will include:
Text Result Code |
Numeric Result Code |
Description |
---|---|---|
OK |
0 |
Address was verified successfully. |
PARSE |
11 |
Error parsing address. Components of the address might be missing. |
CITY |
12 |
Could not locate city/state or zip in the USPS database. At least (city and state) or ZIP must be present in the input. |
MULTI |
13 |
Ambiguous address. There were two or more possible matches for this address with different data. |
NOMATCH |
14 |
No matching address found in the USPS data. |
OVER |
15 |
One or more input strings is too long (maximum 100 characters). |
Future iterations should consider deeper levels of address validation, such as Delivery Point Validation (“DPV”) and use of other nodes to handle non-US addresses.
Match Codes
This node generates proprietary match codes for both the original and cleansed versions of “Address1” and “City.” These are the match codes which the match logic will check for person/identity matches.
DataFlux Address (v22) Definition
at 85 Sensitivity
.DataFlux City Definition
at 85 Sensitivity
.Field Layout
Reorganizes the various data fields into a practical layout for the output. For clarity, the original data fields are returned with an o_
prefix (e.g., o_city) and the cleansed data fields are returned with a c_
prefix (e.g., c_city).
The full list of returned attributes include: