By
dan goldberg -
August 26, 2008
An address processing service has been added to the list of free services provided by Texas A&M GeoServices offering the ability to parse, normalize, and standardize a postal address. This service works for both single addresses and in batch for processing a database of addresses .
Address parsing is the process of dividing a single address string into its individual component parts, while address normalization converts these parts into their standardized equivalents. The services provided on this page do both of these tasks using a deterministic approach, one in a single record fashion and the other processing a database of records in batch. For address parsing, a set of rules have been developed using tokenization on white spaces between words, the ordering of the tokens, and a series of alias tables to determine each of the individual address components. Normalization is performed with more alias tables to make a best effort attempt to normalize the components to USPS Publication 28 standard values. This software is not USPS CASS Certified but works farily well on most normal address data and is constantly being improved.