GDP were approached by Xerox, a world leader in document management technology and services enterprise, with worldwide revenue income of over 17 billion dollars. They were in the process of migrating a number of ageing legacy systems to a new SAP based ERP environment. Although already utilising an ETL tool to perform the data manipulation, they soon recognised that their data issues necessitated outsourcing the process to a company specialising in international DP.
Working closely with the client, GDP developed a fast turnaround process that was subject to very aggressive KPIs. The migration was staggered, supplied in multiple ‘deltas’ of varying size and consisting of over 20 countries. Each delta was processed and quality assured within an agreed timeframe; this was crucial as the final delta return was scheduled by the minute into the new live environment switch-over.
The initial audit of the data identified problematic data trends; misspelling and truncation of address elements, fragmentation and concatenation of address lines, spurious data entries, missing country names and partial addresses missing vital elements.
Utilising advanced parsing techniques on a country by country basis, address components were identified and separated, populating distinct and comprehensive address element fields. This process employed extensive multi-language vocabularies and country specific address syntax rules. Information that fell outside of this parsing exercise was retained in separate ‘non-address’ fields. It was important that no information was lost, as much of this text was relevant and of operational importance to the client.
The enhanced addresses were then processed through our international address validation software multiple times; those failing verification were reprocessed with differing approaches to maximize the success rate.
The resultant returned files comprised of corrected and enhanced address elements, with standardised casing, corrected diacriticals and country specific mailing output formatting. Only then did they reach the quality of address cleansing and parsing required for the successful operation of the client’s proposed SAP applications.