HOME > Services & solutions > Data Hygiene
imgg/ttl_dcln.gif
data hygiene
Validate, standardize, correct incorrect postal codes, remove unnecessary characters, and update customer address data, and by utilizing Landscape’s “Best of Class” data hygiene or cleansing services supported by Japan’s largest residential name and address database (the IDB).
Within the source data several processes are carried-out: "kanji" character code conversion, external character conversion, former to new "kanji" character conversion, deletion of non-standard characters, and integration of full size and half size character fonts. Concurrently, data layout is also integrated and formatted if several layouts from multiple sources are submitted.

special note: "Kanji" is the Chinese characters that were adopted into the Japanese language over a thousand years ago and number in the thousands of different individual characters. Two-byte systems are necessary for their handling.
All data is standardized and formatted against the Landscape database masters. For example, addresses are formatted in the same order from postal code, prefecture, ward ("ku"), and on down to apartment, residence or office number.
Landscape does not correct the spelling or character representation of address details if incorrect combinations or relationships are discovered (i.e. chome - postal code mismatches, etc.). Missing data is also appended if the correct combinations or relationships can be established. These faults are generally discovered during validating passes of input data from customer against the IDB database.
From the standardized and formatted data, Landscape "matching logic" keys are created making it possible to separate consumer data into separate households, and subsequently delete duplications. An individual who belongs to a household, but is listed twice with slight variances, can be purged from the database through computer-aided name identification and several other different types of matching keys so that duplications by a minor difference can also be found.
Attribute data (sales, contact history, etc. ) of duplicate individuals or companies found by computer-aided identification processes are transferred to the master customer data for integration.
Duplicates within and between multiple input files from the customer, once identified and confirmed, are integrated into one master output file and flagged as duplicates. Customers can request that these duplicates be placed in a separate file for closer examination.
データクレンジングフロー