Data normalization - Zelix Glossary

What it means

Data normalization is the engineering discipline of giving every field a single canonical form. Phone numbers always in E.164 (+6591234567). Dates always in ISO 8601 (2026-05-18). Country codes always two letters. Customer names always trimmed and title-cased. Currencies always with an explicit ISO code.

It sounds boring. It removes a category of bugs that would otherwise haunt every AI deployment that touches data from more than one system. An AI agent that reads phone numbers from two systems and sees 'same person' instead of 'two strangers' is the bare minimum.

Why it matters

Without normalization, AI integrations spend a frustrating amount of time on data wrangling that should not exist. With it, integrations become straightforward and the team stops debugging the same kind of bug across different projects.

Normalization is also what makes a customer record portable. You can move from one CRM to another without losing the meaning of fields, because everything is in canonical form. The same applies to AI tooling: you can swap the model behind your agent and the data layer keeps working.

Example

A multi-outlet salon chain normalises phone numbers across the CRM, the booking system, and the WhatsApp inbox to E.164. The AI agent now recognises that '91234567' in the CRM, '+65 9123 4567' in the booking system, and '6591234567' in WhatsApp are all the same customer. Same person, same history, no duplicates.

Where this comes up

Workflow engineering

What it means

Why it matters

Example

Related terms

Where this comes up