POST /utility/clean
Clean and normalize fields
curl --request POST \
  --url https://api.datalegion.ai/utility/clean \
  --header 'API-Key: <api-key>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "fields": {
    "email": "John.Doe+tag@gmail.com",
    "phone": "(555) 123-4567",
    "domain": "https://www.Google.com/about",
    "company": "Google LLC"
  }
}
'
{
  "results": {
    "email": {
      "original": "John.Doe+tag@gmail.com",
      "cleaned": "johndoe@gmail.com",
      "normalized": true
    },
    "phone": {
      "original": "(555) 123-4567",
      "cleaned": "+15551234567",
      "normalized": true
    },
    "domain": {
      "original": "https://www.Google.com/about",
      "cleaned": "google.com",
      "normalized": true
    },
    "company": {
      "original": "Google LLC",
      "cleaned": {
        "name": "google",
        "type": "name"
      },
      "normalized": true
    }
  }
}
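
The same request can be issued from Python. The sketch below uses only the standard library and mirrors the URL, headers, and body of the curl example above. The function names `build_clean_request` and `clean_fields` are illustrative and not part of any official SDK, and the 30-second timeout is an arbitrary choice.

```python
import json
import urllib.request

CLEAN_URL = "https://api.datalegion.ai/utility/clean"


def build_clean_request(fields: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    body = json.dumps({"fields": fields}).encode("utf-8")
    return urllib.request.Request(
        CLEAN_URL,
        data=body,
        method="POST",
        headers={"API-Key": api_key, "Content-Type": "application/json"},
    )


def clean_fields(fields: dict, api_key: str) -> dict:
    """Send the request and return the decoded JSON response."""
    request = build_clean_request(fields, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `clean_fields({"email": "John.Doe+tag@gmail.com"}, api_key="<api-key>")` should return a dictionary shaped like the example response. Keeping request construction separate from sending makes the payload easy to inspect without a network call.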

Supported Fields

Person Fields

Field       What It Does
email       Removes plus tags, normalizes Gmail dots, lowercases
phone       Normalizes to E.164 format (e.g., +15551234567)
full_name   Strips prefixes/suffixes, normalizes whitespace
first_name  Strips prefixes, normalizes whitespace
last_name   Strips suffixes, normalizes whitespace
state       Normalizes to full state name (e.g., CA → california)
country     Normalizes to full country name (e.g., US → united states)
social_url  Normalizes LinkedIn, Twitter/X, GitHub, Facebook URLs
school      Accepts name, website URL, or domain; detects type and normalizes
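
To make the email rule concrete, here is a local approximation of the three steps listed for the email field. It is an illustration only, not the service's implementation: the function name `approximate_clean_email` is hypothetical, and treating only gmail.com addresses as dot-insensitive is an assumption. Use the endpoint itself for real data.

```python
def approximate_clean_email(raw: str) -> str:
    """Approximate the documented email rules locally (illustrative only)."""
    value = raw.strip().lower()          # lowercase the whole address
    if "@" not in value:
        raise ValueError(f"not an email address: {raw!r}")
    local, _, domain = value.rpartition("@")
    local = local.split("+", 1)[0]       # drop a plus tag such as "+tag"
    if domain == "gmail.com":            # assumption: only gmail.com ignores dots
        local = local.replace(".", "")
    return f"{local}@{domain}"
```

With the input from the example request, `approximate_clean_email("John.Doe+tag@gmail.com")` returns `"johndoe@gmail.com"`, matching the cleaned value in the example response.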

Company Fields

Field                 What It Does
domain                Extracts domain from URL, removes www. prefix
company               Accepts name, website URL, or domain; detects type and normalizes
linkedin_company_url  Normalizes LinkedIn company page URL
ticker_symbol         Uppercases and strips whitespace

The enrichment endpoints automatically clean inputs before matching. Use this endpoint to clean data before sending to search endpoints, which expect pre-normalized values.
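
One way to prepare search inputs is to collect the cleaned values from a response before building the search request. The sketch below assumes the response shape shown in the example response on this page. The helper name `cleaned_values` is hypothetical, and skipping entries that have no cleaned value is an assumption, since the page does not describe how failed cleanings are reported.

```python
def cleaned_values(response: dict) -> dict:
    """Map each field name to its cleaned value, skipping missing ones."""
    cleaned = {}
    for field, result in response.get("results", {}).items():
        value = result.get("cleaned")
        if value is not None:  # assumption: a missing/None value means "not cleaned"
            cleaned[field] = value
    return cleaned


# Trimmed copy of the example response from this page.
example_response = {
    "results": {
        "email": {
            "original": "John.Doe+tag@gmail.com",
            "cleaned": "johndoe@gmail.com",
            "normalized": True,
        },
        "company": {
            "original": "Google LLC",
            "cleaned": {"name": "google", "type": "name"},
            "normalized": True,
        },
    }
}

search_inputs = cleaned_values(example_response)
```

Note that the cleaned value for company is an object with name and type keys rather than a plain string, so downstream code should handle both shapes.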

Authorizations

API-Key · string · header · required

Body

application/json

Request model for batch field cleaning.

fields · object · required

Map of field names to the values to clean (e.g., {"email": "John.Doe+tag@gmail.com", "domain": "https://www.Google.com/about"}). Supported person fields: email, phone, full_name, first_name, last_name, state, country, social_url, school. Supported company fields: company, domain, linkedin_company_url, ticker_symbol.

default_region · string · default: US

Default region for phone number normalization (ISO 3166-1 alpha-2 code)
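
A request body using both parameters can be assembled and checked client-side before it is sent. This is a sketch under stated assumptions: the helper name `build_clean_body` is hypothetical, the field lists are copied from the description of `fields` above, and rejecting unknown field names or non-uppercase region codes locally is a design choice of this example, not documented API behavior.

```python
import re
from typing import Optional

PERSON_FIELDS = {
    "email", "phone", "full_name", "first_name", "last_name",
    "state", "country", "social_url", "school",
}
COMPANY_FIELDS = {"company", "domain", "linkedin_company_url", "ticker_symbol"}
SUPPORTED_FIELDS = PERSON_FIELDS | COMPANY_FIELDS


def build_clean_body(fields: dict, default_region: Optional[str] = None) -> dict:
    """Return a JSON-serializable request body for POST /utility/clean."""
    unknown = sorted(set(fields) - SUPPORTED_FIELDS)
    if unknown:
        raise ValueError(f"Unsupported field(s): {', '.join(unknown)}")
    body = {"fields": dict(fields)}
    if default_region is not None:
        # Assumption: codes are sent as two uppercase letters, like the default "US".
        if not re.fullmatch(r"[A-Z]{2}", default_region):
            raise ValueError("default_region must be a two-letter ISO 3166-1 alpha-2 code")
        body["default_region"] = default_region
    return body
```

For example, `build_clean_body({"phone": "020 7946 0958"}, default_region="GB")` produces a body that asks the service to normalize the phone number with GB as the default region. Omitting `default_region` leaves the key out so the documented default of US applies.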

Response

Success - fields cleaned and normalized

Response model for batch field cleaning.

results · object · required

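Map of field names to cleaning results. In the example response, each result contains original (the submitted value), cleaned (the normalized value, which for company is an object with name and type), and normalized (a boolean).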