Supported Entities

This page lists the named entities that can be automatically discovered by Gretel's Transform and Classify APIs.

What types of entities can Gretel identify?

TagNameDescriptionRegulatory Compliance

aba_routing_number

ABA Routing Number

An American bank association routing number

GDPR, HIPAA, CPRA

credit_card_number

Credit Card Number

A credit card number is 12 to 19 digits long. They are used for payment transactions globally.

GDPR, HIPAA, CPRA

date

Date

A date. This includes most date formats, as well as the names of common world holidays.

HIPAA

datetime

DateTime

A date and timestamp. This includes most date/time formats.

HIPAA

domain_name

Domain name

A domain name as defined by the DNS standard.

email_address

Email

An email address identifies the mailbox that emails are sent to or from. The maximum length of the domain name is 255 characters, and the maximum length of the local-part is 64 characters.

GDPR, HIPAA,CPRA

facebook_data

Facebook Data

API tokens and keys used for developer access to Facebook

GDPR, HIPAA, CPRA

generic_key

Generic Keys

An arbitrary character set mapped to a key that could be an API key or token

github_token

GitHub Token

A personal GitHub token

GDPR, CPRA

google_data

Google Data

Misc Google IDs, Keys, and Tokens

GDPR, HIPAA, CPRA

google_olc

Google Open Location Code

Geocode string for identifying an area using special system developed by Google

GDPR, HIPAA

gps_coordinates

GPS Coordinates

A combination of latitude and longitude into a single tuple

GDPR, HIPAA

hostname

Hostname

A name that resolves to a specific host or system on a network

iban_code

IBAN code

An International Bank Account Number (IBAN) is defined as an internationally agreed-upon method for identifying bank accounts. It's defined by the International Standard of Organization (ISO) 13616:2007 standard. ISO 13616:2007 was created by the European Committee for Banking Standards (ECBS). An IBAN consists of up to 34 alphanumeric characters including elements such as a country code or account number.

GDPR, HIPAA, CPRA

imei_hardware_id

IMEI

An International Mobile Equipment Identity (IMEI) hardware identifier, used to identify mobile phones.

GDPR, HIPAA, CPRA

imsi_subscriber_id

IMSI

An International Mobile Subscriber Identity (IMSI) identifier, used to identify mobile phone subscriber identities.

GDPR, HIPAA, CPRA

ip_address

IP address

An Internet Protocol (IP) address (either IPv4 or IPv6).

GDPR, HIPAA, CPRA

jwt

JWT

A JSON Web Token

latitude

Latitude

The angular distance of a place north or south of the earth's equator

GDPR, HIPAA

location

Location

A physical address or location.

GDPR, HIPAA

longitude

Longitude

The angular distance of a place east or west of the meridian at Greenwich, England, or west of the standard meridian of a celestial object

GDPR, HIPAA

md5

MD5

A MD5 Hash.

organization_name

Organization name

An organization name.

person_name

Person name

A full person name, which can include first names, middle names or initials, and last names.

GDPR, HIPAA, CPRA

phone_number

Phone number

A telephone number.

GDPR, HIPAA, CPRA

phone_number_namer

Phone Number North America

A telephone number valid for North America (10-digits)

GDPR, HIPAA, CPRA

sendgrid_credentials

SendGrid credentials

Credentials for use with the Sendgrid API

GDPR, HIPAA, CPRA

sha256

SHA256

A SHA256 hash.

sha512

SHA512

A SHA512 hash.

slack_secrets

Slack Secrets

Tokens or private information for Slack workspaces

GDPR, HIPAA

square_api_key

Square API Key

Various keys and tokens for the Square payment service

stripe_api_key

Stripe API Key

A public or private API key for the Stripe service

swift_code

SWIFT code

A SWIFT code is the same as a Bank Identifier Code (BIC). It's a unique identification code for a particular bank. These codes are used when transferring money between banks, particularly for international wire transfers. Banks also use the codes for exchanging other messages.

GDPR, HIPAA, CPRA

time

Time

A timestamp of a specific time of day.

twilio_data

Twilio API data

Information about Twilio API access such as SIDs or API secrets

GDPR, HIPAA, CPRA

url

URL

A Uniform Resource Locator (URL).

GDPR, HIPAA, CPRA

us_social_security_number

US Social Security Number

A United States Social Security number (SSN) is a 9-digit number issued to US citizens, permanent residents, and temporary residents. The Social Security number has effectively become the United States national identification number.

GDPR, HIPAA, CPRA

us_state

USA State

A state in the United States of America.

GDPR

us_zip_code

US Zip Code

Postal code used by the United States Postal Service

GDPR, HIPAA

uuid

UUID

A Universally Unique Identifier (UUID).

Locations Supported using NLP

When NLP prediction are enabled, additional types of locations may be predicted within natural language text fields. These locations include:

  • Address

  • Borough

  • City

  • Continent

  • County

  • Country

  • District

  • Location

  • Municipality

  • Nationality

  • Neighborhood

  • Province

  • Region

  • State

  • Suburb

  • Town

If any of these entities are recognized by the model, they will be labelled as a location.

Last updated