Supported Entities
This page lists the named entities that can be automatically discovered by Gretel's Transform and Classify APIs.
What types of entities can Gretel identify?
Tag | Name | Description | Regulatory Compliance |
---|---|---|---|
aba_routing_number | ABA Routing Number | An American Bankers Association (ABA) routing number | GDPR, HIPAA, CPRA |
credit_card_number | Credit Card Number | Credit card numbers are 12 to 19 digits long and are used for payment transactions globally. | GDPR, HIPAA, CPRA |
date | Date | A date. This includes most date formats, as well as the names of common world holidays. | HIPAA |
datetime | DateTime | A date and timestamp. This includes most date/time formats. | HIPAA |
domain_name | Domain name | A domain name as defined by the DNS standard. | |
email_address | Email address | An email address identifies the mailbox that emails are sent to or from. The maximum length of the domain name is 255 characters, and the maximum length of the local-part is 64 characters. | GDPR, HIPAA, CPRA |
facebook_data | Facebook Data | API tokens and keys used for developer access to Facebook | GDPR, HIPAA, CPRA |
generic_key | Generic Keys | An arbitrary character set mapped to a key that could be an API key or token | |
github_token | GitHub Token | A personal GitHub token | GDPR, CPRA |
google_data | Google Data | Misc Google IDs, Keys, and Tokens | GDPR, HIPAA, CPRA |
google_olc | Google Open Location Code | Geocode string that identifies an area, using a system developed by Google | GDPR, HIPAA |
gps_coordinates | GPS Coordinates | A combination of latitude and longitude into a single tuple | GDPR, HIPAA |
hostname | Hostname | A name that resolves to a specific host or system on a network | |
iban_code | IBAN code | An International Bank Account Number (IBAN) is an internationally agreed-upon method for identifying bank accounts. It's defined by the International Organization for Standardization (ISO) 13616:2007 standard. ISO 13616:2007 was created by the European Committee for Banking Standards (ECBS). An IBAN consists of up to 34 alphanumeric characters including elements such as a country code or account number. | GDPR, HIPAA, CPRA |
imei_hardware_id | IMEI | An International Mobile Equipment Identity (IMEI) hardware identifier, used to identify mobile phones. | GDPR, HIPAA, CPRA |
imsi_subscriber_id | IMSI | An International Mobile Subscriber Identity (IMSI) identifier, used to identify mobile phone subscriber identities. | GDPR, HIPAA, CPRA |
ip_address | IP address | An Internet Protocol (IP) address (either IPv4 or IPv6). | GDPR, HIPAA, CPRA |
jwt | JWT | A JSON Web Token | |
latitude | Latitude | The angular distance of a place north or south of the earth's equator | GDPR, HIPAA |
location | Location | A physical address or location. | GDPR, HIPAA |
longitude | Longitude | The angular distance of a place east or west of the meridian at Greenwich, England, or west of the standard meridian of a celestial object | GDPR, HIPAA |
md5 | MD5 | An MD5 hash. | |
organization_name | Organization name | An organization name. | |
person_name | Person name | A full person name, which can include first names, middle names or initials, and last names. | GDPR, HIPAA, CPRA |
phone_number | Phone number | A telephone number. | GDPR, HIPAA, CPRA |
phone_number_namer | Phone Number North America | A telephone number valid for North America (10 digits) | GDPR, HIPAA, CPRA |
sendgrid_credentials | SendGrid credentials | Credentials for use with the SendGrid API | GDPR, HIPAA, CPRA |
sha256 | SHA256 | A SHA256 hash. | |
sha512 | SHA512 | A SHA512 hash. | |
slack_secrets | Slack Secrets | Tokens or private information for Slack workspaces | GDPR, HIPAA |
square_api_key | Square API Key | Various keys and tokens for the Square payment service | |
stripe_api_key | Stripe API Key | A public or private API key for the Stripe service | |
swift_code | SWIFT code | A SWIFT code is the same as a Bank Identifier Code (BIC). It's a unique identification code for a particular bank. These codes are used when transferring money between banks, particularly for international wire transfers. Banks also use the codes for exchanging other messages. | GDPR, HIPAA, CPRA |
time | Time | A timestamp of a specific time of day. | |
twilio_data | Twilio API data | Information about Twilio API access such as SIDs or API secrets | GDPR, HIPAA, CPRA |
url | URL | A Uniform Resource Locator (URL). | GDPR, HIPAA, CPRA |
us_social_security_number | US Social Security Number | A United States Social Security number (SSN) is a 9-digit number issued to US citizens, permanent residents, and temporary residents. The Social Security number has effectively become the United States national identification number. | GDPR, HIPAA, CPRA |
us_state | USA State | A state in the United States of America. | GDPR |
us_zip_code | US Zip Code | Postal code used by the United States Postal Service | GDPR, HIPAA |
uuid | UUID | A Universally Unique Identifier (UUID). | |
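Several descriptions in the table state concrete format limits (credit card digit counts, email length limits, IBAN length). The following is a minimal Python sketch of those limits only. It is not Gretel's detection logic, and the function names are hypothetical; a real detector would apply much stricter checks.

```python
import re


def plausible_credit_card(value: str) -> bool:
    """True if the value has 12 to 19 digits, ignoring spaces and hyphens."""
    digits = re.sub(r"[ -]", "", value)
    return digits.isascii() and digits.isdigit() and 12 <= len(digits) <= 19


def plausible_email(value: str) -> bool:
    """True if the local-part is 1-64 chars and the domain is 1-255 chars."""
    # Split on the last "@" so the text after it is treated as the domain.
    local, sep, domain = value.rpartition("@")
    return bool(sep) and 1 <= len(local) <= 64 and 1 <= len(domain) <= 255


def plausible_iban(value: str) -> bool:
    """True if the value is at most 34 ASCII alphanumeric characters."""
    compact = value.replace(" ", "")
    return compact.isascii() and compact.isalnum() and len(compact) <= 34
```

These helpers only test length and character class, so they accept many strings that are not valid identifiers. They are meant to make the limits in the table concrete, not to replace classification.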
Locations Supported using NLP
When NLP predictions are enabled, additional types of locations may be predicted within natural language text fields. These locations include:
Address
Borough
City
Continent
County
Country
District
Location
Municipality
Nationality
Neighborhood
Province
Region
State
Suburb
Town
If any of these entities are recognized by the model, they will be labelled as a location.
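The collapsing of these location types into a single label can be sketched as a simple lookup. This is an illustration in Python under stated assumptions: the names `NLP_LOCATION_TYPES` and `to_entity_tag` are hypothetical and are not part of Gretel's API.

```python
from typing import Optional

# The location types listed above, lowercased for case-insensitive lookup.
NLP_LOCATION_TYPES = frozenset({
    "address", "borough", "city", "continent", "county", "country",
    "district", "location", "municipality", "nationality", "neighborhood",
    "province", "region", "state", "suburb", "town",
})


def to_entity_tag(nlp_type: str) -> Optional[str]:
    """Map a recognized NLP location type to the single `location` tag.

    Returns None for types that are not in the list above.
    """
    if nlp_type.strip().lower() in NLP_LOCATION_TYPES:
        return "location"
    return None
```

For example, `to_entity_tag("City")` and `to_entity_tag("Province")` both return `"location"`, so downstream code only needs to handle one tag for every location type.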