Model Suites
Last updated
Last updated
This Model Suites Page is made under and subject to the terms of the customer agreement or terms of service entered into between Gretel.ai and you (the “Customer Agreement”).
Here at Gretel.ai we understand how important data is to our customers. Hence we aim to provide you not only with high-quality data, but also with choices regarding the Model Suites (as defined in the Customer Agreement), allowing you to customize the synthetic data generation process to fit your specific business’s needs and risk tolerances. As noted in the Customer Agreement, Gretel.ai’s Model Suites are designed to be transparent, giving you visibility into your data’s provenance and helping you understand what restrictions, if any, accompany the Generated Data (as defined in the Customer Agreement).
Below you will find details about each of our Model Suites. Given that new models are continually being released, and the terms governing some models may change, we may add to, delete or update the Model Suites that will be available from time to time (with new version numbers). If such changes are material, we will endeavor to update you as to the changes after we become aware of them.
If you have any questions about these Model Suites or your rights regarding the Generated Data, please email us at support@gretel.ai.
The Apache 2.0 Model Suite makes use of models provided under the Apache 2.0 and MIT open source licenses. You may use the Generated Data for commercial purposes (including to train AI models) and without needing to provide notice or attribution to Gretel.ai or any third party. However your use of the Generated Data must comply with Gretel.ai's Acceptable Use Policy.
PLEASE CLICK ON AND REVIEW THE HYPERLINKS BELOW TO VIEW THE FULL TEXT OF THESE LICENSES, AS THEY (AND NOT THIS MODEL SUITES PAGE) GOVERN YOUR USE OF THE GENERATED DATA.
The Llama 3.x Model Suite makes use of models provided under a variety of different Llama Community Licenses. You may use the Generated Data for commercial purposes (excluding training AI models), and without needing to provide notice or attribution to Gretel.ai. However, you must comply with the following restrictions:
You will not use the Generated Data to improve any large language model (excluding any Llama model).
If you use the Generated Data to train an AI model that is distributed or made available, then you must (1) prominently display “Built with Llama” on a related website, user interface, blogpost, about page, or product documentation, and (2) include “Llama” at the beginning of any such AI model name.
If you distribute, license or sell the Generated Data or any AI model trained on the Generated Data, then you must provide copies of the Llama licenses listed in the table below.
Your use of the Generated Data must comply with applicable laws and regulations, Gretel.ai’s Acceptable Use Policy and the various Llama Acceptable Use Policies listed in the licenses below (e.g., Llama 3.2 Acceptable Use Policy, Llama 3.1 Acceptable Use Policy, Llama 2 Acceptable Use Policy).
You must not have greater than 700 million monthly active users in the previous calendar month.
You must not institute litigation or other proceedings against Meta alleging that the Generated Data or any portion thereof constitutes infringement of intellectual property or other rights owned or licensable by you.
PLEASE CLICK ON AND REVIEW THE HYPERLINKS BELOW TO VIEW THE FULL TEXT OF THESE LICENSES, AS THEY (AND NOT THIS MODEL SUITES PAGE) GOVERN YOUR USE OF THE GENERATED DATA.
Note that the Llama Model Suite may also include some models from the Apache 2.0 Model Suite.
The Gemini Model Suite makes use of models provided under the applicable Google terms (see hyperlinks in the table below). You may use the Generated Data for commercial purposes (excluding training AI models), and without needing to provide notice or attribution to Gretel.ai. However, you must comply with the following restrictions:
You will not use the Generated Data to improve any AI model or to develop a product or service that is similar to or competes with Google’s AI models.
In addition to, and without limiting, Gretel.ai's Acceptable Use Policy, you will not use the Generated Data for clinical purposes (excluding non-clinical research, scheduling, or other administrative tasks), as a substitute for professional medical advice, or in any manner that is overseen by or requires clearance or approval from any applicable regulatory authority.
Your use of the Generated Data must comply with applicable laws and regulations, Gretel.ai’s Acceptable Use Policy and Google’s Generative AI Prohibited Use Policy.
PLEASE CLICK ON AND REVIEW THE HYPERLINKS BELOW TO VIEW THE FULL TEXT OF THESE LICENSES, AS THEY (AND NOT THIS MODEL SUITES PAGE) GOVERN YOUR USE OF THE GENERATED DATA.
Note that the Gemini Model Suite may also include some models from the Apache 2.0 Model Suite.
The Azure Model Suite makes use of models provided under the applicable Microsoft Azure terms (see hyperlinks in the table below). You may use the Generated Data for commercial purposes (excluding training AI models), and without needing to provide notice or attribution to Gretel.ai. However, you must comply with the following restrictions:
You will not use and will not direct or enable third parties to use the Generated Data to train any AI models or systems that have substantially similar functionality to a Microsoft AI service, unless such AI model or system is:
an Azure AI model;
a fine-tunable model available in the Azure AI model catalog; or
an AI model that is designed to modify input or output for your use case and is deployed in an application that interacts with a Microsoft Generative AI Service.
Your use of the Generated Data must comply with applicable laws and regulations, Gretel.ai’s Acceptable Use Policy, Microsoft Acceptable Use Policy set forth in the Microsoft Universal License Terms for Online Services for Online Services and Microsoft Generative AI Services Code of Conduct.
PLEASE CLICK ON AND REVIEW THE HYPERLINKS BELOW TO VIEW THE FULL TEXT OF THESE LICENSES, AS THEY (AND NOT THIS MODEL SUITES PAGE) GOVERN YOUR USE OF THE GENERATED DATA.
Note that the Azure Model Suite may also include some models from the Apache 2.0 Model Suite.
Model
License
Mistral-7B
Mistral NeMO
Mixtral 8x7B
Phi-3-medium-14B
Phi-3.5-small-7B
Qwen-2.5-7B
Qwen-2.5-14B
Qwen-2.5-32B
Yi-1.5-9B
tinyLlama-code-math
tinyLlama-1B
OLMoE-1B-7B
Qwen-2.5-0.5B
Qwen-2.5-1.5B
microsoft/codereviewer
Yi-1.5–9B-Coder
Mamba Codestral
Qwen-2.5-Coder-1.5B
Qwen-2.5-Coder-7B
Qwen-2.5-Math-1.5B
Qwen-2.5-Math-7B
Mathstral-7B
Model
License
Llama-3.2-1B
Llama-3.2-3B
Llama-3.1-8B
Llama-3.1-70B
Llama-3.1-405B
codeLlama-7B
codeLlama-70B
Model
License
Gemini-Pro
Gemini-1.5-Pro
Gemini-1.5-Flash
Model
License
Azure/GPT-4o
Azure/GPT-3.5 Turbo
Azure/GPT-4o Mini