LogoLogo
  • Welcome to Gretel!
  • Gretel Basics
    • Getting Started
      • Quickstart
      • Blueprints
      • Use Case Examples
      • Environment Setup
        • Console
        • SDK
      • Projects
      • Inputs and Outputs
      • Gretel Connectors
        • Object Storage
          • Amazon S3
          • Google Cloud Storage
          • Azure Blob
        • Database
          • MySQL
          • PostgreSQL
          • MS SQL Server
          • Oracle Database
        • Data Warehouse
          • Snowflake
          • BigQuery
          • Databricks
        • Gretel Project
    • Release Notes
      • Platform Release Notes
        • May 2025
        • April 2025
        • March 2025
        • February 2025
        • January 2025
        • December 2024
        • November 2024
        • October 2024
        • September 2024
        • August 2024
        • July 2024
        • June 2024
      • Console Release Notes
        • January 2025
        • December 2024
        • November 2024
        • October 2024
        • September 2024
        • August 2024
      • Python SDKs
  • Create Synthetic Data
    • Gretel Safe Synthetics
      • Transform
        • Reference
        • Examples
        • Supported Entities
      • Synthetics
        • Gretel Tabular Fine-Tuning
        • Gretel Text Fine-Tuning
        • Gretel Tabular GAN
        • Benchmark Report
        • Privacy Protection
      • Evaluate
        • Synthetic Quality & Privacy Report
        • Tips to Improve Synthetic Data Quality
        • Data Privacy 101
      • SDK
    • Gretel Data Designer
      • Getting Started with Data Designer
      • Define your Data Columns
        • Column Types
        • Add Constraints to Columns
        • Custom Model Configurations
        • Upload Files as Seeds
      • Building your Dataset
        • Seeding your Dataset
        • Generating Data
      • Generate Realistic Personal Details
      • Structured Outputs
      • Code Validation
      • Data Evaluation
      • Magic Assistance
      • Using Jinja Templates
  • Gretel Playground [Legacy]
    • Getting Started
    • Prompts Tips & Best Practices
    • FAQ
    • SDK Examples
    • Tutorials
    • Videos
    • Gretel Playground [Legacy] Inference API
    • Batch Job SDK
  • Reference
    • Gretel's Python Client
    • Gretel’s Open Source Synthetic Engine
    • Gretel’s REST API
    • Homepage
    • Model Suites
Powered by GitBook
On this page

Was this helpful?

Export as PDF
  1. Create Synthetic Data

Gretel Data Designer

PreviousSDKNextGetting Started with Data Designer

Last updated 27 days ago

Was this helpful?

Data Designer is a general purpose system for building datasets to improve your AI models. Developers can describe the attributes of the dataset they want and iterate on the generated data through fast previews and detailed evaluations.

With Data Designer, you get:

  • Speed: Generate preview datasets in minutes, production datasets in hours

  • Quality: Built-in evaluation metrics ensure accuracy and relevance

  • Simplicity: Automated workflows replace complex manual processes

  • Scale: Move from proof-of-concept to production without rebuilding

  • Data-centric AI: Unlock true data experimentation with rapid iteration on use-case-specific data.

If you're looking for hands-on examples, check out our , where you'll find:

✅ A 101 tutorial – A notebook series to get you started ✅ Structured Outputs – Generate complex, nested synthetic data ✅ Evaluation Datasets – Create high-quality AI evaluation datasets ✅ Multi-turn Chat – Build user-assistant dialogue datasets ✅ Text-to-SQL & Text-to-Code – Generate SQL & Python code datasets ✅✅✅ And many more!

Data Designer is a synthetic data generation tool that helps you build repeatable and scalable synthetic data generation pipelines. You can use the Gretel SDK to declaratively define the dataset you want - the columns, the characteristics of each column, constraints across columns, validation or evaluation for columns, and then generate data that matches is.

Data Designer is meant to be used iteratively as you design the perfect dataset for your use case.

Example Notebooks