22-25 April 2026

Evaluating the realism of synthetic data in DevOps

2025

TL; DR

Synthetic data is essential for training models, testing apps, and ensuring privacy. However, its quality evaluation varies by application. This session offers insights into assessing synthetic data's usability, accuracy, privacy, and key factors for effective production use.

Session Details

Synthetic data is rapidly gaining traction, but evaluating its quality remains complex. What works for one application may not be suitable for another. Given its critical role in training machine learning models, testing applications, and ensuring data privacy, it’s essential to assess how well synthetic data mirrors real-world data while safeguarding sensitive information. DevOps and data teams must prioritize the right metrics in testing environments. In this session, we’ll provide practical insights into assessing and applying synthetic data effectively, helping attendees understand its limitations and key considerations for different use cases.

3 things you'll get out of this session

Understanding the key factors for evaluating synthetic data Gaining practical insights into how synthetic data can be applied in their development and testing workflows. Recognizing the limitations and considerations of synthetic data for different use cases.