Results on Public Datasets
Use the Galileo Sandbox environment to explore the enterprise-grade data quality platform
Galileo helps you discover insights and errors in your training dataset within minutes, not days! You can now confidently ditch Excel sheets and ad hoc Python scripts, and skip the cumbersome detective work of exploratory dataset analysis.
We trained a pretrained DistilBERT model to convergence on four popular public datasets across two tasks (described in the table below). We then used Galileo to inspect, discover, and fix dataset errors, guided by insights surfaced in the UI.