Using Alerts

Galileo Alerts are your starting point in your data inspection journey. After you complete a run, Galileo surfaces a summary of issues it has found in your dataset in the Alerts section. Each Alert represents a problematic pocket of data that Galileo has identified.

Clicking on an alert filters the dataset to that problematic subset of data so you can fix it.

Alerts also explain why that subset of your data might be causing issues and tell you how you can fix it. You can think of Alerts as a partner Data Scientist working with you to find and fix problems in your data.

Alerts that we support today

We support a growing list of alerts, and are open to feature requests! Some of the highlights include:

Hard for the model

Exposes the samples we believe are hard for your model to learn. These are the samples with high Data Error Potential scores.
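
As a rough illustration of the kind of subset this alert surfaces, the sketch below keeps only samples whose Data Error Potential exceeds a cutoff. The `dep` field name, the `0.7` threshold, and the `hard_samples` helper are assumptions made for this example; they are not Galileo's API or the threshold Galileo uses.

```python
from typing import Dict, List


def hard_samples(samples: List[Dict], threshold: float = 0.7) -> List[Dict]:
    """Return samples whose (hypothetical) `dep` score exceeds the threshold, hardest first."""
    flagged = [s for s in samples if s["dep"] > threshold]
    return sorted(flagged, key=lambda s: s["dep"], reverse=True)


# Toy data: `dep` stands in for a per-sample Data Error Potential score.
samples = [
    {"id": 0, "dep": 0.12},
    {"id": 1, "dep": 0.91},
    {"id": 2, "dep": 0.78},
]
print([s["id"] for s in hard_samples(samples)])
```

Sorting hardest-first mirrors how you would typically review such a subset: the most suspicious samples come up before the borderline ones.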

Hard for the model cluster

Exposes clusters of data that have a high Data Error Potential.

High Uncertainty Outputs

Surfaces samples that have High Uncertainty on the generated output (only available if generations were created for this split).

High Perplexity Samples

Identifies samples whose predictions have high Perplexity.
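
For reference, perplexity is conventionally defined as the exponential of the average negative log-likelihood per token. The sketch below computes it from a list of per-token natural-log probabilities. The `perplexity` helper and its input format are assumptions for illustration; this page does not specify how Galileo computes the value or what cutoff counts as "high".

```python
import math
from typing import List


def perplexity(token_logprobs: List[float]) -> float:
    """Standard perplexity: exp of the mean negative natural-log probability per token."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


# A model that assigns probability 0.5 to every token is, on average,
# as uncertain as a fair coin flip per token.
print(perplexity([math.log(0.5)] * 4))
```

Lower values mean the model found the text more predictable; a sample that scores well above the rest of the split is a candidate for inspection.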

Empty Samples

Identifies samples that have empty Input, empty Target or empty Generations.
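
A check of this kind can be sketched in a few lines. The field names `input`, `target`, and `generation`, and the choice to treat whitespace-only strings as empty, are assumptions for this example and not a description of Galileo's implementation.

```python
from typing import Dict, List, Optional

# Hypothetical field names for one sample record.
FIELDS = ("input", "target", "generation")


def empty_fields(sample: Dict[str, Optional[str]]) -> List[str]:
    """List the fields that are present but None, empty, or whitespace-only."""
    return [
        name
        for name in FIELDS
        if name in sample and not (sample[name] or "").strip()
    ]


print(empty_fields({"input": "Summarize this.", "target": "  ", "generation": None}))
```

Fields that are absent from the record are skipped instead of flagged, so a split without generations is not reported as having empty generations.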

Low Performing Cluster

Exposes clusters that have poor BLEU or ROUGE scores (only available if generations were created for this split).
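
To make the idea concrete, the sketch below averages a per-sample score within each cluster and reports clusters whose mean falls below a cutoff. The `cluster_id` and `rouge` field names, the `0.3` threshold, and the `low_performing_clusters` helper are all hypothetical; how Galileo forms clusters and decides that a score is "poor" is not specified here.

```python
from collections import defaultdict
from typing import Dict, List


def low_performing_clusters(rows: List[Dict], threshold: float = 0.3) -> Dict[int, float]:
    """Map each cluster id to its mean score, keeping only clusters below the threshold."""
    scores = defaultdict(list)
    for row in rows:
        scores[row["cluster_id"]].append(row["rouge"])
    means = {cid: sum(vals) / len(vals) for cid, vals in scores.items()}
    return {cid: mean for cid, mean in means.items() if mean < threshold}


# Toy data: `rouge` stands in for a per-sample ROUGE (or BLEU) score.
rows = [
    {"cluster_id": 0, "rouge": 0.1},
    {"cluster_id": 0, "rouge": 0.2},
    {"cluster_id": 1, "rouge": 0.8},
    {"cluster_id": 1, "rouge": 0.6},
]
print(sorted(low_performing_clusters(rows)))
```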

How to request a new alert?

Have a great idea for a new alert? We'd love to hear about it! File any requests under your Profile Avatar Menu > "Bug Report or Feature Request", and we'll immediately get your request 🔭.
