Data Error Potential (DEP) scores are shown throughout the product. Token-level highlighting of DEP can be turned on wherever the Target Output is shown. Red indicates high DEP, orange medium DEP, and green low DEP.
How to use DEP?
Look for patterns in groups of High DEP samples (e.g. a High DEP cluster). A High Data Error Potential might be due to a mistake in the annotation (e.g. expecting an answer that the model couldn’t possibly infer from the input), to a lack of “similar samples” (something the model could learn, but only if you feed it more samples like it), or to the sample simply being garbage that needs to be removed.
Determine whether you need to Edit Target and change your Target Output, Remove your samples, or Find Similar Data to include in your dataset, and take action.
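If you export your samples and want to look for High DEP patterns outside the product, the grouping step can be sketched roughly as below. This is only an illustration under stated assumptions: the `dep`, `cluster`, and `id` field names, the sample values, and the 0.7 cutoff are all hypothetical and do not reflect the product's export format or its actual DEP thresholds.

```python
from collections import defaultdict

# Hypothetical cutoff: the product's real boundary for "high" DEP is not stated here.
HIGH_DEP_THRESHOLD = 0.7


def high_dep_clusters(samples, threshold=HIGH_DEP_THRESHOLD):
    """Group samples at or above the threshold by cluster, largest group first."""
    groups = defaultdict(list)
    for sample in samples:
        if sample["dep"] >= threshold:
            groups[sample["cluster"]].append(sample["id"])
    # Clusters with the most High DEP samples are the first place to look for a pattern.
    return sorted(groups.items(), key=lambda item: len(item[1]), reverse=True)


# Made-up records for illustration only.
samples = [
    {"id": 1, "cluster": "a", "dep": 0.91},
    {"id": 2, "cluster": "a", "dep": 0.85},
    {"id": 3, "cluster": "b", "dep": 0.12},
    {"id": 4, "cluster": "b", "dep": 0.78},
    {"id": 5, "cluster": "c", "dep": 0.30},
]

for cluster, ids in high_dep_clusters(samples):
    print(cluster, ids)
```

Clusters that surface at the top of this list are candidates for review. Whether to Edit Target, Remove, or Find Similar Data still depends on reading the samples themselves.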