Updated multilingual model & consistent class names

laurahanu released this 27 Oct 17:03


Updated the multilingual model weights used by Detoxify. The new model was trained on the translated data from both the 1st and 2nd Jigsaw challenges. It has also been trained to minimise bias and now returns the same categories as the unbiased model. New best AUC score on the test set: 92.11 (previously 89.71).
All Detoxify models now return consistent class names. For example, "identity_attack" replaces "identity_hate" in the original model, matching the classes of the unbiased model.
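
Downstream code that still reads results by the old key may need a small shim. The following is a minimal sketch, not part of Detoxify: `normalise_class_names` and `LEGACY_TO_CURRENT` are hypothetical names, the scores are made-up placeholders, and the mapping lists only the rename mentioned in these notes, so it may be incomplete.

```python
# Hypothetical helper (not part of the Detoxify API): maps legacy class
# names to the current ones in a dict of scores.
# Only the rename stated in these release notes is listed here.
LEGACY_TO_CURRENT = {"identity_hate": "identity_attack"}


def normalise_class_names(scores):
    """Return a copy of `scores` with legacy class names replaced."""
    return {LEGACY_TO_CURRENT.get(name, name): value
            for name, value in scores.items()}


# Placeholder scores in the old naming style (illustrative values only).
old_style = {"insult": 0.02, "identity_hate": 0.01}
new_style = normalise_class_names(old_style)
print(sorted(new_style))
```

Names that are not in the mapping pass through unchanged, so the helper can be applied to results produced either before or after this release.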
