‘How Do i Fool You?’ Manipulating User Trust via Misleading Black Box Explanations.
Explainable AI
Surrogate Explainers
Critique
Paper
Description
Lakkaraju and Bastani (2020) show how misleading black box explanations can manipulate users into trusting an untrustworthy model.
References
Lakkaraju, Himabindu, and Osbert Bastani. 2020. “" How Do i Fool You?" Manipulating User Trust via Misleading Black Box Explanations.” In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, 79–85.