
Publications

Filter Results: (29)

Show Results For
  • All HBS Web (116)
  • Faculty Publications (29)

Active filter: Explanation Methods

Page 1 of 29 Results
      • 2025
      • Working Paper

      Home Sweet Home: How Much Do Employees Value Remote Work?

      By: Zoë B. Cullen, Bobak Pakzad-Hurson and Ricardo Perez-Truglia
      We estimate the value employees place on remote work using revealed preferences in a high-stakes, real-world context, focusing on U.S. tech workers. On average, employees are willing to accept a 25% pay cut for partly or fully remote roles. Our estimates are three to...
      Keywords: Employees; Compensation and Benefits; Satisfaction; Value; Research
      Cullen, Zoë B., Bobak Pakzad-Hurson, and Ricardo Perez-Truglia. "Home Sweet Home: How Much Do Employees Value Remote Work?" NBER Working Paper Series, No. 33383, January 2025.
      • 2023
      • Article

      M4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities, and Models

      By: Himabindu Lakkaraju, Xuhong Li, Mengnan Du, Jiamin Chen, Yekun Chai and Haoyi Xiong
      While Explainable Artificial Intelligence (XAI) techniques have been widely studied to explain predictions made by deep neural networks, the way to evaluate the faithfulness of explanation results remains challenging, due to the heterogeneity of explanations for...
      Keywords: AI and Machine Learning
      Lakkaraju, Himabindu, Xuhong Li, Mengnan Du, Jiamin Chen, Yekun Chai, and Haoyi Xiong. "M4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities, and Models." Advances in Neural Information Processing Systems (NeurIPS) (2023).
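For context on what a "faithfulness" evaluation of feature attributions involves, here is a minimal sketch of one common variant, a deletion test: occlude the features an attribution ranks highest and measure how far the model's confidence falls. The model, data, and attribution below are synthetic stand-ins, not components of the M4 benchmark itself.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Toy tabular setup standing in for a real model and dataset.
X = rng.normal(size=(200, 10))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)
model = LogisticRegression().fit(X, y)

def deletion_faithfulness(model, x, attributions, k, baseline=0.0):
    """Drop in predicted probability after occluding the top-k attributed
    features; a more faithful attribution should yield a larger drop."""
    top_k = np.argsort(-np.abs(attributions))[:k]
    x_masked = x.copy()
    x_masked[top_k] = baseline
    p_full = model.predict_proba(x[None, :])[0, 1]
    p_masked = model.predict_proba(x_masked[None, :])[0, 1]
    return p_full - p_masked

x = X[0]
attr = model.coef_[0] * x   # stand-in attribution (coefficient * input)
print(deletion_faithfulness(model, x, attr, k=3))
```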
      • 2023
      • Article

      Post Hoc Explanations of Language Models Can Improve Language Models

      By: Satyapriya Krishna, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh and Himabindu Lakkaraju
      Large Language Models (LLMs) have demonstrated remarkable capabilities in performing complex tasks. Moreover, recent research has shown that incorporating human-annotated rationales (e.g., Chain-of-Thought prompting) during in-context learning can significantly enhance...
      Keywords: AI and Machine Learning; Performance Effectiveness
      Krishna, Satyapriya, Jiaqi Ma, Dylan Slack, Asma Ghandeharioun, Sameer Singh, and Himabindu Lakkaraju. "Post Hoc Explanations of Language Models Can Improve Language Models." Advances in Neural Information Processing Systems (NeurIPS) (2023).
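As a rough illustration of feeding explanation-derived rationales back into in-context learning, the sketch below assembles a few-shot prompt in which each demonstration lists its most influential words. The texts, labels, and `top_words` lists are invented placeholders; in the paper's setting such rationales would come from a post hoc attribution method, and the actual prompt format differs.

```python
def build_rationale_prompt(examples, query):
    """Few-shot prompt where each demonstration carries a rationale listing
    its key words (here, a stand-in for attribution-derived rationales)."""
    parts = []
    for text, top_words, label in examples:
        parts.append(
            f"Input: {text}\n"
            f"Key words: {', '.join(top_words)}\n"
            f"Label: {label}\n"
        )
    parts.append(f"Input: {query}\nKey words:")
    return "\n".join(parts)

examples = [
    ("The plot was dull and the acting worse.", ["dull", "worse"], "negative"),
    ("A moving, beautifully shot film.", ["moving", "beautifully"], "positive"),
]
print(build_rationale_prompt(examples, "Sharp dialogue and a great cast."))
```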
      • 2023
      • Article

      Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability

      By: Usha Bhalla, Suraj Srinivas and Himabindu Lakkaraju
      With the increased deployment of machine learning models in various real-world applications, researchers and practitioners alike have emphasized the need for explanations of model behaviour. To this end, two broad strategies have been outlined in prior literature to...
      Keywords: AI and Machine Learning; Mathematical Methods
      Bhalla, Usha, Suraj Srinivas, and Himabindu Lakkaraju. "Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability." Advances in Neural Information Processing Systems (NeurIPS) (2023).
      • 2023
      • Article

      Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness

      By: Suraj Srinivas, Sebastian Bordt and Himabindu Lakkaraju
      One of the remarkable properties of robust computer vision models is that their input-gradients are often aligned with human perception, referred to in the literature as perceptually-aligned gradients (PAGs). Despite only being trained for classification, PAGs cause...
      Keywords: AI and Machine Learning; Mathematical Methods
      Srinivas, Suraj, Sebastian Bordt, and Himabindu Lakkaraju. "Which Models Have Perceptually-Aligned Gradients? An Explanation via Off-Manifold Robustness." Advances in Neural Information Processing Systems (NeurIPS) (2023).
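The input-gradients at issue are simply gradients of a class logit with respect to the input pixels. A minimal sketch, using an untrained stand-in network rather than the trained (often adversarially robust) vision models the paper studies:

```python
import torch
import torch.nn as nn

# Untrained stand-in for a vision classifier.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))

def input_gradient(model, x, target_class):
    """Gradient of the target-class logit with respect to the input."""
    x = x.clone().requires_grad_(True)
    logit = model(x)[0, target_class]
    (grad,) = torch.autograd.grad(logit, x)
    return grad

x = torch.randn(1, 3, 32, 32)
g = input_gradient(model, x, target_class=3)
print(g.shape)  # same shape as the input: one attribution value per pixel
```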
      • 2023
      • Article

      On Minimizing the Impact of Dataset Shifts on Actionable Explanations

      By: Anna P. Meyer, Dan Ley, Suraj Srinivas and Himabindu Lakkaraju
      The Right to Explanation is an important regulatory principle that allows individuals to request actionable explanations for algorithmic decisions. However, several technical challenges arise when providing such actionable explanations in practice. For instance, models...
      Keywords: Mathematical Methods; Analytics and Data Science
      Meyer, Anna P., Dan Ley, Suraj Srinivas, and Himabindu Lakkaraju. "On Minimizing the Impact of Dataset Shifts on Actionable Explanations." Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI) 39th (2023): 1434–1444.
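One failure mode the abstract alludes to: a counterfactual (recourse) that is valid under the original model can stop being valid once the model is retrained on shifted data. A minimal synthetic sketch, using the closed-form counterfactual for a linear model (the paper's setting and methods are more general):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)

def fit(shift=0.0):
    X = rng.normal(size=(500, 2))
    y = (X[:, 0] + X[:, 1] > 0).astype(int)
    return LogisticRegression().fit(X + shift, y)  # train on (possibly shifted) features

model_old = fit()
model_new = fit(shift=0.4)  # retrained after a synthetic dataset shift

# Nearest counterfactual for a linear model: step across the decision
# boundary along the coefficient direction, overshooting slightly.
x = np.array([-1.0, -1.0])
w, b = model_old.coef_[0], model_old.intercept_[0]
x_cf = x - 1.05 * ((w @ x + b) / (w @ w)) * w

print("class under old model:  ", model_old.predict([x_cf])[0])  # flipped to 1
print("class after retraining: ", model_new.predict([x_cf])[0])  # may revert to 0
```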
      • August 2023
      • Article

      Explaining Machine Learning Models with Interactive Natural Language Conversations Using TalkToModel

      By: Dylan Slack, Satyapriya Krishna, Himabindu Lakkaraju and Sameer Singh
      Practitioners increasingly use machine learning (ML) models, yet models have become more complex and harder to understand. To understand complex models, researchers have proposed techniques to explain model predictions. However, practitioners struggle to use...
      Keywords: AI and Machine Learning; Technological Innovation; Technology Adoption
      Slack, Dylan, Satyapriya Krishna, Himabindu Lakkaraju, and Sameer Singh. "Explaining Machine Learning Models with Interactive Natural Language Conversations Using TalkToModel." Nature Machine Intelligence 5, no. 8 (August 2023): 873–883.
      • 2023
      • Article

      Towards Bridging the Gaps between the Right to Explanation and the Right to Be Forgotten

      By: Himabindu Lakkaraju, Satyapriya Krishna and Jiaqi Ma
      The Right to Explanation and the Right to be Forgotten are two important principles outlined to regulate algorithmic decision making and data usage in real-world applications. While the right to explanation allows individuals to request an actionable explanation for an...
      Keywords: Analytics and Data Science; AI and Machine Learning; Decision Making; Governing Rules, Regulations, and Reforms
      Lakkaraju, Himabindu, Satyapriya Krishna, and Jiaqi Ma. "Towards Bridging the Gaps between the Right to Explanation and the Right to Be Forgotten." Proceedings of the International Conference on Machine Learning (ICML) 40th (2023): 17808–17826.
      • 2022
      • Article

      OpenXAI: Towards a Transparent Evaluation of Model Explanations

      By: Chirag Agarwal, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik and Himabindu Lakkaraju
      While several types of post hoc explanation methods have been proposed in recent literature, there is very little work on systematically benchmarking these methods. Here, we introduce OpenXAI, a comprehensive and extensible open-source framework for evaluating and...
      Keywords: Measurement and Metrics; Analytics and Data Science
      Agarwal, Chirag, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik, and Himabindu Lakkaraju. "OpenXAI: Towards a Transparent Evaluation of Model Explanations." Advances in Neural Information Processing Systems (NeurIPS) (2022).
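One way such benchmarking can work is to score candidate explanations against models whose ground-truth feature importances are known. The sketch below illustrates the idea on a transparent logistic regression; it is a generic illustration and does not use the OpenXAI API itself.

```python
import numpy as np
from scipy.stats import spearmanr
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(2)
true_w = np.array([2.0, -1.0, 0.5, 0.0, 0.0])  # known ground-truth importances
X = rng.normal(size=(300, 5))
y = (X @ true_w + 0.1 * rng.normal(size=300) > 0).astype(int)
model = LogisticRegression().fit(X, y)

# Candidate "explanation": a noisy copy of the learned coefficients,
# standing in for the output of some attribution method under evaluation.
explanation = model.coef_[0] + 0.3 * rng.normal(size=5)

# Rank agreement with ground truth, one family of metrics such benchmarks report.
rho, _ = spearmanr(np.abs(true_w), np.abs(explanation))
print(f"rank agreement with ground truth: {rho:.2f}")
```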
      • 2022
      • Article

      Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post hoc Explanations

      By: Tessa Han, Suraj Srinivas and Himabindu Lakkaraju
      A critical problem in the field of post hoc explainability is the lack of a common foundational goal among methods. For example, some methods are motivated by function approximation, some by game theoretic notions, and some by obtaining clean visualizations. This...
      Keywords: Mathematical Methods; Decision Choices and Conditions; Analytics and Data Science
      Han, Tessa, Suraj Srinivas, and Himabindu Lakkaraju. "Which Explanation Should I Choose? A Function Approximation Perspective to Characterizing Post hoc Explanations." Advances in Neural Information Processing Systems (NeurIPS) (2022). (Best Paper Award, International Conference on Machine Learning (ICML) Workshop on Interpretable ML in Healthcare.)
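To make the function-approximation view concrete: many post hoc methods can be read as fitting a simple surrogate to the black box in a neighborhood of one input, with the surrogate's coefficients serving as the explanation. A minimal LIME-style sketch, with a synthetic black box and an assumed kernel width `scale`:

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import Ridge

rng = np.random.default_rng(3)
X = rng.normal(size=(500, 4))
y = (np.sin(X[:, 0]) + X[:, 1] ** 2 > 1).astype(int)
black_box = RandomForestClassifier(random_state=0).fit(X, y)

def local_linear_explanation(predict_proba, x, n_samples=500, scale=0.3):
    """Fit a distance-weighted linear surrogate around x; its coefficients
    approximate the black box locally and act as the explanation."""
    Z = x + scale * rng.normal(size=(n_samples, x.size))  # perturb around x
    p = predict_proba(Z)[:, 1]
    weights = np.exp(-np.sum((Z - x) ** 2, axis=1) / (2 * scale ** 2))
    surrogate = Ridge(alpha=1.0).fit(Z, p, sample_weight=weights)
    return surrogate.coef_

print(local_linear_explanation(black_box.predict_proba, X[0]))
```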
      • 2022
      • Article

      Fairness via Explanation Quality: Evaluating Disparities in the Quality of Post hoc Explanations

      By: Jessica Dai, Sohini Upadhyay, Ulrich Aivodji, Stephen Bach and Himabindu Lakkaraju
      As post hoc explanation methods are increasingly being leveraged to explain complex models in high-stakes settings, it becomes critical to ensure that the quality of the resulting explanations is consistently high across all subgroups of a population. For instance, it...
      Keywords: Prejudice and Bias; Mathematical Methods; Research; Analytics and Data Science
      Dai, Jessica, Sohini Upadhyay, Ulrich Aivodji, Stephen Bach, and Himabindu Lakkaraju. "Fairness via Explanation Quality: Evaluating Disparities in the Quality of Post hoc Explanations." Proceedings of the AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society (2022): 203–214.
      • 2022
      • Conference Presentation

      Towards the Unification and Robustness of Post hoc Explanation Methods

      By: Sushant Agarwal, Shahin Jabbari, Chirag Agarwal, Sohini Upadhyay, Steven Wu and Himabindu Lakkaraju
      As machine learning black boxes are increasingly being deployed in critical domains such as healthcare and criminal justice, there has been a growing emphasis on developing techniques for explaining these black boxes in a post hoc manner. In this work, we analyze two...
      Keywords: AI and Machine Learning
      Agarwal, Sushant, Shahin Jabbari, Chirag Agarwal, Sohini Upadhyay, Steven Wu, and Himabindu Lakkaraju. "Towards the Unification and Robustness of Post hoc Explanation Methods." Paper presented at the 3rd Symposium on Foundations of Responsible Computing (FORC), 2022.
      • 2022
      • Article

      Exploring Counterfactual Explanations Through the Lens of Adversarial Examples: A Theoretical and Empirical Analysis.

      By: Martin Pawelczyk, Chirag Agarwal, Shalmali Joshi, Sohini Upadhyay and Himabindu Lakkaraju
      As machine learning (ML) models become more widely deployed in high-stakes applications, counterfactual explanations have emerged as key tools for providing actionable model explanations in practice. Despite the growing popularity of counterfactual explanations, a...
      Keywords: Machine Learning Models; Counterfactual Explanations; Adversarial Examples; Mathematical Methods
      Pawelczyk, Martin, Chirag Agarwal, Shalmali Joshi, Sohini Upadhyay, and Himabindu Lakkaraju. "Exploring Counterfactual Explanations Through the Lens of Adversarial Examples: A Theoretical and Empirical Analysis." Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS) 25th (2022).
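The connection to adversarial examples is visible in code: a gradient-based counterfactual search is structurally the same loop as a gradient-based attack, differing mainly in the distance penalty and the intent. A minimal sketch with an untrained stand-in classifier (an illustration of the analogy, not the paper's formal analysis):

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(4, 2))  # untrained stand-in classifier

def counterfactual(model, x, target=1, steps=200, lr=0.05, lam=0.1):
    """Push the prediction toward `target` by gradient descent on the input,
    while penalizing distance from the original point."""
    x_cf = x.clone().requires_grad_(True)
    opt = torch.optim.Adam([x_cf], lr=lr)
    for _ in range(steps):
        loss = nn.functional.cross_entropy(
            model(x_cf[None]), torch.tensor([target])
        ) + lam * torch.norm(x_cf - x)
        opt.zero_grad()
        loss.backward()
        opt.step()
    return x_cf.detach()

x = torch.zeros(4)
x_cf = counterfactual(model, x)
print(model(x_cf[None]).argmax().item())  # ideally the target class
```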
      • 2022
      • Article

      Probing GNN Explainers: A Rigorous Theoretical and Empirical Analysis of GNN Explanation Methods.

      By: Chirag Agarwal, Marinka Zitnik and Himabindu Lakkaraju
      As Graph Neural Networks (GNNs) are increasingly employed in real-world applications, it becomes critical to ensure that the stakeholders understand the rationale behind their predictions. While several GNN explanation methods have been proposed recently, there has...
      Keywords: Graph Neural Networks; Explanation Methods; Mathematical Methods; Framework; Theory; Analysis
      Agarwal, Chirag, Marinka Zitnik, and Himabindu Lakkaraju. "Probing GNN Explainers: A Rigorous Theoretical and Empirical Analysis of GNN Explanation Methods." Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS) 25th (2022).
      • 2022
      • Working Paper

      The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective

      By: Satyapriya Krishna, Tessa Han, Alex Gu, Javin Pombra, Shahin Jabbari, Steven Wu and Himabindu Lakkaraju
      As various post hoc explanation methods are increasingly being leveraged to explain complex models in high-stakes settings, it becomes critical to develop a deeper understanding of if and when the explanations output by these methods disagree with each other, and how...
      Keywords: AI and Machine Learning; Analytics and Data Science; Mathematical Methods
      Krishna, Satyapriya, Tessa Han, Alex Gu, Javin Pombra, Shahin Jabbari, Steven Wu, and Himabindu Lakkaraju. "The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective." Working Paper, 2022.
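To make "disagreement" concrete: take the attributions two methods assign to the same prediction and compare them, for instance by top-k feature overlap and rank correlation. The two attribution vectors below are invented for illustration; the paper defines and studies a broader family of such metrics.

```python
import numpy as np
from scipy.stats import kendalltau

def top_k_agreement(attr_a, attr_b, k=3):
    """Fraction of overlap between the two methods' top-k features."""
    top_a = set(np.argsort(-np.abs(attr_a))[:k])
    top_b = set(np.argsort(-np.abs(attr_b))[:k])
    return len(top_a & top_b) / k

# Hypothetical attributions for one prediction from two different methods.
attr_method_a = np.array([0.9, 0.1, -0.4, 0.05, 0.3])
attr_method_b = np.array([0.2, 0.8, -0.35, 0.4, 0.1])

tau, _ = kendalltau(attr_method_a, attr_method_b)
print("top-3 agreement:", top_k_agreement(attr_method_a, attr_method_b))
print("rank correlation:", tau)
```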
      • 2022
      • Working Paper

      TalkToModel: Explaining Machine Learning Models with Interactive Natural Language Conversations

      By: Dylan Slack, Satyapriya Krishna, Himabindu Lakkaraju and Sameer Singh
      Practitioners increasingly use machine learning (ML) models, yet they have become more complex and harder to understand. To address this issue, researchers have proposed techniques to explain model predictions. However, practitioners struggle to use explainability...
      Keywords: Natural Language Conversations; Predictive Models; AI and Machine Learning
      Slack, Dylan, Satyapriya Krishna, Himabindu Lakkaraju, and Sameer Singh. "TalkToModel: Explaining Machine Learning Models with Interactive Natural Language Conversations." Working Paper, 2022.
      • 2021
      • Article

      Counterfactual Explanations Can Be Manipulated

      By: Dylan Slack, Sophie Hilgard, Himabindu Lakkaraju and Sameer Singh
      Counterfactual explanations are useful for both generating recourse and auditing fairness between groups. We seek to understand whether adversaries can manipulate counterfactual explanations in an algorithmic recourse setting: if counterfactual explanations indicate...
      Keywords: Machine Learning Models; Counterfactual Explanations
      Slack, Dylan, Sophie Hilgard, Himabindu Lakkaraju, and Sameer Singh. "Counterfactual Explanations Can Be Manipulated." Advances in Neural Information Processing Systems (NeurIPS) 34 (2021).
      • 2021
      • Article

      Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

      By: Dylan Slack, Sophie Hilgard, Sameer Singh and Himabindu Lakkaraju
      As black box explanations are increasingly being employed to establish model credibility in high stakes settings, it is important to ensure that these explanations are accurate and reliable. However, prior work demonstrates that explanations generated by...
      Keywords: Black Box Explanations; Bayesian Modeling; Decision Making; Risk and Uncertainty; Information Technology
      Slack, Dylan, Sophie Hilgard, Sameer Singh, and Himabindu Lakkaraju. "Reliable Post hoc Explanations: Modeling Uncertainty in Explainability." Advances in Neural Information Processing Systems (NeurIPS) 34 (2021).
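To see why uncertainty matters here: perturbation-based explanations depend on random sampling, so rerunning the same method on the same input yields different attributions. The sketch below exposes that spread by simple resampling; the paper itself takes a Bayesian approach, for which this is only a crude stand-in.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import Ridge

rng = np.random.default_rng(4)
X = rng.normal(size=(400, 3))
y = (X[:, 0] * X[:, 1] > 0).astype(int)
model = GradientBoostingClassifier(random_state=0).fit(X, y)

def surrogate_coefs(x, seed, n=200, scale=0.3):
    """One run of a perturbation-based linear surrogate around x."""
    r = np.random.default_rng(seed)
    Z = x + scale * r.normal(size=(n, x.size))
    return Ridge().fit(Z, model.predict_proba(Z)[:, 1]).coef_

runs = np.array([surrogate_coefs(X[0], seed) for seed in range(30)])
print("mean attribution:", runs.mean(axis=0))
print("std per feature: ", runs.std(axis=0))  # run-to-run instability
```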
      • 2021
      • Article

      Learning Models for Actionable Recourse

      By: Alexis Ross, Himabindu Lakkaraju and Osbert Bastani
      As machine learning models are increasingly deployed in high-stakes domains such as legal and financial decision-making, there has been growing interest in post-hoc methods for generating counterfactual explanations. Such explanations provide individuals adversely...
      Keywords: Machine Learning Models; Recourse; Algorithm; Mathematical Methods
      Ross, Alexis, Himabindu Lakkaraju, and Osbert Bastani. "Learning Models for Actionable Recourse." Advances in Neural Information Processing Systems (NeurIPS) 34 (2021).
      • 2021
      • Article

      Towards the Unification and Robustness of Perturbation and Gradient Based Explanations

      By: Sushant Agarwal, Shahin Jabbari, Chirag Agarwal, Sohini Upadhyay, Steven Wu and Himabindu Lakkaraju
      As machine learning black boxes are increasingly being deployed in critical domains such as healthcare and criminal justice, there has been a growing emphasis on developing techniques for explaining these black boxes in a post hoc manner. In this work, we analyze two...
      Keywords: Machine Learning; Black Box Explanations; Decision Making; Forecasting and Prediction; Information Technology
      Agarwal, Sushant, Shahin Jabbari, Chirag Agarwal, Sohini Upadhyay, Steven Wu, and Himabindu Lakkaraju. "Towards the Unification and Robustness of Perturbation and Gradient Based Explanations." Proceedings of the International Conference on Machine Learning (ICML) 38th (2021).
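A flavor of how gradient- and perturbation-based explanations meet: averaging input-gradients over noisy copies of the input (as SmoothGrad does) is itself a perturbation procedure, and constructions of this kind are what this line of work formalizes. A minimal sketch with an untrained stand-in model:

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(8, 2))  # untrained stand-in classifier

def smoothgrad(model, x, target, n=50, sigma=0.1):
    """Average the target-logit input-gradient over Gaussian perturbations."""
    grads = torch.zeros_like(x)
    for _ in range(n):
        x_noisy = (x + sigma * torch.randn_like(x)).requires_grad_(True)
        logit = model(x_noisy[None])[0, target]
        (g,) = torch.autograd.grad(logit, x_noisy)
        grads += g
    return grads / n

x = torch.randn(8)
print(smoothgrad(model, x, target=0))
```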
