
Publications

Show Results For

  • All HBS Web  (19)
    • News  (5)
    • Research  (11)
  • Faculty Publications  (12)

Page 1 of 19 Results
  • October–December 2022
  • Article

Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem

By: Mochen Yang, Edward McFowland III, Gordon Burtch and Gediminas Adomavicius
Combining machine learning with econometric analysis is becoming increasingly prevalent in both research and practice. A common empirical strategy involves the application of predictive modeling techniques to "mine" variables of interest from available data, followed... View Details
Keywords: Machine Learning; Econometric Analysis; Instrumental Variable; Random Forest; Causal Inference; AI and Machine Learning; Forecasting and Prediction
Yang, Mochen, Edward McFowland III, Gordon Burtch, and Gediminas Adomavicius. "Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem." INFORMS Journal on Data Science 1, no. 2 (October–December 2022): 138–155.
  • August 2018 (Revised September 2018)
  • Supplement

LendingClub (B): Decision Trees & Random Forests

By: Srikant M. Datar and Caitlin N. Bowler
This case builds directly on the LendingClub (A) case. In this case, students follow Emily Figel as she builds two tree-based models using historical LendingClub data to predict, with some probability, whether a borrower will repay or default on the loan.... View Details
Keywords: Data Science; Data Analytics; Decision Trees; Investment; Financing and Loans; Analytics and Data Science; Analysis; Forecasting and Prediction
Datar, Srikant M., and Caitlin N. Bowler. "LendingClub (B): Decision Trees & Random Forests." Harvard Business School Supplement 119-021, August 2018. (Revised September 2018.)

    • Article

    Eliminating Unintended Bias in Personalized Policies Using Bias-Eliminating Adapted Trees (BEAT)

    By: Eva Ascarza and Ayelet Israeli

    An inherent risk of algorithmic personalization is disproportionate targeting of individuals from certain groups (or demographic characteristics such as gender or race), even when the decision maker does not intend to discriminate based on those “protected”... View Details

    Keywords: Algorithm Bias; Personalization; Targeting; Generalized Random Forests (GRF); Discrimination; Customization and Personalization; Decision Making; Fairness; Mathematical Methods
    Ascarza, Eva, and Ayelet Israeli. "Eliminating Unintended Bias in Personalized Policies Using Bias-Eliminating Adapted Trees (BEAT)." Proceedings of the National Academy of Sciences 119, no. 11 (March 8, 2022): e2115126119.
    • 2020
    • Working Paper

    Machine Learning for Pattern Discovery in Management Research

    By: Prithwiraj Choudhury
    Supervised machine learning (ML) methods are a powerful toolkit for discovering robust patterns in quantitative data. The patterns identified by ML could be used as an observation for further inductive or abductive research, but should not be treated as the result of a... View Details
    Keywords: Machine Learning; Theory Building; Induction; Decision Trees; Random Forests; K-nearest Neighbors; Neural Network; P-hacking; Analytics and Data Science; Analysis
    Choudhury, Prithwiraj, Ryan Allen, and Michael G. Endres. "Machine Learning for Pattern Discovery in Management Research." Harvard Business School Working Paper, No. 19-032, September 2018. (Revised June 2020.)
    • January 2021
    • Article

    Machine Learning for Pattern Discovery in Management Research

    By: Prithwiraj Choudhury, Ryan Allen and Michael G. Endres
    Supervised machine learning (ML) methods are a powerful toolkit for discovering robust patterns in quantitative data. The patterns identified by ML could be used for exploratory inductive or abductive research, or for post-hoc analysis of regression results to detect... View Details
    Keywords: Machine Learning; Supervised Machine Learning; Induction; Abduction; Exploratory Data Analysis; Pattern Discovery; Decision Trees; Random Forests; Neural Networks; ROC Curve; Confusion Matrix; Partial Dependence Plots; AI and Machine Learning
    Choudhury, Prithwiraj, Ryan Allen, and Michael G. Endres. "Machine Learning for Pattern Discovery in Management Research." Strategic Management Journal 42, no. 1 (January 2021): 30–57.
    • Awards

    Runner-Up for Best Paper Award, INFORMS Workshop on Data Science, 2018

    By: Edward McFowland III
    Runner-up for the 2018 Best Paper Award at the INFORMS Workshop on Data Science for "Using Data-Mined Variables in Causal Inference Tasks: A Random Forest Approach to the Measurement Error Problem," with Mochen Yang, Gordon Burtch, and Gediminas Adomavicius. View Details

      • November 2022
      • Article

      A Language-Based Method for Assessing Symbolic Boundary Maintenance between Social Groups

      By: Anjali M. Bhatt, Amir Goldberg and Sameer B. Srivastava
      When the social boundaries between groups are breached, the tendency for people to erect and maintain symbolic boundaries intensifies. Drawing on extant perspectives on boundary maintenance, we distinguish between two strategies that people pursue in maintaining... View Details
      Keywords: Culture; Machine Learning; Natural Language Processing; Symbolic Boundaries; Organizations; Boundaries; Social Psychology; Interpersonal Communication; Organizational Culture
      Bhatt, Anjali M., Amir Goldberg, and Sameer B. Srivastava. "A Language-Based Method for Assessing Symbolic Boundary Maintenance between Social Groups." Sociological Methods & Research 51, no. 4 (November 2022): 1681–1720.
      • 2021
      • Working Paper

      An Empirical Study of Time Allotment and Delays in E-commerce Delivery

      By: M. Balakrishnan, MoonSoo Choi and Natalie Epstein
      Problem definition: We study how having more time allotted to deliver an order affects the speed of the delivery process. Furthermore, we seek to predict orders that are likely to be delayed early in the delivery process so that actions can be taken to avoid delays.... View Details
      Keywords: Logistics; E-commerce; Mathematical Methods; AI and Machine Learning; Performance Productivity
      Balakrishnan, M., MoonSoo Choi, and Natalie Epstein. "An Empirical Study of Time Allotment and Delays in E-commerce Delivery." Working Paper, December 2021.
      • 01 Mar 2024
      • News

      Alumni and Faculty Books and Podcasts

      Edited by Margie Kelley. Alumni Books: You Got This! A Straightforward, No-Nonsense Playbook for Crushing 130+ Workplace Challenges, by Heidi Abelli (MBA 1993), Palmetto Publishing. Stepping into the corporate world can feel like navigating a labyrinth, especially when... View Details
      Keywords: Publishing Industries (except Internet); Information
      • 12 Jan 2023
      • News

      ‘Debiasing’ Debt with Data

      records on borrowers, and put the data through a random forest regression, a type of analysis known for producing good predictions that can be readily understood. When they tested the resulting model, the... View Details
      By: Ralph Ranalli
      • Web

      Technology & Operations Management Awards & Honors - Faculty & Research

      Award at the INFORMS Workshop on Data Science for "Using Data-Mined Variables in Causal Inference Tasks: A Random Forest Approach to the Measurement Error Problem" with Mochen Yang, Gordon Burtch, and... View Details
      • 09 Sep 2013
      • Lessons from the Classroom

      Teaching Climate Change to Skeptics

      A few years ago, Joseph B. Lassiter traveled to San Francisco, Houston, and New York to hold discussions with Harvard alumni on the topic of business and the environment. Each time, he surveyed the audience about the touchy subject of climate change and how society... View Details
      By: Carmen Nobel
      • 08 Jan 2008
      • First Look

      First Look: January 8, 2008

      estimating the variance of demand using dispersion among experts' forecasts and scale. We test this methodology using three datasets, demand data at item level, sales data at firm level for retailers, and sales data at firm level for manufacturers. We show that the... View Details
      By: Martha Lagace
      • 04 Dec 2018
      • First Look

      New Research and Ideas, December 4, 2018

      follows Figel as she dives into the data to use it to build a model. Purchase this case: https://hbsp.harvard.edu/product/119020-PDF-ENG Harvard Business School Case 119-021 LendingClub (B): Decision Trees & Random... View Details
      By: Dina Gerdeman