Filter Results:
(35)
Show Results For
- All HBS Web (42)
- Faculty Publications (18)
Show Results For
- All HBS Web (42)
- Faculty Publications (18)
Page 1 of 35
Results →
Sort by
- Article
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
We present new subset scan methods for multivariate event detection in massive space-time datasets. We extend the recently proposed 'fast subset scan' framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time... View Details
Neill, Daniel B., Edward McFowland III, and Huanian Zheng. "Fast Subset Scan for Multivariate Spatial Biosurveillance." Statistics in Medicine 32, no. 13 (June 15, 2013): 2185–2208.
- 2016
- Article
Penalized Fast Subset Scanning
By: Skyler Speakman, Sriram Somanchi, Edward McFowland III and Daniel B. Neill
We present the penalized fast subset scan (PFSS), a new and general framework for scalable and accurate pattern detection. PFSS enables exact and efficient identification of the most anomalous subsets of the data, as measured by a likelihood ratio scan statistic.... View Details
Keywords: Disease Surveillance; Likelihood Ratio Statistic; Pattern Detection; Scan Statistic; Mathematical Methods
Speakman, Skyler, Sriram Somanchi, Edward McFowland III, and Daniel B. Neill. "Penalized Fast Subset Scanning." Journal of Computational and Graphical Statistics 25, no. 2 (2016): 382–404. (Selected for “Best of JCGS” invited session by the journal’s editor in chief.)
- Article
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
We extend the recently proposed ‘fast subset scan’ framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time clusters even when the numbers of spatial locations and data streams are large. These fast algorithms... View Details
- 2022
- Article
Nonparametric Subset Scanning for Detection of Heteroscedasticity
By: Charles R. Doss and Edward McFowland III
We propose Heteroscedastic Subset Scan (HSS), a novel method for identifying covariates that are responsible for violations of the homoscedasticity assumption in regression settings. Viewing the problem as one of anomalous pattern detection, we use subset scanning... View Details
Doss, Charles R., and Edward McFowland III. "Nonparametric Subset Scanning for Detection of Heteroscedasticity." Journal of Computational and Graphical Statistics 31, no. 3 (2022): 813–823.
- Article
Fast Generalized Subset Scan for Anomalous Pattern Detection
By: Edward McFowland III, Skyler Speakman and Daniel B. Neill
We propose Fast Generalized Subset Scan (FGSS), a new method for detecting anomalous patterns in general categorical data sets. We frame the pattern detection problem as a search over subsets of data records and attributes, maximizing a nonparametric scan statistic... View Details
Keywords: Pattern Detection; Anomaly Detection; Knowledge Discovery; Bayesian Networks; Scan Statistics; Analytics and Data Science
McFowland III, Edward, Skyler Speakman, and Daniel B. Neill. "Fast Generalized Subset Scan for Anomalous Pattern Detection." Art. 12. Journal of Machine Learning Research 14 (2013): 1533–1561.
- 2015
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
Speakman, Skyler, Edward McFowland III, and Daniel B. Neill. "Scalable Detection of Anomalous Patterns With Connectivity Constraints." Journal of Computational and Graphical Statistics 24, no. 4 (2015): 1014–1033.
- 2011
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
- January 2024
- Article
Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics
By: Rui Cao, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector and Tianzhong Yang
Multi-trait analysis has been shown to have greater statistical power than single-trait analysis. Most of the existing multi-trait analysis methods only work with a limited number of traits and usually prioritize high statistical power over identifying relevant traits,... View Details
Cao, Rui, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector, and Tianzhong Yang. "Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics." Bioinformatics 40, no. 1 (January 2024).
- November 2021
- Article
Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data
By: William Herlands, Edward McFowland III, Andrew Gordon Wilson and Daniel B. Neill
Identifying anomalous patterns in real-world data is essential for understanding where, when, and how systems deviate from their expected dynamics. Yet methods that separately consider the anomalousness of each individual data point have low detection power for subtle,... View Details
Herlands, William, Edward McFowland III, Andrew Gordon Wilson, and Daniel B. Neill. "Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data." Proceedings of Machine Learning Research (PMLR) 84 (2018): 425–434. (Also presented at the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018.)
- Article
Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error
By: Celia Cintas, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan and Edward McFowland III
Reliably detecting attacks in a given set of inputs is of high practical relevance because of the vulnerability of neural networks to adversarial examples. These altered inputs create a security risk in applications with real-world consequences, such as self-driving... View Details
Keywords: Autoencoder Networks; Pattern Detection; Subset Scanning; Computer Vision; Statistical Methods And Machine Learning; Machine Learning; Deep Learning; Data Mining; Big Data; Large-scale Systems; Mathematical Methods; Analytics and Data Science
Cintas, Celia, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan, and Edward McFowland III. "Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error." Proceedings of the International Joint Conference on Artificial Intelligence 29th (2020).
- November 1996
- Article
Localized Autocorrelation Diagnostic Statistic for Sociological Models: Times-series, Network, and Spatial Datasets
By: C. I. Nass and Y. Moon
Nass, C. I., and Y. Moon. "Localized Autocorrelation Diagnostic Statistic for Sociological Models: Times-series, Network, and Spatial Datasets." Sociological Methods & Research 25, no. 2 (November 1996): 223–247.
- Article
Statistical Physics of Human Cooperation
By: Matjaž Perc, Jillian J. Jordan, David G. Rand, Zhen Wang, Stefano Boccaletti and Attila Szolnoki
Extensive cooperation among unrelated individuals is unique to humans, who often sacrifice personal benefits for the common good and work together to achieve what they are unable to execute alone. The evolutionary success of our species is indeed due, to a large... View Details
Keywords: Human Cooperation; Evolutionary Game Theory; Public Goods; Reward; Punishment; Tolerance; Self-organization; Pattern Formation; Cooperation; Behavior; Game Theory
Perc, Matjaž, Jillian J. Jordan, David G. Rand, Zhen Wang, Stefano Boccaletti, and Attila Szolnoki. "Statistical Physics of Human Cooperation." Physics Reports 687 (May 8, 2017): 1–51.
- 2023
- Working Paper
Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection
By: Edward McFowland III, Sriram Somanchi and Daniel B. Neill
In the recent literature on estimating heterogeneous treatment effects, each proposed method makes its own set of restrictive assumptions about the intervention’s effects and which subpopulations to explicitly estimate. Moreover, the majority of the literature provides... View Details
Keywords: Causal Inference; Program Evaluation; Algorithms; Distributional Average Treatment Effect; Treatment Effect Subset Scan; Heterogeneous Treatment Effects
McFowland III, Edward, Sriram Somanchi, and Daniel B. Neill. "Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection." Working Paper, 2023.
- May 2017
- Article
Agent-based Modeling: A Guide for Social Psychologists
By: Joshua Conrad Jackson, David Rand, Kevin Lewis, Michael I. Norton and Kurt Gray
Agent-based modeling is a longstanding but underused method that allows researchers to simulate artificial worlds for hypothesis testing and theory building. Agent-based models (ABMs) offer unprecedented control and statistical power by allowing researchers to... View Details
Jackson, Joshua Conrad, David Rand, Kevin Lewis, Michael I. Norton, and Kurt Gray. "Agent-based Modeling: A Guide for Social Psychologists." Social Psychological & Personality Science 8, no. 4 (May 2017): 387–395.
- Research Summary
Overview
Over the last decade, technology companies like Amazon, Google, and Netflix have pioneered data-driven research and development processes centered on massive experimentation. However, as companies increase the breadth and scale of their experiments to millions of... View Details
- Research Summary
Overview
By: Iavor I. Bojinov
Over the last decade, technology companies like Amazon, Google, and Netflix have pioneered data-driven research and development processes centered on massive experimentation. However, as companies increase the breadth and scale of their experiments to millions of... View Details
- 07 Feb 2023
- Research & Ideas
Supervisor of Sandwiches? More Companies Inflate Titles to Avoid Extra Pay
If it seems like everyone is a manager these days, you may be onto something. Not only is there a profusion of assistant managers, there are also now carpet shampoo and food cart managers, directors of first impressions, assistant bingo managers, and price View Details
Keywords: by Scott Van Voorhis
- 30 May 2024
- Research & Ideas
Racial Bias Might Be Infecting Patient Portals. Can AI Help?
registered nurses. The statistical evidence suggests that medical teams tended to prioritize messages from white patients, says Ariel Stern, a visiting professor at Harvard Business School and one of the study’s authors. As mobile... View Details
- 11 Jun 2018
- Research & Ideas
Why South Korea's Samsung Built the Only Outdoor Skating Rink in Texas
spatial detail how Walmart advertises in Akron versus Toledo versus L.A.,” Cohen says. Sure enough, they found that advertising dollars increased in local areas by an average of 23 percent after a lawsuit was filed there. In addition, the... View Details
- 22 Feb 2018
- Book
The New History of American Capitalism
communicate reality. Timothy Mitchell, for example, explores how the parameters of economic expertise shape the questions the discipline investigates, while others have scrutinized how data and statistics come to represent the... View Details
Keywords: Manufacturing