Filter Results:
(12)
Show Results For
- All HBS Web
(15)
- Faculty Publications (12)
Show Results For
- All HBS Web
(15)
- Faculty Publications (12)
Page 1 of 12
Results
- January 2024
- Article
Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics
By: Rui Cao, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector and Tianzhong Yang
Multi-trait analysis has been shown to have greater statistical power than single-trait analysis. Most of the existing multi-trait analysis methods only work with a limited number of traits and usually prioritize high statistical power over identifying relevant traits,... View Details
Cao, Rui, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector, and Tianzhong Yang. "Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics." Bioinformatics 40, no. 1 (January 2024).
- 2022
- Article
Nonparametric Subset Scanning for Detection of Heteroscedasticity
By: Charles R. Doss and Edward McFowland III
We propose Heteroscedastic Subset Scan (HSS), a novel method for identifying covariates that are responsible for violations of the homoscedasticity assumption in regression settings. Viewing the problem as one of anomalous pattern detection, we use subset scanning... View Details
Doss, Charles R., and Edward McFowland III. "Nonparametric Subset Scanning for Detection of Heteroscedasticity." Journal of Computational and Graphical Statistics 31, no. 3 (2022): 813–823.
- Article
Pattern Detection in the Activation Space for Identifying Synthesized Content
By: Celia Cintas, Skyler Speakman, Girmaw Abebe Tadesse, Victor Akinwande, Edward McFowland III and Komminist Weldemariam
Generative Adversarial Networks (GANs) have recently achieved unprecedented success in photo-realistic image synthesis from low-dimensional random noise. The ability to synthesize high-quality content at a large scale brings potential risks as the generated samples may... View Details
Cintas, Celia, Skyler Speakman, Girmaw Abebe Tadesse, Victor Akinwande, Edward McFowland III, and Komminist Weldemariam. "Pattern Detection in the Activation Space for Identifying Synthesized Content." Pattern Recognition Letters 153 (January 2022): 207–213.
- Article
Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error
By: Celia Cintas, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan and Edward McFowland III
Reliably detecting attacks in a given set of inputs is of high practical relevance because of the vulnerability of neural networks to adversarial examples. These altered inputs create a security risk in applications with real-world consequences, such as self-driving... View Details
Keywords: Autoencoder Networks; Pattern Detection; Subset Scanning; Computer Vision; Statistical Methods And Machine Learning; Machine Learning; Deep Learning; Data Mining; Big Data; Large-scale Systems; Mathematical Methods; Analytics and Data Science
Cintas, Celia, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan, and Edward McFowland III. "Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error." Proceedings of the International Joint Conference on Artificial Intelligence 29th (2020).
- November 2021
- Article
Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data
By: William Herlands, Edward McFowland III, Andrew Gordon Wilson and Daniel B. Neill
Identifying anomalous patterns in real-world data is essential for understanding where, when, and how systems deviate from their expected dynamics. Yet methods that separately consider the anomalousness of each individual data point have low detection power for subtle,... View Details
Herlands, William, Edward McFowland III, Andrew Gordon Wilson, and Daniel B. Neill. "Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data." Proceedings of Machine Learning Research (PMLR) 84 (2018): 425–434. (Also presented at the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018.)
- 2023
- Working Paper
Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection
By: Edward McFowland III, Sriram Somanchi and Daniel B. Neill
In the recent literature on estimating heterogeneous treatment effects, each proposed method makes its own set of restrictive assumptions about the intervention’s effects and which subpopulations to explicitly estimate. Moreover, the majority of the literature provides... View Details
Keywords: Causal Inference; Program Evaluation; Algorithms; Distributional Average Treatment Effect; Treatment Effect Subset Scan; Heterogeneous Treatment Effects
McFowland III, Edward, Sriram Somanchi, and Daniel B. Neill. "Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection." Working Paper, 2023.
- 2016
- Article
Penalized Fast Subset Scanning
By: Skyler Speakman, Sriram Somanchi, Edward McFowland III and Daniel B. Neill
We present the penalized fast subset scan (PFSS), a new and general framework for scalable and accurate pattern detection. PFSS enables exact and efficient identification of the most anomalous subsets of the data, as measured by a likelihood ratio scan statistic.... View Details
Keywords: Disease Surveillance; Likelihood Ratio Statistic; Pattern Detection; Scan Statistic; Mathematical Methods
Speakman, Skyler, Sriram Somanchi, Edward McFowland III, and Daniel B. Neill. "Penalized Fast Subset Scanning." Journal of Computational and Graphical Statistics 25, no. 2 (2016): 382–404. (Selected for “Best of JCGS” invited session by the journal’s editor in chief.)
- 2015
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
Speakman, Skyler, Edward McFowland III, and Daniel B. Neill. "Scalable Detection of Anomalous Patterns With Connectivity Constraints." Journal of Computational and Graphical Statistics 24, no. 4 (2015): 1014–1033.
- Article
Fast Generalized Subset Scan for Anomalous Pattern Detection
By: Edward McFowland III, Skyler Speakman and Daniel B. Neill
We propose Fast Generalized Subset Scan (FGSS), a new method for detecting anomalous patterns in general categorical data sets. We frame the pattern detection problem as a search over subsets of data records and attributes, maximizing a nonparametric scan statistic... View Details
Keywords: Pattern Detection; Anomaly Detection; Knowledge Discovery; Bayesian Networks; Scan Statistics; Analytics and Data Science
McFowland III, Edward, Skyler Speakman, and Daniel B. Neill. "Fast Generalized Subset Scan for Anomalous Pattern Detection." Art. 12. Journal of Machine Learning Research 14 (2013): 1533–1561.
- Article
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
We present new subset scan methods for multivariate event detection in massive space-time datasets. We extend the recently proposed 'fast subset scan' framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time... View Details
Neill, Daniel B., Edward McFowland III, and Huanian Zheng. "Fast Subset Scan for Multivariate Spatial Biosurveillance." Statistics in Medicine 32, no. 13 (June 15, 2013): 2185–2208.
- 2011
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
- Article
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
We extend the recently proposed ‘fast subset scan’ framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time clusters even when the numbers of spatial locations and data streams are large. These fast algorithms... View Details