Filter Results:
(15)
Show Results For
- All HBS Web (19)
- Faculty Publications (10)
Show Results For
- All HBS Web (19)
- Faculty Publications (10)
Page 1 of 15
Results
Sort by
- 2016
- Article
Penalized Fast Subset Scanning
By: Skyler Speakman, Sriram Somanchi, Edward McFowland III and Daniel B. Neill
We present the penalized fast subset scan (PFSS), a new and general framework for scalable and accurate pattern detection. PFSS enables exact and efficient identification of the most anomalous subsets of the data, as measured by a likelihood ratio scan statistic.... View Details
Keywords: Disease Surveillance; Likelihood Ratio Statistic; Pattern Detection; Scan Statistic; Mathematical Methods
Speakman, Skyler, Sriram Somanchi, Edward McFowland III, and Daniel B. Neill. "Penalized Fast Subset Scanning." Journal of Computational and Graphical Statistics 25, no. 2 (2016): 382–404. (Selected for “Best of JCGS” invited session by the journal’s editor in chief.)
- Article
Fast Subset Scan for Multivariate Spatial Biosurveillance
By: Daniel B. Neill, Edward McFowland III and Huanian Zheng
We present new subset scan methods for multivariate event detection in massive space-time datasets. We extend the recently proposed 'fast subset scan' framework from univariate to multivariate data, enabling computationally efficient detection of irregular space-time... View Details
Neill, Daniel B., Edward McFowland III, and Huanian Zheng. "Fast Subset Scan for Multivariate Spatial Biosurveillance." Statistics in Medicine 32, no. 13 (June 15, 2013): 2185–2208.
- 2022
- Article
Nonparametric Subset Scanning for Detection of Heteroscedasticity
By: Charles R. Doss and Edward McFowland III
We propose Heteroscedastic Subset Scan (HSS), a novel method for identifying covariates that are responsible for violations of the homoscedasticity assumption in regression settings. Viewing the problem as one of anomalous pattern detection, we use subset scanning... View Details
Doss, Charles R., and Edward McFowland III. "Nonparametric Subset Scanning for Detection of Heteroscedasticity." Journal of Computational and Graphical Statistics 31, no. 3 (2022): 813–823.
- Article
Fast Generalized Subset Scan for Anomalous Pattern Detection
By: Edward McFowland III, Skyler Speakman and Daniel B. Neill
We propose Fast Generalized Subset Scan (FGSS), a new method for detecting anomalous patterns in general categorical data sets. We frame the pattern detection problem as a search over subsets of data records and attributes, maximizing a nonparametric scan statistic... View Details
Keywords: Pattern Detection; Anomaly Detection; Knowledge Discovery; Bayesian Networks; Scan Statistics; Analytics and Data Science
McFowland III, Edward, Skyler Speakman, and Daniel B. Neill. "Fast Generalized Subset Scan for Anomalous Pattern Detection." Art. 12. Journal of Machine Learning Research 14 (2013): 1533–1561.
- January 2024
- Article
Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics
By: Rui Cao, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector and Tianzhong Yang
Multi-trait analysis has been shown to have greater statistical power than single-trait analysis. Most of the existing multi-trait analysis methods only work with a limited number of traits and usually prioritize high statistical power over identifying relevant traits,... View Details
Cao, Rui, Evan Olawsky, Edward McFowland III, Erin Marcotte, Logan Spector, and Tianzhong Yang. "Subset Scanning for Multi-Trait Analysis Using GWAS Summary Statistics." Bioinformatics 40, no. 1 (January 2024).
- November 2021
- Article
Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data
By: William Herlands, Edward McFowland III, Andrew Gordon Wilson and Daniel B. Neill
Identifying anomalous patterns in real-world data is essential for understanding where, when, and how systems deviate from their expected dynamics. Yet methods that separately consider the anomalousness of each individual data point have low detection power for subtle,... View Details
Herlands, William, Edward McFowland III, Andrew Gordon Wilson, and Daniel B. Neill. "Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data." Proceedings of Machine Learning Research (PMLR) 84 (2018): 425–434. (Also presented at the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), 2018.)
- Article
Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error
By: Celia Cintas, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan and Edward McFowland III
Reliably detecting attacks in a given set of inputs is of high practical relevance because of the vulnerability of neural networks to adversarial examples. These altered inputs create a security risk in applications with real-world consequences, such as self-driving... View Details
Keywords: Autoencoder Networks; Pattern Detection; Subset Scanning; Computer Vision; Statistical Methods And Machine Learning; Machine Learning; Deep Learning; Data Mining; Big Data; Large-scale Systems; Mathematical Methods; Analytics and Data Science
Cintas, Celia, Skyler Speakman, Victor Akinwande, William Ogallo, Komminist Weldemariam, Srihari Sridharan, and Edward McFowland III. "Detecting Adversarial Attacks via Subset Scanning of Autoencoder Activations and Reconstruction Error." Proceedings of the International Joint Conference on Artificial Intelligence 29th (2020).
- 2015
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
Speakman, Skyler, Edward McFowland III, and Daniel B. Neill. "Scalable Detection of Anomalous Patterns With Connectivity Constraints." Journal of Computational and Graphical Statistics 24, no. 4 (2015): 1014–1033.
- 2011
- Article
Scalable Detection of Anomalous Patterns With Connectivity Constraints
By: Skyler Speakman, Edward McFowland III and Daniel B. Neill
We present GraphScan, a novel method for detecting arbitrarily shaped connected clusters in graph or network data. Given a graph structure, data observed at each node, and a score function defining the anomalousness of a set of nodes, GraphScan can efficiently and... View Details
- 2023
- Working Paper
Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection
By: Edward McFowland III, Sriram Somanchi and Daniel B. Neill
In the recent literature on estimating heterogeneous treatment effects, each proposed method makes its own set of restrictive assumptions about the intervention’s effects and which subpopulations to explicitly estimate. Moreover, the majority of the literature provides... View Details
Keywords: Causal Inference; Program Evaluation; Algorithms; Distributional Average Treatment Effect; Treatment Effect Subset Scan; Heterogeneous Treatment Effects
McFowland III, Edward, Sriram Somanchi, and Daniel B. Neill. "Efficient Discovery of Heterogeneous Quantile Treatment Effects in Randomized Experiments via Anomalous Pattern Detection." Working Paper, 2023.
- 30 May 2024
- Research & Ideas
Racial Bias Might Be Infecting Patient Portals. Can AI Help?
registered nurses. The statistical evidence suggests that medical teams tended to prioritize messages from white patients, says Ariel Stern, a visiting professor at Harvard Business School and one of the study’s authors. As mobile... View Details
- 07 Feb 2023
- Research & Ideas
Supervisor of Sandwiches? More Companies Inflate Titles to Avoid Extra Pay
If it seems like everyone is a manager these days, you may be onto something. Not only is there a profusion of assistant managers, there are also now carpet shampoo and food cart managers, directors of first impressions, assistant bingo managers, and price View Details
Keywords: by Scott Van Voorhis
- 24 Jun 2013
- Research & Ideas
Is Your iPhone Turning You Into a Wimp?
What kind of a device are you using to read this article? And what does your body posture look like? Are you hunching over a smartphone screen, arms tightly at your side? Are you slouching over an iPad or laptop? Or are you stretched out comfortably in an office chair,... View Details
- 03 Jun 2020
- Research & Ideas
Who Guarantees Your Workplace Is Safe for Return?
distancing markers on the floor or Plexiglass shields in elevators or thermal scanning when people come to work or shop? Will employees consistently stand too close or not cover their mouth and nose when sneezing? Employers will want to... View Details
- 16 Nov 2010
- Lessons from the Classroom
Data.gov: Matching Government Data with Rapid Innovation
use. It may cut down on the volume of requests that local agencies need to field on a day-to-day basis. Participants also probed questions of concern: Political window dressing: Will Data.gov release controversial datasets or will it favor uncontroversial information... View Details