I am a research associate at the Johns Hopkins University Human Language Technology Center of Excellence (HLTCOE), where I focus on enabling human annotation of natural language data and improving natural language processing software infrastructure. I earned my PhD in computer science from Johns Hopkins University, writing my dissertation on the relationship between theory and practice in topic modeling. I have a bachelor’s in mathematics from Harvey Mudd College, and between college and grad school I performed research and software engineering at Pacific Northwest National Laboratory. My pronouns are she/her.
Software
Selected software I’ve contributed to:
Concrete-python
A Python interface to the Concrete communication protocol for
annotated text.
Concrete-js
A JavaScript/Node.js interface to Concrete.
Apache Thrift
A protocol for cross-language, cross-platform services. Used by
Concrete.
Turkle
A Python (Django) web interface for crowd work. Largely compatible
with Mechanical Turk formats.
Commented word2vec
The word2vec word representation learning C code, commented.
Publications
Selected publications I’ve contributed to:
Topic Modeling in Theory and Practice
Chandler May
Doctoral dissertation, Johns Hopkins University, 2022
You Keep Using That Word: Ways of Thinking about Gender in Computing Research
Os Keyes, Chandler May, and Annabelle Carrell
Proceedings of the ACM on Human-Computer Interaction, 2021
On Measuring Social Biases in Sentence Encoders
Chandler May, Alex Wang, Shikha Bordia, Samuel R. Bowman, and
Rachel Rudinger
Proceedings of the 2019 Conference of the North American Chapter of the
Association for Computational Linguistics, 2019
code
Streaming Word Embeddings with the Space-Saving Algorithm
Chandler May, Kevin Duh, Benjamin Van Durme, and Ashwin Lall
Preprint, 2017
code
Social Bias in Elicited Natural Language Inferences
Rachel Rudinger, Chandler May, and Benjamin Van Durme
Proceedings of the First ACL Workshop on Ethics in Natural Language Processing, 2017
code
An Analysis of Lemmatization on Topic Models of Morphologically Rich Language
Chandler May, Ryan Cotterell, and Benjamin Van Durme
Preprint, 2016
code
Topic Identification and Discovery on Text and Speech
Chandler May, Francis Ferraro, Alan McCree, Jonathan Wintrode,
Daniel Garcia-Romero, and Benjamin Van Durme
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015
Particle Filter Rejuvenation and Latent Dirichlet Allocation
Chandler May, Alex Clemmer, and Benjamin Van Durme
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
code