Dorottya Demszky

Google Scholar | Github | LinkedIn | CV

Hi! I'm a third-year PhD candidate in the Linguistics Department at Stanford University, advised by Dan Jurafsky. I'm an enthusiastic member of the Stanford NLP group.

My primary research areas are computational semantics and pragmatics and computational social science. At a high-level, I'm excited about combining quantitative and qualitative approaches to studying and modeling social and cognitive processes mediated through language. More concretely, by building on –– and in turn, informing –– theories from pragmatics, psycholinguistics and sociolinguistics, I seek to create and improve quantitative methods that capture phenomena including and going beyond compositional meaning, such as presuppositionality, implicatures and social meaning. I'm invested in understanding how NLP can be put to the service of social good. I work on projects with applications to questions and issues within the domains of education and politics.

You can reach me at ddemszky [at] stanford [dot] edu. Feel free to call me Dora.

Content Analysis of Textbooks via Natural Language Processing: Novel Findings on Gender, Race, and Ethnicity in Texas US History Textbooks
Li Lucy*, Demszky*, D., Bromley, P., & Jurafsky, D. *equal contribution
American Education Research Association (AERA) Open Journal (2020)
2020 Education Data Science Conference Best Paper Award
[slides][code][Stanford HAI article]

GoEmotions: A Dataset of Fine-Grained Emotions
Demszky, D., Movshovitz-Attias, D., Ko, J. and Cowen, A., Nemade, G., & Ravi, S.
ACL 2020
[code & data]

The Role of Verb Semantics in Hungarian Verb-Object Order
Demszky, D., Kálmán, L., Jurafsky, D., & Levin, B.
arXiv preprint arXiv:2006.09432
[LSA 2021 slides]

Pártélet: A Hungarian Corpus of Propaganda Texts from the Hungarian Socialist Era
Kmetty, Z., Vincze, V., Demszky, D., Ring, O., Nagy, B., & Szabó, M. K.
LREC 2020

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings
Demszky, D., Garg, N., Voigt, R., Zou, J., Shapiro, J., Gentzkow, M. & Jurafsky, D.
ACL 2019
[code and data][NAACL slides][Stanford News, Washington Post]

Transforming Question Answering Datasets Into Natural Language Inference Datasets
Demszky, D.*, Guu, K.*, & Liang, P. *equal contribution
arXiv preprint arXiv:1809.02922.
In the summers of 2019 and 2020, I was a Research Intern at Google, where I got the chance to work on great teams on NLP projects.
Other Activities
In 2016, I co-founded a nonprofit organization, Tarisznya Alapítvány (Knapsack Foundation), the goal of which is to empower underprivileged children in Hungary through education. You can learn more on our website or on Facebook.