Aryaman Arora

/ˈäːɾjəmən əˈɾoːɾäː/ Sophomore, Georgetown University

Hi, I’m Aryaman! I’m an undergrad at Georgetown University studying computer science and linguistics. I do computational linguistics research as a member of NERT, Nathan Schneider’s research group at Georgetown. I’m excited about making technologies for South Asian languages and doing linguistic work on Indo-Aryan languages.

This summer, I am a research intern at ETH Zürich with Ryan Cotterell.

I love learning languages: besides being a native speaker of Hindi and English, I’m comfortable with Urdu and Punjabi and am actively learning Mandarin and Sanskrit. I’m an admin on the English Wiktionary, where I manage South Asian language documentation. I also run a South Asian linguistics reading group, Moḻi-maṇḍala.

I like to bicycle (though not very well), read (in English and Hindi, but the latter not very well), and eat unreasonable amounts of food relative to my size. I’d also say I like to go places, but we’re in the middle of a pandemic. With my friend Ananya Gulati, I founded Washingtutors, an online tutoring non-profit for students in the D.C. Public School system (in which I spent my best years of high school, at the School Without Walls).

Check out my CV.


Jul 1, 2021 Started my internship at ETH Zürich with Ryan Cotterell!
Feb 15, 2021 I’m presenting “SNACS Annotation of Case Markers and Adpositions in Hindi” at SCiL 2021.
Nov 16, 2020 Attending EMNLP and presenting at SIGTYP.
Aug 27, 2020 My research is in the news! “First-Year Student Presented Paper at Prestigious Computational Linguistics Conference”
Aug 3, 2020 Starting undergrad at Georgetown.

selected publications

  1. Bhāṣācitra: Visualising the dialect geography of South Asia
    Aryaman Arora, Adam Farris, Gopalakrishnan R, Samopriya Basu
    In Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change, 2021
  2. SNACS Annotation of Case Markers and Adpositions in Hindi
    In Proceedings of the Society for Computation in Linguistics, 2021
  3. Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi
    Aryaman Arora, Luke Gessler, Nathan Schneider
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020