Welcome to my homepage ^_^

I am a fourth year PhD student in the Department of Computer Science at Stanford University. My advisor is Prof. Christopher Manning.

My home country is Vietnam, the S-shaped easternmost area on the Indochina Peninsula in Southeast Asia. My B.S degree was awarded by the National University of Singapore in 2009. During my time in Singapore, I was fortunate to be affiliated with the two research groups: WING group under Prof. Min-Yen Kan (Feb 2009 - Oct 2010), and Speech group under Prof. Khe Chai Sim (Nov 2010 - Aug 2011).

Here is my (often outdated) CV.


  • Oct 2014 — I had a fruitful summer internship at Google hosted by Quoc Le. Together with other great members in Google Brain team, Ilya Sutskever, Oriol Vinyals, Wojciech Zaremba, we built a neural machine translation that, for the first time, outperforms the state-of-the-art system in the English-to-French translation task. More details.

  • July 2013 — Our TACL'13 paper "Parsing entire discourses as very long strings: capturing topic continuity in grounded language learning" with Prof. Mark Johnson and Prof. Michael C. Frank is now available. This work is done in Fall, 2012 at Macquarie University, Australia.

  • June 2013 — Our CoNLL'13 paper "Better Word Representations with Recursive Neural Networks for Morphology" with Richard Socher and Prof. Christopher Manning is now available. See you in Sofia!

  • Apr 2012 — This Winter quarter, I am officially aligned with Prof. Christopher Manning, i.e., my advisor!

  • Jan 2012 — Happy new year, everyone! This Spring quarter, I am back to Gates, joining Prof. Christopher Manning's NLP group as part of the research rotation program.

  • Sep 2011 — This Fall quarter, I am having a research rotation with Prof. Noah D. Goodman, exploring computational models for human reading time prediction.

  • Aug 2011 — Paper accepted at ASRU'2011 "A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition" with Prof. Khe Chai Sim.

  • Apr 2011 — I will be joining the Stanford CS department this Fall as a PhD candidate (Sep 2011).


I am now mainly into deep learning models for NLP tasks, specifically machine translation. Occassionaly, I build parsers used for psycholinguistics applications :)

Before Stanford: I worked in many scholarly Digital Library projects – document structure discovery and keyphrase extraction. I also investigated in model-based techniques for robust Speech Recognition systems.



  • Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Wojciech Zaremba. 2014. Addressing the Rare Word Problem in Neural Machine Translation. ArXiv.
    [ Paper ]


  • Minh-Thang Luong, Richard Socher, and Christopher D. Manning. 2013. Better Word Representations with Recursive Neural Networks for Morphology. Conference on Computational Natural Language Learning (CoNLL’13). Sofia, Bulgaria.
    [ Paper ] [ Bib ] [ Dataset ] [ Project page for word vectors and other information. ]

  • Minh-Thang Luong, Michael C. Frank, and Mark Johnson. 2013. Parsing entire discourses as very long strings: Capturing topic continuity in grounded language learning. Transactions of the Association for Computational Linguistics (TACL’13).
    [ Paper ] [ Bib ] [ Code (Earleyx parser) ]

Before Stanford:

  • Khe Chai Sim and Minh-Thang Luong. 2011. A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition. Automatic Speech Recognition and Understanding (ASRU’11). Hawaii, USA. Oral presentation.
    [ Paper ]   [ Poster (.pdf) ]

  • Minh-Thang Luong, Thuy Dung Nguyen, and Min-Yen Kan. 2011. Logical Structure Recovery in Scholarly Articles with Rich Document Features. International Journal of Digital Library Systems (IJDLS) 1(4). pp 1-23. Invited paper.
    [ Paper ]   [ doi:10.4018/jdls.2010100101 ]


  • Minh-Thang Luong, Preslav Nakov and Min-Yen Kan. 2010. A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’10). Boston, Massachusetts, USA. pp.148-157. Oral presentation.
    [ Paper ]   [ ACL Anthology (D10-1015) ]

  • Minh-Thang Luong and Min-Yen Kan. 2010. Enhancing Morphological Alignment for Translating Highly Inflected Languages. In Proceedings of the International Conference on Computational Linguistics (COLING’10). Beijing, China. pp. 743-751. Oral presentation.
    [ Paper ]   [ ACL Anthology (C10-1084) ]

  • Thuy Dung Nguyen and Minh-Thang Luong. 2010. WINGNUS: Keyphrase Extraction Utilizing Document Logical Structure. In Proceedings of the ACL 2010 Workshop on Evaluation Exercises on Semantic Evaluation (SemEval-2). pp. 166-169. Poster presentation.
    Rank 3rd in the task #5 – Automatic Keyphrase Extraction from Scientific Articles.
    [ Paper ]   [ ACL Anthology (S10-1035) ]

  • Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama and Yee Fan Tan (2010). ForeCite: Towards a Reader-centric Scholarly Digital Library. In Proceedings of the Joint Conference on Digital Libraries (JCDL’10). Poster Paper (1 page).
    [ Paper ] [ Poster (.png) ] [ doi:10.1145/1816123.1816193 ]


Below are some links to the demos of software written by myself or with members in WING group:
  • ParsCit: An open-source CRF Reference String and Logical Document Structure Parsing Package
  • GSNAP: Google Scholar - Network Analysis Package
  • ForeCite: A Reader-centric Digital Library


This is my latest notes, containing all tips/commands/notes about various tools and languages. They are arranged in structures so that I could write a script later to split into separate files, one per language or tool ... For now, sorry for such a mess ...
Below are my notes collected randomly while I were learning some new tools and stuffs. To be organized:) :

See more in my academic page!


Below is a what-every-one-often-has-because-others-also-have Google Calendar :P