Welcome to my homepage ^_^
My home country is Vietnam, the S-shaped easternmost area on the Indochina Peninsula in Southeast Asia. My B.S degree was awarded by the National University of Singapore in 2009. During my time in Singapore, I was fortunate to be affiliated with the two research groups: WING group under Prof. Min-Yen Kan (Feb 2009 - Oct 2010), and Speech group under Prof. Khe Chai Sim (Nov 2010 - Aug 2011).Here is my (often outdated) CV.
- Oct 2014 — I had a fruitful summer internship at Google hosted by Quoc Le. Together with other great members in Google Brain team, Ilya Sutskever, Oriol Vinyals, Wojciech Zaremba, we built a neural machine translation that, for the first time, outperforms the state-of-the-art system in the English-to-French translation task. More details.
- July 2013 — Our TACL'13 paper "Parsing entire discourses as very long strings: capturing topic continuity in grounded language learning" with Prof. Mark Johnson and Prof. Michael C. Frank is now available. This work is done in Fall, 2012 at Macquarie University, Australia.
- June 2013 — Our CoNLL'13 paper "Better Word Representations with Recursive Neural Networks for Morphology" with Richard Socher and Prof. Christopher Manning is now available. See you in Sofia!
- Apr 2012 — This Winter quarter, I am officially aligned with Prof. Christopher Manning, i.e., my advisor!
- Jan 2012 — Happy new year, everyone! This Spring quarter, I am back to Gates, joining Prof. Christopher Manning's NLP group as part of the research rotation program.
- Sep 2011 — This Fall quarter, I am having a research rotation with Prof. Noah D. Goodman, exploring computational models for human reading time prediction.
- Aug 2011 — Paper accepted at ASRU'2011 "A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition" with Prof. Khe Chai Sim.
- Apr 2011 — I will be joining the Stanford CS department this Fall as a PhD candidate (Sep 2011).
I am now mainly into deep learning models for NLP tasks, specifically machine translation. Occassionaly, I build parsers used for psycholinguistics applications :)
Before Stanford: I worked in many scholarly Digital Library projects – document structure discovery and keyphrase extraction. I also investigated in model-based techniques for robust Speech Recognition systems.
- Minh-Thang Luong, Ilya Sutskever, Quoc V. Le, Oriol Vinyals, Wojciech Zaremba. 2014. Addressing the Rare Word Problem in Neural Machine Translation. ArXiv.
[ Paper ]
- Minh-Thang Luong, Richard Socher, and Christopher D. Manning. 2013. Better Word Representations with Recursive Neural Networks for Morphology. Conference on Computational Natural Language Learning (CoNLL’13). Sofia, Bulgaria.
[ Paper ] [ Bib ] [ Dataset ] [ Project page for word vectors and other information. ]
- Minh-Thang Luong, Michael C. Frank, and Mark Johnson. 2013. Parsing entire discourses as very long strings: Capturing topic continuity in grounded language learning. Transactions of the Association for Computational Linguistics (TACL’13).
[ Paper ] [ Bib ] [ Code (Earleyx parser) ]
Before Stanford: 2011
- Khe Chai Sim and Minh-Thang Luong. 2011. A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition. Automatic Speech Recognition and Understanding (ASRU’11). Hawaii, USA. Oral presentation.
[ Paper ] [ Poster (.pdf) ]
- Minh-Thang Luong, Thuy Dung Nguyen, and Min-Yen Kan. 2011. Logical Structure Recovery in Scholarly Articles with Rich Document Features. International Journal of Digital Library Systems (IJDLS) 1(4). pp 1-23. Invited paper.
[ Paper ] [ doi:10.4018/jdls.2010100101 ]
- Minh-Thang Luong, Preslav Nakov and Min-Yen Kan. 2010. A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’10). Boston, Massachusetts, USA. pp.148-157. Oral presentation.
[ Paper ] [ ACL Anthology (D10-1015) ]
- Minh-Thang Luong and Min-Yen Kan. 2010. Enhancing Morphological Alignment for Translating Highly Inflected Languages. In Proceedings of the International Conference on Computational Linguistics (COLING’10). Beijing, China. pp. 743-751. Oral presentation.
[ Paper ] [ ACL Anthology (C10-1084) ]
- Thuy Dung Nguyen and Minh-Thang Luong. 2010. WINGNUS: Keyphrase Extraction Utilizing Document Logical Structure. In Proceedings of the ACL 2010 Workshop on Evaluation Exercises on Semantic Evaluation (SemEval-2). pp. 166-169. Poster presentation.
Rank 3rd in the task #5 – Automatic Keyphrase Extraction from Scientific Articles.
[ Paper ] [ ACL Anthology (S10-1035) ]
- Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama and Yee Fan Tan (2010). ForeCite: Towards a Reader-centric Scholarly Digital Library. In Proceedings of the Joint Conference on Digital Libraries (JCDL’10). Poster Paper (1 page).
[ Paper ] [ Poster (.png) ] [ doi:10.1145/1816123.1816193 ]
SoftwareBelow are some links to the demos of software written by myself or with members in WING group:
- ParsCit: An open-source CRF Reference String and Logical Document Structure Parsing Package
- GSNAP: Google Scholar - Network Analysis Package
- ForeCite: A Reader-centric Digital Library
NotesThis is my latest notes, containing all tips/commands/notes about various tools and languages. They are arranged in structures so that I could write a script later to split into separate files, one per language or tool ... For now, sorry for such a mess ...
Below are my notes collected randomly while I were learning some new tools and stuffs. To be organized:) :
See more in my academic page!