Volen National Center for Complex Systems, 136
DegreesUniversity of Delaware, Ph.D.
Nankai University, M.A.
Nankai University, B.A.
ExpertiseSyntactic and semantic parsing. Chinese word segmentation. Discourse analysis. Building large-scale natural language processing infrastructures (Chinese TreeBank, Chinese Proposition Bank, OntoNotes)
ProfileNianwen Xue is an Associate Professor in the Computer Science Department and the Language & Linguistics Program at Brandeis University. Before joining Brandeis, Nianwen Xue was a research assistant professor in the Department of Linguistics and the Center for Computational Language and Education Research (CLEAR) at the University of Colorado at Boulder. Prior to that, he was a postdoctoral fellow in the Institute for Research in Cognitive Science and the Department of Computer and Information Science at the University of Pennsylvania. He got his PhD in linguistics from University of Delaware.
Nianwen Xue has broad interests in computational linguistics and natural language processing. He has devoted substantial efforts to the development of linguistic corpora annotated with syntactic, semantic, temporal and discourse information that are crucial resources in the field of natural language processing. The other thread of his research involves using statistical and machine learning techniques in solving natural language processing problems. He has published work in the areas of Chinese word segmentation, syntactic and semantic parsing, coreference, discourse analysis, machine translation as well as biomedical natural language processing. His research has received support from the National Science Foundation (NSF), IARPA and DARPA. He serves on the editorial boards of IEEE Transactions on Asian Language Processing, Language Resources and Evaluation, and Computer Processing of Oriental Languages.
|COSI||216a||Topics in Natural Language Processing|
|LING||131a||Programming for Linguistics|
Xue, Nianwen, Rint Sybesma. "The Penn Chinese Treebank." Encyclopedia of Chinese Language and Linguistics. 2016. (forthcoming)
Xiaopeng Bai and Niawnen Xue. "Generalizing the semantics roles in the Chinese Proposition Bank." Language Resources and Evaluation (2016): 1-24.
Xuansong Li, Martha Palmer, Nianwen Xue, Lance Ramshaw, Mohamed Maamouri, Kathryn Summerville Conger, Stephen Grimes, and Stephanie Strassel. "Largest Multi-lingual, Multi-level and Multi-genre Annotation Corpus." LREC-2016, Portoroz, Slovenia. 2016.
Attapol Rutherford and Nianwen Xue. "mproving the inference of implicit discourse relations via classifying explicit discourse connectives." Proceedings of NAACL-HLT-2015, Denver, Clorado. 2015.
Chu-Ren Huang and Nianwen Xue. "Modeling word concepts without convention: linguistic and computational issues in Chinese Word Identification." The Oxford Handbook of Chinese Linguistics. Ed. William S-Y. Wang and Chaofen Sun. Oxford University Press, 2015
Chuan Wang, Nianwen Xue and Sameer Pradhan. "A transition-based algorithm for AMR parsing." Proceedings of NAACL-HLT-2015, Denver, Colorado. 2015.
Chuan Wang, Nianwen Xue and Sameer Pradhan. "Boosting transition-based AMR parsing with refined actions and auxiliary analyzers. In Proceedigns of ACL-2015." Beijing, CHina. 2015.
Dun Deng, Nianwen Xue and Shiman Guo. "Harmonizing word alignments and syntactic structures for extracting phrase translation equivalents.." Proceedings of SSST-2015, Denver, Colorado. 2015.
Xue, Nianwen. "Nianwen Xue, Hwee Tou Ng, Sameer Pradhan, Rashmi Prasad, Christopher Bryant, Attapol Rutherford." The CoNLL-2015 Shared Task on Shallow Discourse Parsing, Beijing, China. 2015.
Yaqin Yang, Yalin Liu and Nianwen Xue. "Recovering dropped pronouns from Chinese text messages." Proceedigns of ACL-2015, Beijing, China. 2015.
Zhiguo Wang, Haitao Mi, and Nianwen Xue. "Feature optimization for constituent structure parsing via neural networks." Proceedigns of ACL-2015, Beijing, China. 2015.
Attapol Rutherford and Nianwen Xue. Discovering Implicit Discourse Relations Through Brown Cluster Pair Representation and Coreference Patterns. Proc. of Proceedings of EACL-2014. Gothenburg, Sweden: 2014.
Dun Deng and Nianwen Xue. "Aligning Chinese-English Parallel Parse Trees: Is it Feasible?." Proceedings of LAW VIII, Dublin, Ireland. 2014.
Dun Deng and Nianwen Xue. "Building a Hierarchically Aligned Parallel Chinese-English TreeBank." Proceedings of COLING-2014, Dublin, Ireland. 2014.
Nianwen Xue and Yuchen Zhang. Buy One Get One Free: Distant Annotation of Chinese Tense, Event Type and Modality. Proc. of Proceedings of LREC.. Reykjavik, Iceland: 2014.
Nianwen Xue, Ondrej Bojar, Jan Hajic, Martha Palmer, Zdenka Uresova and Xiuhong Zhang. Not an Interlingua, but Close: Comparison of English AMRs to Chinese and Czech. Proc. of Proceedings of LREC-2014. Reykjavík, Island: 2014.
Si Li and Nianwen Xue. "Effective Document-level Features for Chinese Patent Word Segmentation." Proceedings of ACL-2014 (Short papers), Baltimore, MD. 2014.
Yuchen Zhang and Nianwen Xue. "Automatic inference of the tense of Chinese events using implicit Information." Proceedings of EMNLP-2014., Doha, Qatar. 2014.
Yuping Zhou and Nianwen Xue. "The Chinese Discourse TreeBank: A Chinese Corpus Annotated with Discourse relations." Language Resources and Evaluation. (2014).
Zhiguo Wang and Nianwen Xue. Joint POS Tagging and Transition-based Con- stituent Parsing in Chinese with Non-local Features. Proc. of Proceedings of ACL-2014. Baltimore, MD: 2014.
Chen, Yao, Peter Anick, Pengyu Hong and Nianwen Xue. "Temporal Relation Discovery between Events and Temporal Expressions Identified in Clinical Narrative." Journal of BioInformatics 46 (2013): S48-S53.
Nianwen Xue, Yuchen Zhang and Yaqin Yang. "Distant annotation of Chinese tense and modality." Proceedings of the Workshop on Annotation of Modal Meaning in Natural Language (WAMM), Potsdam, Germany. 2013.
Sameer Pradhan, Alessandro Moschitti, Nianwen Xue, Hwee-Tou Ng, Zhong Zhi, Olga Uryupina, Anders Bjorkelund, Yuchen Zhang. Evaluating NLP Components through Multiple Interdependent Layers of Linguistic Annotation in the Multilingual, Multi-Genre OntoNotes Corpus. Proc. of Proceedings of CoNLL 2013. Sophia, Bulgaria: 2013.
Wang, Zhiguo and Nianwen Xue. A Lattice-based framework for Joint Chinese word seg- mentation, POS tagging and parsing.. Proc. of Proceedings of ACL 2013. Sophia, Bulgaria.: 2013.
Xue, Nianwen and Yaqin Yang. "Dependency-based empty category detection via phrase structure trees." Proceedings of NAACL-HLT 2013, Atlanta, Georgia. 2013.
Zhiguo Wang, Chengqing Zong and Nianwen Xue. Bidirectional Sequence Labeling via Dual Decomposition. Proc. of Proceedings of Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. Suzhou, China: 2013.
Bai, Xiaopeng and Xue, Nianwen. "Building a Chinese Lexical Taxonomy." The 2nd CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP-2012), Tianjin, China. 2012.
Cheng, Yao, Peter Anick, Nianwen Xue and Pengyu Hong. "Temporal Relation Discovery between Events and Temporal Expressions Identified in Clinical Narrative." The 2012 i2b2 Shared-Tasks and Workshop on Challenges in Natural Language Processing for Clinical Data, Chicago, IL. October.
Sammeer Pradhan, Alessandro Moschitti, and Nianwen Xue, Olga Uryupina, Yuchen Zhang. "Modeling multilingual unrestricted coference in OntoNotes." EMNLP-CoNLL 2012 Shared Task, Jeju Island, Korea. 2012.
Verspoor K., Cohen, K.B., Lanfranchi, A., Warner, C., Johnson, H. L., Roeder, C., Choi, J.D., Funk, C. Malenkiy, Y., Eckert, M., Xue, N., Baumgartner Jr., W.A., Bada, M., Palmer, M., Hunter L.E.. "A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools." BMC Bioinformatics (2012).
Warner, Jeremy, Peter Anick, Kenneth Roach, Nianwen Xue, Robin Joyce, Charles Safran and Pengyu Hong. "Towards an Annotation Schema for Cancer Trajectory State Detection." AMIA Proceedings Poster, Chicago, IL. 2012.
Xuansong Li, Stephanie Strassel, Stephen Grimes, Safa Ismael, Mohamed Maamouri, Ann Bies and Nianwen Xue. "Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures." LREC-2012, Istanbul, Turkey. 2012.
Xue, Nianwen. "Elizabeth Baran, Yaqin Yang and Nianwen Xue." Annotating dropped pronouns in Chinese newswire text, LREC-2012. 2012.
Yaqin Yang and Nianwen Xue. "Chinese comma disambiguation for discourse analysis." ACL-2012, Jeju Island, Korea. 2012.
Yuping Zhou and Nianwen Xue. "Exploring temporal vagueness via Mechanical Turk." Proceedings of LAW VI, Jeju Island, Korea. 2012.
Yuping Zhou and Nianwen Xue. "PDTB-style discourse annotation of Chinese text." ACL-2012, Jeju Island, Korea. 2012.
Zhang, Xiuhong and Xue, Nianwen. "Extending and Scaling up the Chinese Treebank Annotation." The 2nd CIPS-SIGHAN Joint Conference on Chinese Language Processing (CLP-2012), Tianjin, China. 2012.
Adam Meyers, Michiko Kosaka, Shasha Liao and Nianwen Xue. "Improving MT Word Alignment Using Aligned Multi-Stage Parses." SSST-2011, Portland, Oregon. 2011.
Elizabeth Baran and Nianwen Xue. "Singular or Plural? Exploiting Parallel Corpora for Chinese Number Prediction." Machine Translation Summit XIII, Xiamen, China. 2011.
Jeremy Warner, Peter Anick, Pengyu Hong and Nianwen Xue. "Natural Language Processing and the Oncologic History: Is There a Match?." Journal of Oncological Practice 7. 4 (2011).
Keh-Jiann Chen, Qun Liu, Nianwen Xue and Le Sun. "Introduction to the Special Issue on Chinese Language Processing." ACM Transactions on Asian Language Information Processing 10. 3 (2011).
Peter Anick, Pengyu Hong, Nianwen Xue and Yaqin Yang. "2B2 2011 Challenge: Coreference Resolution for Electronic Medical Records." The Fifth I2B2/VA Workshop on Challenges in Natural Language Processing for Clinical Data, Washington, D. C.. 2011.
Ralph Weischedel, Eduard Hovy, Mitchell Marcus, Martha Palmer, Robert Belvin, Sameer Pradan, Lance Ramshaw and Nianwen Xue. "OntoNotes: A Large Training Corpus for Enhanced Processing." Handbook of Natural Language Processing and Machine Translation. Ed. Joseph Olive, Caitlin Christianson and John McCary. Springer, 2011
Sameer Pradhan, Lance Ramshaw, Mitchell Marcus, Martha Palmer, Ralph Weischedel and Nianwen Xue. "CoNLL-2011 Shared Task: Modeling Unrestricted Coreference in OntoNotes." CoNLL-2011, Portland, Oregon. 2011.
Xue, Nianwen and Yaqin Yang. "Chinese sentence segmentation as comma disambiguation." The 49th Annual Meeting of the Association of Computational Linguistics (Short Papers), Portland, Oregon. 2011.
Xue, Nianwen. "Book review of Natural Language Processing with Python." Rev. of Natural Language Processing with Python, by Steven Bird, Ewan Klein and Edward Loper. vol. 17 of 3 2011: 419-424.
Xue, Nianwen. "The Impact of Word Segmentation on Chinese Parsing." Handbook of Natural Language Processing and Machine Translation. Ed. Joseph Olive, Caitlin Christianson and John McCary. Springer, 2011
Yaqin Yang, Nianwen Xue and Peter Anick. "A Machine Learning-Based Coreference Detection System for OntoNotes." CoNLL-2011, Portland, Oregon.. 2011.
Yuping Zhou and Nianwen Xue. "Discourse-constrained Temporal Annotation." The Fifth Linguistic Annotation Workshop (LAW V), Oregon, Portland. 2011.
Jena D. Hwang, Archna Bhatia, Claire Bonial, Aous Mansouri, Ashwini Vaidya, Nianwen Xue and Martha Palmer. "Propbank Annotation of Multilingual Light Verb Constructions." The Fourth Linguistic Annotation Workshop (LAW IV), Uppsala, Sweden. 2010.
Martha Palmer and Nianwen Xue.. "Linguistic Annotation." Handbook of Computational Linguistics and Natural Language Processing. Ed. Clark, Fox and Lappin. Blackwell, 2010
Martha Palmer, Daniel Gildea, and Nianwen Xue. Semantic Role Labeling. 1st ed. Morgan and Claypool Publishers, 2010.
Xue, Nianwen and Zhou, Yuping. "Applying syntactic, semantic and discourse constraints to Chinese temporal annotation." The 23rd International Conference on Computational Linguistics (COLING), Beijing, China. 2010.
Yang, Yaqin and Xue, Nianwen. "Chasing the ghost: recovering empty categories in the Chinese Treebank." The 23rd International Conference on Computational Linguistics (COLING), Beijing China. 2010.
Adam Meyers, Michico Kosaka, Heng Ji, Nianwen Xue, Mary Harper, Ang Sun, Wei Sun and Shasha Liao. "Transducing Logic Relations from Automatic and Manual Annotation." Proceedings of ACL-IJCNLP workshop on Linguistic Annotation, Singapore. 2009.
Adam Meyers, Michiko Kosaka, Nianwen Xue, Heng Ji, Ang Sun, Shasha Liao and Wei Xue. Automatic Recognition of Logical Relations for English, Chinese and Japanese. SEW-2009. Proc. of Semantic Evaluation Workshop. Boulder, Colorado: SEW-2009, 2009.
Jan Hajic, Massimiliano Ciaramita, Richard Johansson, Daisuke Kawahara, Maria Antonia Marti, Lluis Marquez, Adam Meyers, Joakim Nivre, Sebastian Pado, Jan Stepanek, Pavel Stranak, Mihai Surdeanu, Nianwen Xue and Yi Zhang.. The CoNLL-2009 Shared Task: Syntactic and Semantic Dependencies in Multiple languages. Proc. of The 13th Conference on Computational Natural Language Learning (CoNLL-2009). Boulder, Colorado: ACL, 2009.
Jinho D. Choi, Martha Palmer and Nianwen Xue. "Using Parallel Propbanks to enhance Word-alignments.." Proceedings of ACL-IJCNLP workshop on Linguistic Annotation, Singapore. 2009.
Xue, Nianwen and Martha Palmer. "Adding semantic roles to the Chinese Treebank." Natural Language Engineering 15. 1 (2009): 143-172.
Xue, Nianwen, Hua Zhong and Kai-Yun Chen. Annotating ``tense'' in a tense-less language. Proc. of LREC-2008. Marrakech, Morocco: LREC, 2008.
Xue, Nianwen. "Labeling Chinese predicates with semantic roles." Computational Linguistics 24. 2 (2008): 225-255.
Xue, Nianwen. Automatic inference of the temporal location of situations in Chinese text. Proc. of EMNLP-2008. Honolulu, Hawaii: EMNLP, 2008.
Vicky Lai, Meiyu Chang,Cecily Duffield, Jena D. Hwang, Nianwen Xue and Martha Palmer. Defining a Methodology for Mapping Chinese \& English Sense Inventories. Proc. of Chinese Lexical Semantics Workshop. Hong Kong, China: CLSW, 2007.
Xue, Nianwen. Tapping the implicit information in the PS to DS conversion of the Chinese Treebank. Proc. of Sixth International Workshop on Treebanks and Linguistic Theories. Bergen, Norway: TLT, 2007.
A. Meyers, A. C. Fang, L. Ferro, S. Kler, T. Jia-Lin, M. Palmer, M. Poesio, A. Dolbey, K. K. Schuler, E. Loper, H. Zinsmeister, G. Penn, N. Xue, E. Hinrichs, J. Wiebe, J. Pustejovsky, D. Farwell, E. Hajicova, B. Dorr, E. Hovy, B. A. Onyshkevych and. Annotation Compatibility Working Group Report. Proc. of ACL 2006 Workshop: Frontiers in Linguistically Annotated Corpora. Sydney, Australia: ACL, 2006.
Nianwen Xue, Jinying Chen and Martha Palmer. Aligning features with sense distinction dimensions. Proc. of Proceedings of ACL/COLING, Poster Session. Sydney, Australia: ACL, 2006.
Xue, Nianwen. "A Chinese lexicon of roles and senses." Journal of Language Resources 40. 1-2 (2006): 395-403.
Xue, Nianwen. Annotating the predicate-argument structure of Chinese nominalizations. Proc. of LREC-2006. Genoa, Italy: LREC, 2006.
Xue, Nianwen. Semantic Role Labeling of nominalized predicates. Proc. of HLT-NAACL. New York City: ACL, 2006.
Martha Palmer, Nianwen Xue, Olga B Babko-Malaya, Jinying Chen and Benjamin Snyder.. A Parallel Proposition Bank II for Chinese and English. Proc. of the ACL Workshop in Frontiers in Annotation II: Pie in the Sky. Ann Arbor, Michigan: ACL, 2005.
Xue, Nianwen and Martha Palmer. Automatic Semantic Role Labeling for Chinese Verbs. Proc. of IJCAI'05. Edinburgh, Scotland: IJCAI, 2005.
Xue, Nianwen, Fei Xia, Fu-Dong Chiou and Martha Palmer. "The Penn Chinese Treebank: Phrase Structure Annotation of a Large Corpus." Natural Language Engineering 11. 2 (2005): 207-238.
Xue, Nianwen. Annotating discourse connectives in the Chinese Treebank. Proc. of The ACL Workshop in Frontiers in Annotation II: Pie in the Sky. Ann Arbor, Michigan: ACL, 2005.
Jinying Chen, Nianwen Xue and Martha Palmer. Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging. Proc. of the International Joint Conference on Natural Language Processing. Hainan Island, China: IJCNLP, 2004.
Olga Babko-Malaya, Martha Palmer, Nianwen Xue, Aravind Joshi and Seth Kulick. Proposition Bank II: Delving Deeper. Proc. of the Frontiers in Corpus Annotation Workshop. Boston, MA: NAACL, 2004.
Xue, Nianwen and Martha Palmer. Propbanking in Parallel. Proc. of the Workshop on "the Amazing Utility of Parallel and Comparable Corpora, in conjunction with LREC'04. Lisbon, Portugal: LREC, 2004.
Xue, Nianwen. Handling Dislocated and Discontinuous Constituents in Chinese Semantic Role Labeling. Proc. of the 4th Workshop on Asian Language Resources. Hainan Island, China: ALR, 2004.
Xue, Nianwen and Libin Shen. Word Chinese Segmentation as LMR Tagging. Proc. of the 2nd SIGHAN Workshop on Chinese Language Processing. Sapporo, Japan: SIGHAN, 2003.
Xue, Nianwen and Martha Palmer. Annotating Propositions in the Penn Chinese Treebank. Proc. of the 2nd SIGHAN Workshop on Chinese Language Processing. Sapporo, Japan: SIGHAN, 2003.
Xue, Nianwen and Seth Kulick. Automatic Predicate Argument Structure Analysis of the Penn Chinese Treebank. Proc. of Machine Translation Summit IX. New Orleans, Louisiana: AMTA, 2003.
Xue, Nianwen. "Chinese Word Segmentation as Character Tagging." International Journal of Computational Linguistics and Chinese Language Processing 8. 1 (2003): 29-48.
Nianwen Xue, Fu-Dong Chiou and Martha Palmer. Building a Large-Scale Annotated Chinese Corpus. Proc. of the 19th. International Conference on Computational Linguistics. Taipei, Taiwan: COLING, 2002.
Xue, Nianwen and Susan Converse. Combining Classifiers for Chinese Word Segmentation. Proc. of the 1st SIGHAN Workshop on Chinese Language Processing. Taipei, Taiwan: SIGHAN, 2002.
Fei Xia, Martha Palmer, Nianwen Xue, Mary Ellen Okurowski, John Kovarik, Fu-Dong Chiou, Shizhe Huang, Tony Kroch, and Mitch Marcus. Designing guidelines and ensuring consistency for Chinese text annotation. Proc. of the Second International Conference on Language Resources and Evaluation. Athens, Greece: LREC, 2001.
Xue, Nianwen and Fei Xia. The Bracketing Guidelines for the Chinese Treebank. IRCS technical report, University of Pennsylvania. Philadelphia: 2001.