Junjie Hu | 胡俊杰
Assistant Professor
Biostatistics & Medical Informatics
Computer Science
Data Science Institute
University of Wisconsin-Madison
Office: 4735 MSC, 420 North Charter Street, Madison, WI
Office phone: +1-6082656118
Email: junjie.hu@wisc.edu
[Research Statement]
About
I am an assistant professor with appointments in the Department of Biostatistics, Department of Computer Science and Data Science Institute at the University of Wisconsin-Madison. I obtained my Ph.D. from School of Computer Science at Carnegie Mellon University, where I worked with Jaime Carbonell and Graham Neubig. I have a broad interest in natural language processing and machine learning. My research goal is to build robust intelligent systems that evolve with changes in the environment and interact with people speaking different languages. In particular, my research focuses on algorithmic design and fundamental understanding of machine learning models in NLP that enable safe deployment in the wild. Most recently, I’m fascinated by understanding behaviors of large language models, adapting them effectively to knowledge-intensive reasoning tasks, and aligning them safely with users from diverse backgrounds. Specific topics of interest include the following aspects of large language models:
Prospective students (Updated on Oct 16 2024): Thanks for your interest! I may not be able to reply to all inqueries due to the large amounts of emails. If you still want to bring my attention to your papers by email, please add “[prospective student to Hulab]” in the email subject. I’ll update hiring information on my website. I am looking for 1 or 2 excellent PhD students to join our lab in the fall of 2025. Please apply to the CS or BDS program, and mention my name in your application and research statement. UW-Madison is an excellent place for research, and Madison is a wonderful city to live in. Please check out these videos (Why UW-Madison, Madison). I’m also happy to work with masters or undergraduate students at UW-Madison. If you are interested, please send me an email with your CV. |
Research Group
I am really fortunate to work with a group of excellent students at UW-Madison. Stay tuned for our latest works! Graduate Students
|
Recent Preprints
2024
- arXiv
- arXiv
- arXivPyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling arXiv preprint arXiv:2406.02069 2024
- arXiv
- arXivPrompting Large Vision-Language Models for Compositional Reasoning arXiv preprint arXiv:2401.11337 2024
Publications
2024
- CSCWMetaWriter: Exploring the Potential and Perils of AI Writing Support in Scientific Peer Review In Proceedings of The 26th ACM Conference on Computer-Supported Cooperative Work and Social Computing 2024
- EMNLPBenchmarking Machine Translation with Cultural Awareness In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) 2024
- EMNLPBeyond Demographics: Aligning Role-playing LLM-based Agents Using Human Belief Networks In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) 2024
- COLMHow Well Do LLMs Identify Cultural Unity in Diversity? In The first Conference of Language Modeling 2024
- ACLOLIVE: Object Level In-Context Visual Embeddings In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024
- ACLData Augmentation using LLMs: Data Perspectives, Learning Paradigms and Challenges In Findings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024
- ICMLChatbot Meets Pipeline: Augment Large Language Model with Definite Finite Automaton In International Conference on Machine Learning (ICML) 2024
- CogSciEvaluating LLM Agent Group Dynamics against Human Group Dynamics: A Case Study on Wisdom of Partisan Crowds In The Annual Conference of the Cognitive Science Society (CogSci). 2024
- NAACLSimulating Opinion Dynamics with Networks of LLM-based Agents In Findings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics 2024
- NAACLHow does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics 2024
- CVPRLookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation In The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024
- EACLLearning Label Hierarchy with Supervised Contrastive Learning In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics 2024
- NeurIPSMitigating Fine-tuning based Jailbreak Attack with Backdoor Enhanced Safety Alignment In Proceedings of the Thirty-eighth Annual Conference on Neural Information Processing Systems (NeurIPS) 2024
2023
- ACLSingle Sequence Prediction over Reasoning Graphs for Multi-hop QA In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 2023
- ACLIs Fine-tuning Needed? Pre-trained Language Models Are Near Perfect for Out-of-Domain Detection In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 2023
- ACLLocal Byte Fusion for Neural Machine Translation In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics 2023
- ACLMultimodal Prompt Retrieval for Generative Visual Question Answering In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL Findings) 2023
- WSEvolving Domain Adaptation of Pretrained Language Models for Text Classification In NeurIPS Workshop on Distribution Shifts, 37th Conference on Neural Information Processing Systems. 2023
2022
- EMNLPBeyond Counting Datasets: Investigating Multilingual Dataset Construction and Necessary Resources In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) 2022
- EMNLPUtilizing Language-Image Pretraining for Efficient and Robust Bilingual Word Alignment In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings) 2022
- IEEE TPAMIVideo Pivoting Unsupervised Multi-modal Neural Machine Translation IEEE transactions on pattern analysis and machine intelligence (To Appear) 2022
- ACLDEEP: DEnoising Entity Pre-training for Neural Machine Translation In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2022
- ACLGlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics 2022
2021
- WMTPhrase-level Active Learning for Neural Machine Translation In The Sixth Conference on Machine Translation (WMT) 2021 [Abs] [Code]
- EMNLPAfroMT: Pretraining Strategies and Reproducible Benchmarks for Translation of 8 African Languages In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2021 [Abs] [Code]
- EMNLPXTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP) 2021 [Abs] [Code]
- NAACLExplicit Alignment Objectives for Multilingual Bidirectional Encoders In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics 2021 [Abs] [Code]
- NAACLMultilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics 2021 [Abs] [Code]
2020
- ICMLXTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation In International Conference on Machine Learning (ICML) 2020 [Abs] [Code]
- ICMLOn Learning Language-Invariant Representations for Universal Machine Translation In International Conference on Machine Learning (ICML) 2020 [Abs]
- ACLUnsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020 [Abs]
- WorkshopTICO-19: the Translation Initiative for COvid-19 In Proceedings of the 1st Workshop on NLP for COVID-19 (Part 2) at EMNLP 2020 [Abs]
- AAAIWhat Makes A Good Story? Designing Composite Rewards for Visual Storytelling In Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI) 2020 [Code]
2019
- ACLDomain Adaptation of Neural Machine Translation by Lexicon Induction In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics 2019 [Abs] [Code]
- CIKMA hybrid retrieval-generation neural conversation model In Proceedings of the 28th ACM International Conference on Information and Knowledge Management 2019 [Code]
- EMNLPREO-Relevance, Extraness, Omission: A Fine-grained Evaluation for Image Captioning In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019 [Abs]
- EMNLPHandling Syntactic Divergence in Low-resource Machine Translation In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019 [Abs]
- EMNLPUnsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) 2019 [Abs]
- WNGTDomain Differential Adaptation for Neural Machine Translation In Proceedings of the 3rd Workshop on Neural Generation and Translation 2019 [Abs]
- NAACLcompare-mt: A Tool for Holistic Comparison of Language Generation Systems In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics (Demonstrations) 2019 [Abs] [Code] [Best Demon Nomination]
2018
- EMNLPRapid Adaptation of Neural Machine Translation to New Languages In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018 [Abs] [Code]
- ACLAutomatic Estimation of Simultaneous Interpreter Performance In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2018 [Abs]
- WMTContextual Encoding for Translation Quality Estimation In Proceedings of the Third Conference on Machine Translation: Shared Task Papers 2018 [Abs] [Code]
2017
- EMNLPStructural Embedding of Syntactic Trees for Machine Comprehension In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing 2017 [Abs]
- ACLSemi-Supervised QA with Generative Domain-Adaptive Nets In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2017 [Abs]
- AAAIAnswer-aware attention on grounded question answering in images In AAAI 2017 Fall Symposium on Natural Communication for Human-Robot Collaboration 2017
- IEEE TNNLSOnline nonlinear AUC maximization for imbalanced data sets IEEE transactions on neural networks and learning systems 2017 [Abs]
2016
- HCOMPLearning Lexical Entries for Robotic Commands via Paraphrasing In AAAI conference on Human Computation 2016 [Abs]
- ICLRWords or Characters? Fine-grained Gating for Reading Comprehension In International Conference on Learning Representations 2016 [Abs]
2015
- IEEE Cybern.Diversified Sensitivity-Based Undersampling for Imbalance Classification Problems IEEE Transactions on Cybernetics 2015 [Abs]
- AAAIKernelized Online Imbalanced Learning with Fixed Budgets In Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI) 2015 [Abs]
- SOSEAr-tracker: Track the dynamics of mobile apps via user review mining In 2015 IEEE Symposium on Service-Oriented System Engineering 2015 [Abs]
Teaching
Talks
Invited Talk at University of Cambridge, LTL Seminar, June 09, 2022.
Invited Talk at Lingustics Fridays Seminar at UW-Madison, April 01, 2022.
Invited Talk at Microsoft Azure Cognitive Services Research, January 20, 2022.
Invited Talk at Bay Area NLP Seminar, November 18, 2021.
Invited Talk at ICTR Seminar at UW-Madison, October 26, 2021.
Invited Talk at Microsoft Research Summit, October 21, 2021.
Invited Talk at CIBM Seminar at UW-Madison, October 19, 2021.
Invited Talk at IFDS Ideas Forum at UW-Madison, October 11, 2021.
XTREME: A Massively Multilingual Multi-task Benchmarkfor Evaluating Cross-lingual Generalization, Junjie Hu, LTI Summer Seminar Series at Carnegie Mellon University, Pittsburgh, July 2, 2020.
Pre-training of Multilingual Encoder for Crosslingual Transfer, Junjie Hu, Google Translate Team, Mountain View, August 20 2019.
Cross-Lingual and Cross Domain Transfer for Neural Machine Translation, Junjie Hu, AI Seminar at Carnegie Mellon University, Pittsburgh April 30 2019.
Transfer Learning for Multilingual Neural Machine Translation, Junjie Hu, SMART-Select Workshop on Multilingual Models and Unsupervised NMT supported by DG Connect of the European Commission, Luxembourg, June 20 2019. Facebook AI Research Lab, Paris, June 21 2019.
Rethinking Visual Storytelling: What Makes A Good Story? Junjie Hu, Microsoft 365 AI Research, Redmond, August 23 2018.
Machine Reading Comprehension via Structural Tree Embeddings, Junjie Hu, Seminar at Chinese University of Hong Kong, March 5 2018.
Lorelei: Understanding Low Resource Languages, Pat Littell, Junjie Hu, Shruti Rijhwani, and Ruochen Xu. LTI Colloquium at Carnegie Mellon University, Pittsburgh, September 8, 2017.
Natural Communication for Human-Robot Collaboration, Junjie Hu, Symposium on Natural Communication for Human-Robot Collaboration, November 9, 2017.
Selected Awards and Scholarships
CMU Graduate Student Assembly Dissertation Writing Group Grant, 2020
CMU Graduate Student Assembly Conference Travel Grant, 2020
NAACL 2019 Best Demonstration Paper Nomination, 2019
Graduate Research Scholarship, Carnegie Mellon University, 2015-2021
Postgraduate Scholarship, The Chinese University of Hong Kong, 2013-2015
Certificate of Merit for Teaching Assistantship, Department of CSE, Chinese University of Hong Kong, 2013-2014
IBM Outstanding Student Scholarship (1 of 77 winners in China), 2012-2013
Outstanding Undergraduate Awards by China Computer Federation (99 winners), 2012-2013
National Scholarship, the Ministry of Education, 2010-2011, 2011-2012