publications

Publications (30+): First Author (10+), Corresponding Author (10+), CCF-A (20+)

2026

  1. RealSec-bench: A Benchmark for Evaluating Secure Code Generation in Real-World Repositories
    Yanlin Wang, Ziyao Zhang, Chong Wang, and 5 more authors
    arXiv preprint arXiv:2601.22706, 2026
  2. TSE
    Are decoder-only large language models the silver bullet for code search?
    Yuxuan Chen, Mingwei Liu, Guangsheng Ou, and 4 more authors
    IEEE Transactions on Software Engineering, 2026
  3. Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey
    Caihua Li, Lianghong Guo, Yanlin Wang, and 8 more authors
    arXiv preprint arXiv:2601.11655, 2026
  4. ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation
    Sicong Liu, Yanxian Huang, Mingwei Liu, and 6 more authors
    arXiv preprint arXiv:2601.09703, 2026

2025

  1. ASE 2025
    DrainCode: Stealthy Energy Consumption Attacks on Retrieval-Augmented Code Generation via Context Poisoning
    Yanlin Wang, Jiadong Wu, Tianyue Jiang, and 7 more authors
    In 2025 40th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2025
  2. SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA
    Jing Zhang, Lianghong Guo, Yanlin Wang, and 7 more authors
    arXiv preprint arXiv:2512.08867, 2025
  3. Framework-Aware Code Generation with API Knowledge Graph-Constructed Data: A Study on HarmonyOS
    Mingwei Liu, Zheng Pei, Yanlin Wang, and 5 more authors
    arXiv preprint arXiv:2512.00380, 2025
  4. Knowledge Matters: Injecting Project and Testing Knowledge into LLM-based Unit Test Generation
    Anji Li, Mingwei Liu, Zhenxi Chen, and 5 more authors
    arXiv preprint arXiv:2511.14224, 2025
  5. Uncovering Pretraining Code in LLMs: A Syntax-Aware Attribution Approach
    Yuanheng Li, Zhuoyang Chen, Xiaoyun Liu, and 5 more authors
    arXiv preprint arXiv:2511.07033, 2025
  6. TSE
    CoSQA+: Enhancing code search evaluation with a multi-choice benchmark and test-driven agents
    Jing Gong, Yanghui Wu, Linxi Liang, and 4 more authors
    IEEE Transactions on Software Engineering, 2025
  7. EffiReasonTrans: RL-Optimized Reasoning for Code Translation
    Yanlin Wang, Rongyi Ou, Yanli Wang, and 6 more authors
    arXiv preprint arXiv:2510.18863, 2025
  8. Generating High-Quality Datasets for Code Editing via Open-Source Language Models
    Zekai Zhang, Mingwei Liu, Zhenxi Chen, and 7 more authors
    arXiv preprint arXiv:2509.25203, 2025
  9. EvolMathEval: Towards Evolvable Benchmarks for Mathematical Reasoning via Evolutionary Testing
    Shengbo Wang, Mingwei Liu, Zike Li, and 4 more authors
    arXiv preprint arXiv:2508.13003, 2025
  10. A hierarchical and evolvable benchmark for fine-grained code instruction following with multi-turn feedback
    Guoliang Duan, Mingwei Liu, Yanlin Wang, and 3 more authors
    arXiv preprint arXiv:2507.00699, 2025
  11. FSE 2025
    Beyond functional correctness: Investigating coding style inconsistencies in large language models
    Yanlin Wang, Tianyue Jiang, Mingwei Liu, and 5 more authors
    Proceedings of the ACM on Software Engineering, 2025
  12. AdaDec: Uncertainty-guided adaptive decoding for LLM-based code generation
    Kaifeng He, Mingwei Liu, Chong Wang, and 4 more authors
    arXiv e-prints, 2025
  13. Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
    Mingwei Liu, Juntao Li, Ying Wang, and 8 more authors
    arXiv preprint arXiv:2504.12608, 2025
  14. FeedbackEval: A benchmark for evaluating large language models in feedback-driven code repair tasks
    Dekun Dai, Mingwei Liu, Anji Li, and 5 more authors
    arXiv preprint arXiv:2504.06939, 2025
  15. Enhancing the robustness of LLM-generated code: Empirical study and framework
    Zike Li, Mingwei Liu, Anji Li, and 4 more authors
    arXiv e-prints, 2025
  16. What to retrieve for effective retrieval-augmented code generation? An empirical study and beyond
    Wenchao Gu, Juntao Chen, Yanlin Wang, and 6 more authors
    arXiv preprint arXiv:2503.20589, 2025
  17. Enhancing LLM-based code translation in repository context via triple knowledge-augmented
    Guangsheng Ou, Mingwei Liu, Yuxuan Chen, and 5 more authors
    arXiv e-prints, 2025
  18. RustEvo²: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
    Linxi Liang, Jing Gong, Mingwei Liu, and 5 more authors
    arXiv preprint arXiv:2503.16922, 2025
  19. How should we build a benchmark? Revisiting 274 code-related benchmarks for LLMs
    Jialun Cao, Yuk-Kit Chan, Zixuan Ling, and 8 more authors
    arXiv preprint arXiv:2501.10711, 2025
  20. ASE 2025
    RustRepoTrans: Repository-level Context Code Translation Benchmark Targeting Rust
    Guangsheng Ou, Mingwei Liu, Yuxuan Chen, and 3 more authors
    In 2025 40th IEEE/ACM International Conference on Automated Software Engineering (ASE), 2025

2024

  1. Evaluating Large Language Models in Class-Level Code Generation
    Xueying Du, Mingwei Liu, Kaixin Wang, and 7 more authors
    In Proceedings of the 46th IEEE/ACM International Conference on Software Engineering, ICSE 2024, Lisbon, Portugal, April 14-20, 2024, 2024
  2. Evaluating and Improving ChatGPT for Unit Test Generation
    Zhiqiang Yuan, Mingwei Liu, Shiji Ding, and 4 more authors
    In Proceedings of the 32nd ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2024, to appear, July 2024, Brazil, 2024
  3. JoS
    Revisiting The Retrieval-Augmentation Strategy In Code Completion
    Baihan Zou, Ying Wang, Xin Peng, and 5 more authors
    Journal of Software, 2024
  4. STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis
    Junwei Liu, Yixuan Chen, Mingwei Liu, and 2 more authors
    CoRR, 2024
  5. Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models
    Yanlin Wang, Tianyue Jiang, Mingwei Liu, and 2 more authors
    arXiv preprint arXiv:2407.00456, 2024

2023

  1. XCoS: Explainable Code Search Based on Query Scoping and Knowledge Graph
    Chong Wang, Xin Peng, Zhenchang Xing, and 4 more authors
    ACM Trans. Softw. Eng. Methodol., 2023
  2. TSE
    Task-Oriented ML/DL Library Recommendation Based on a Knowledge Graph
    Mingwei Liu, Chengyuan Zhao, Xin Peng, and 3 more authors
    IEEE Trans. Software Eng., 2023
  3. ICSME 2023
    Knowledge Graph based Explainable Question Retrieval for Programming Tasks
    Mingwei Liu, Simin Yu, Xin Peng, and 4 more authors
    In IEEE International Conference on Software Maintenance and Evolution, ICSME 2023, Bogotá, Colombia, October 1-6, 2023, 2023
  4. ASE 2023
    CodeGen4Libs: A Two-Stage Approach for Library-Oriented Code Generation
    Mingwei Liu, Tianyong Yang, Yiling Lou, and 3 more authors
    In 38th IEEE/ACM International Conference on Automated Software Engineering, ASE 2023, Luxembourg, September 11-15, 2023, 2023
  5. KG4CraSolver: Recommending Crash Solutions via Knowledge Graph
    Xueying Du, Yiling Lou, Mingwei Liu, and 2 more authors
    In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2023, San Francisco, CA, USA, December 3-9, 2023, 2023
  6. Recommending Analogical APIs via Knowledge Graph Embedding
    Mingwei Liu, Yanjun Yang, Yiling Lou, and 4 more authors
    In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2023, San Francisco, CA, USA, December 3-9, 2023, 2023
  7. No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation
    Zhiqiang Yuan, Yiling Lou, Mingwei Liu, and 4 more authors
    CoRR, 2023
  8. Evaluating Instruction-Tuned Large Language Models on Code Comprehension and Generation
    Zhiqiang Yuan, Junwei Liu, Qiancheng Zi, and 3 more authors
    CoRR, 2023
  9. Resolving Crash Bugs via Large Language Models: An Empirical Study
    Xueying Du, Mingwei Liu, Juntao Li, and 3 more authors
    CoRR, 2023
  10. JoS
    Survey on Representation Learning Methods of Knowledge Graph for Link Prediction
    Xueying Du, Mingwei Liu, Liwei Shen, and 1 more author
    Journal of Software, 2023

2022

  1. TSE
    API-Related Developer Information Needs in Stack Overflow
    Mingwei Liu, Xin Peng, Andrian Marcus, and 3 more authors
    IEEE Trans. Software Eng., 2022
  2. How to formulate specific how-to questions in software development?
    Mingwei Liu, Xin Peng, Andrian Marcus, and 4 more authors
    In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/FSE 2022, Singapore, Singapore, November 14-18, 2022, 2022

2021

  1. Learning-based extraction of first-order logic representations of API directives
    Mingwei Liu, Xin Peng, Andrian Marcus, and 5 more authors
    In ESEC/FSE ’21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, August 23-28, 2021, 2021
  2. JoS
    Automatic Code Semantic Tag Generation Approach Based on Software Knowledge Graph
    Shuangshuang Xing, Mingwei Liu, and Xin Peng
    Journal of Software, 2021

2020

  1. ICSME 2020
    Source Code based On-demand Class Documentation Generation
    Mingwei Liu, Xin Peng, Xiujie Meng, and 5 more authors
    In IEEE International Conference on Software Maintenance and Evolution, ICSME 2020, Adelaide, Australia, September 28 - October 2, 2020, 2020
  2. ICSME 2020
    Learning based and Context Aware Non-Informative Comment Detection
    Mingwei Liu, Yanjun Yang, Xin Peng, and 4 more authors
    In IEEE International Conference on Software Maintenance and Evolution, ICSME 2020, Adelaide, Australia, September 28 - October 2, 2020, 2020
  3. ASE 2020
    Generating Concept based API Element Comparison Using a Knowledge Graph
    Yang Liu, Mingwei Liu, Xin Peng, and 3 more authors
    In 35th IEEE/ACM International Conference on Automated Software Engineering, ASE 2020, Melbourne, Australia, September 21-25, 2020, 2020
  4. ESEC/FSE 2020
    API method recommendation via explicit matching of functionality verb phrases
    Wenkai Xie, Xin Peng, Mingwei Liu, and 4 more authors
    In ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, 2020

2019

  1. A learning-based approach for automatic construction of domain glossary from source code and documentation
    Chong Wang, Xin Peng, Mingwei Liu, and 4 more authors
    In Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, 2019
  2. Generating query-specific class API summaries
    Mingwei Liu, Xin Peng, Andrian Marcus, and 4 more authors
    In Proceedings of the ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, ESEC/SIGSOFT FSE 2019, Tallinn, Estonia, August 26-30, 2019, 2019

2018

  1. ICSME 2018
    Automatic Generation of API Documentations for Open-Source Projects
    Xin Peng, Yifan Zhao, Mingwei Liu, and 4 more authors
    In IEEE Third International Workshop on Dynamic Software Documentation, DySDoc@ICSME 2018, Madrid, Spain, September 25, 2018, 2018
  2. ICSME 2018
    Improving API Caveats Accessibility by Mining API Caveats Knowledge Graph
    Hongwei Li, Sirui Li, Jiamou Sun, and 4 more authors
    In 2018 IEEE International Conference on Software Maintenance and Evolution, ICSME 2018, Madrid, Spain, September 23-29, 2018, 2018
  3. Internetware 2018
    Searching StackOverflow Questions with Multi-Faceted Categorization
    Mingwei Liu, Xin Peng, Qingtao Jiang, and 3 more authors
    In Proceedings of the Tenth Asia-Pacific Symposium on Internetware, Internetware 2018, Beijing, China, September 16, 2018, 2018

  1. Exploring Large Language Models in Resolving Environment-Related Crash Bugs: Localizing and Repairing
    Xueying Du, Mingwei Liu, Hanlin Wang, and 3 more authors
    ACM Transactions on Software Engineering and Methodology