Qiming Bao

Education

University of Auckland (UoA), Auckland, New Zealand
Ph.D. in Computer Science, supervised by Prof. Michael Witbrock and Assoc. Prof. Jiamou Liu 02/20 – 12/24

DAAD AINeT fellow 2025 on Natural Language Processing
PhD Research Project Scholarship/Outstanding PhD Mentor
Graduate Teaching Assistant (Tutor)/Research Assistant (Professional Casual Staff)

University of Auckland (UoA), Auckland, New Zealand
Bachelor of Science (Honours) in Computer Science (First Class), GPA: 7/9 07/18 – 09/19

Precision Driven Health & Orion Health Summer Research Scholarship

China Jiliang University (CJLU), Hangzhou, China
Bachelor of Engineering, GPA: 4.11/5 09/14 – 07/18

Outstanding Graduate of Zhejiang Province
Honorable Mention in the American Mathematics Modelling Contest

Paper List

Qiming Bao, Juho Leinonen, Alex Yuxuan Peng, Wanjun Zhong, Tim Pistotti, Alice Huang, Paul Denny, Michael Witbrock, Jiamou Liu. Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models, Proceedings of the AAAI Conference on Artificial Intelligence (2025). https://ojs.aaai.org/index.php/AAAI/article/view/35164

Qiming Bao, Alex Peng, Zhenyun Deng, Wanjun Zhong, Gaël Gendron, Neşet Tan, Nathan Young, Yang Chen, Yonghua Zhu, Michael Witbrock, Jiamou Liu. Abstract Meaning Representation-Based Logic-Driven Data Augmentation for Logical Reasoning, The Findings of ACL (2024). https://doi.org/10.18653/v1/2024.findings-acl.353

Qiming Bao, Gaël Gendron, Alex Peng, Neset Tan, Michael Witbrock, Jiamou Liu. Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning, ICONIP (2024). https://doi.org/10.48550/arXiv.2310.09430

Qiming Bao, Alex Peng, Tim Hartill, Neset Tan, Zhenyun Deng, Michael Witbrock, Jiamou Liu. Multi-Step Deductive Reasoning Over Natural Language: An Empirical Study on Out-of-Distribution Generalisation, IJCLR-NeSy (2022). https://ceur-ws.org/Vol-3212/paper15.pdf

Nathan Young, Qiming Bao, Joshua Ljudo Bensemann, Michael J. Witbrock. AbductionRules: Training Transformers to Explain Unexpected Inputs, The Findings of ACL (2022). https://doi.org/10.18653/v1/2022.findings-acl.19

Gaël Gendron, Qiming Bao, Michael Witbrock, Gillian Dobbie. Large Language Models Are Not Strong Abstract Reasoners, IJCAI (2024). https://www.ijcai.org/proceedings/2024/693

Lin Ni, Qiming Bao, Xiaoxuan Li, Qianqian Qi, Paul Denny, Jim Warren, Michael Witbrock, Jiamou Liu. DeepQR: Neural-based Quality Ratings for Learnersourced Multiple-Choice Questions, Proceedings of the AAAI Conference on Artificial Intelligence (2022). https://doi.org/10.1609/aaai.v36i11.21562

Qianqian Qi, Qiming Bao*, Alex Yuxuan Peng, Jiamou Liu, Michael Witbrock. A Dynamic Prompt-tuning Method for Data Augmentation with Associated Knowledge, ICLR TinyPapers (2023). https://openreview.net/pdf?id=hli7A0ioiS_

Qiming Bao, Lin Ni, Jiamou Liu. HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention, ACSW (2020). https://doi.org/10.1145/3373017.3373049

Zhongsheng Wang, Jiamou Liu, Qiming Bao, Hongfei Rong, Jingfeng Zhang. ChatLogic: Integrating Logic Programming with Large Language Models for Multi-step Reasoning, NucLeaR@AAAI (2024). https://doi.org/10.48550/arXiv.2407.10162

Neset Tan, Trung Nguyen, Josh Bensemann, Alex Peng, Qiming Bao, Yang Chen, Mark Gahegan, Michael Witbrock. Multi2Claim: Generating Scientific Claims from Multi-Choice Questions for Scientific Fact-Checking, EACL (2023). https://doi.org/10.18653/v1/2023.eacl-main.194

Neset Tan, Alex Peng, Joshua Bensemann, Qiming Bao, Tim Hartill, Mark Gahegan, Michael Witbrock. Input-length-shortening and text generation via attention values, AAAI-EMC^2 (2023). https://doi.org/10.48550/arXiv.2303.07585

Work & Project Experience

Large Language Model and Logical Reasoning (Ph.D. Main Topic) UoA, Auckland, New Zealand
Research & Development Project Leader/Developer 02/20 – 12/24

Recipient of research funding for the project Strong AI Lab (Grant No. 5000675), awarded by the Tertiary Education Commission under the Entrepreneurial Research Funding program, with a total grant amount of NZD 9.6 million. Qiming Bao was primarily responsible for the logical reasoning research direction within this project.
We developed an LLM-based iterative enhancement framework for generating explanations. The framework alternates between an explanation generation module and an explanation evaluation module to improve the quality of the generated explanations. The paper was accepted by AAAI Proceedings (2025) and AGI@ICLR (2024). Paper and source code are available.
Our method "AMR-LDA" (GPT-4 + AMR-LDA Prompt Augmentation) achieved #1 on the ReClor leaderboard. We were the first group worldwide to score above 90% on the hidden test set. The paper was accepted by the Findings of ACL-24 and LLM@IJCAI'23. Paper, source code, and model weights are available.
We evaluated generative and discriminative large language models on out-of-distribution logical reasoning tasks. While they excel on standard tasks, minor changes to the task lead to notable performance drops, indicating insufficient reasoning capabilities. The paper was accepted by LLM@IJCAI'23. Paper and source code are available.
To address depth imbalance in multi-step reasoning datasets and enhance model performance, we created the IMA-GloVe-GA model, combining DeepLogic with Gate Attention. Additionally, we developed a larger dataset, PARARULE-Plus, for deep multi-step reasoning over natural language. The paper, code, data, and presentation recording were published at IJCLR-NeSy-22.
We built a dataset called AbductionRules to improve Transformer performance on tasks requiring abductive reasoning. The paper, code, and data were published in the Findings of ACL-22.
The PARARULE-Plus (multi-step deductive reasoning) and AbductionRules (abductive reasoning) datasets have been collected and incorporated into LogiTorch.ai, ReasoningNLP, Prompt4ReasoningPapers, OpenAI/Evals, A Survey on Evaluation of Large Language Models, and Reasoning Language Models: A Blueprint.

Enhancing Max Sequence Length in Large Multimodal Models Xtracta (Accredited Employer), Auckland, New Zealand
Artificial Intelligence Researcher/Engineer 07/22 – now

Investigated and implemented alternative attention mechanisms to extend the effective sequence length in multi-modal document processing models such as LayoutLMv3 and ERNIE-LayoutX.
Applied the sliding-window technique and global attention mask from Longformer to extend the maximum sequence length from 512 to 4096, enabling LayoutLMv3 and ERNIE-LayoutX to achieve higher F1 scores on XFUND, FUNSD, and internal company datasets without significantly increasing GPU memory usage.
Replicated the multi-task, multimodal pre-training code for LayoutLMv3, which Microsoft did not open source, including masked language modeling, masked image modeling, and word-patch alignment.
Integrated DeepSpeed and adapters into ERNIE-LayoutX and LayoutLMv3, reducing training costs, shrinking model size, and simplifying deployment to the production environment.
Successfully applied for the Research & Development Tax Incentive (RDTI) grants from Callaghan Innovation (New Zealand's Innovation Agency) for both 2022 and 2023, each offering a tax credit equal to 15% of eligible R&D expenditure. This credit can be utilised to reduce the income tax payable by the company.
Integrated Flash-Attention 2 into the self-attention layers of ERNIE-LayoutX, reducing peak training GPU memory usage by up to 50% under FP16.
Applied affine transformations for data augmentation during training to improve robustness to line-alignment issues in document extraction.
Used the PEFT adapter, Flash-Attention 2, and GPTQ int4 quantization to continue training Qwen2-VL-7B on a single A4090 GPU (within 24 GB of GPU memory).
Added page embeddings to vision-language models (Qwen2.5-VL and ERNIE-LayoutX), improving performance by more than 15% on fields that frequently appear on each page of a multi-page document, such as supplier names or bank names.

Abstract Extraction and Multi-Turn Dialogue System Advanced Institute of Information Technology, Peking University, Hangzhou, China
Research and Development Engineer 11/19 – 02/20

We researched and developed a robot-based system comprising automatic abstract extraction, text segmentation, theme prediction, and multi-turn question answering.
We investigated robot-related technologies and wrote the accompanying standards documentation.
We built a well-encapsulated API for processing meeting record documents, based on abstract extraction, text segmentation, and theme prediction.

HHH: An Online Medical Chatbot System Precision Driven Health & Orion Health, Auckland, New Zealand
Research Project Leader and Developer 11/18 – 04/19

We developed a medical text similarity algorithm called HBAM using a pre-trained language model and a knowledge graph.
HBAM achieves higher test accuracy than both the BERT and MaLSTM deep learning models. Code (90+ stars), news coverage, a recording, and the paper published at ACSW-20 (70+ citations) are available.

Invited Speaker/Visiting Scholar

Microsoft Research Asia Invited Talk 2022 (Invitation Letter) (Presentation Slide) (Recording)

Samsung AI Center Cambridge UK Invited Talk 2022 (Invitation Letter) (Presentation Slide) (Recording)

IEEE Vehicular Technology Society (VTS) New Zealand North Chapter and IEEE New Zealand North Section SIGHT Group 2022 (Invitation Letter) (Presentation Slide) (Recording)

ZJU-NLP Group, Zhejiang University 2023

NLP Group, The University of Melbourne Invited Talk 2023 (Invitation Letter) (Presentation Slide)

Institute of Automation, Chinese Academy of Sciences Invited Talk 2023 (Invitation Letter) (Presentation Slide)

University of Massachusetts - Amherst Invited Talk 2024 (Invitation Letter) (Presentation Slide)

Penn State University & University of Auckland Online Workshop 2024 Day 1 Session 2 Children's Future, Intercultural Learning (Invitation Letter) (Presentation Slide) (Recording)

Logic and AI Seminar 2025 (Peking University & Tsinghua University) invited by Prof. Fenrong Liu and A/Prof. Haoxuan Li

Max Planck Institute For Software Systems invited by Prof. Adish Singla (Invitation Letter)

Technical University of Munich invited by Dr. Stefan Fuchs
