About Me
Hi, my name’s Chengyue Jiang, PhD at ShanghaiTech University. I majored in Computer Science, and I’m doing research focusing on Natural Language Processing (NLP), and I’m supervised by Prof. Kewei Tu.
More specifically, My NLP research interests are
- use symbolic knowledge (e.g. Regular Expressions) to help NLP models reach better performance and have better interpretability on NLP tasks.
- information extraction, such as entity typing and named entity recognition.
- information retrieval, familiar with related techniques.
- investigate the ontological knowledge in large pretrained language models.
I'm open to the job market this Autumn, feel free to contact me through my email: jiangchy@shanghaitech.edu.cn
Education
- I recieve my Bachelor degree in Computer Science in ShanghaiTech University, (June 2015 - June 2019).
- And I’m now a PhD student in ShanghaiTech University, supervised by Prof. Kewei Tu.
Publication
ACL 2023
Chengyue Jiang, Wenyang Hui (Equal Contribution), Yong Jiang, Xiaobin Wang, Pengjun Xie, Kewei Tu “Recall, Expand and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing”. (Paper)
ACL 2023 (Outstanding Paper Award)
Weiqi Wu, Chengyue Jiang, Yong Jiang, Xiaobin Wang, Pengjun Xie, Kewei Tu “Do PLMs Know and Understand Ontological Knowledge?”. (Paper)
EACL 2023
EMNLP 2022
EMNLP 2021
EMNLP 2020
Findings of EMNLP 2020
CoNLL 2019
Xinyu Wang, Yixian Liu, Zixia Jia, Chengyue Jiang, and Kewei Tu. “ShanghaiTech at MRP 2019: Sequence-to-Graph Transduction with Second-Order Edge Inference for Cross-Framework Meaning Representation Parsing”. In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL 2019). (Paper) (Poster)
Experience
NLP Research Intern
Alibaba Damo Academy, Hangzhou
August 2021 -- June 2023
Doing some research about large language model (LLM) and information extraction. Act as the core developer of the entity typing part of the Adaseq Project in DAMO academy.
Software Intern
Honeywell, Nanjing
July 2018 -- September 2018
Doing some early research on using image captioning to help fault detection. Using simple sequence to sequence architecture, and implemented using Keras.
Web Intern
Boonray Technology, Shanghai
July 2016 -- September 2016
Software engineer internship. Participating in developing a webapp using Django backend and MySQL. The webapp is a simple cloud platform, used for managing costumers.
Projects
Adaseq
one of core contributors
Adaseq is the open source sequence understanding libarary presented by DAMO NLP. It contains SOTA Named Entity Recognition,
Entity Typing, Relation Extraction models. I act as the core developer of the entity typing part of the Adaseq Project in DAMO academy.
Implemente and replicate entity typing models including multi-label MLC, NPCRF, MCCE, Prompt Learning, etc.
RegExp to Neural Network
author
The source codes(RE2NN,RE2NN-SEQ) of my papers “Cold-start and Interpretability: Turning Regular Expressions into Trainable Recurrent Neural Networks” and “Neuralizing Regular Expressions for Slot Filling”, reaches over 130 stars on GitHub. The repos change regular expressions for text classification and slot filling into neural networks.
Services
- ACL2023 EACL2023 AACL2022 ICLR2022