2024
- Zihan Qiu*, Zeyu Huang*, Youcheng Huang*, Jie Fu: Empirical Study on Updating Key-Value Memories in Transformer Feed-forward Layers. Tiny paper at ICLR 2024.
- Wenyu Du*, Shuang Cheng*, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu: Unlocking Continual Learning Abilities in Language Models. EMNLP 2024 Findings
- Wenyu Du*, Tongxu Luo*, Zihan Qiu, Zeyu Huang, Yikang Shen, Reynold Cheng, Yike Guo, Jie Fu: Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training. NeurIPS 2024 Spotlight
2023
- Zihan Qiu*, Zeyu Huang*, Jie Fu: Unlocking Emergent Modularity in Large Language Models. NAACL 2024, Outstanding Paper Award.
- Zeyu Huang, Yikang Shen, Xiaofeng Zhang, Jie Zhou, Wenge Rong, Zhang Xiong: Transformer-Patcher: One Mistake Worth One Neuron. ICLR 2023
- Zeyu Huang, Xiaofeng Zhang, Jun Bai, Wenge Rong, Yuanxin Ouyang, Zhang Xiong: Solving Math Word Problems Following Logically Consistent Template. IJCNN 2023: 1-8
2022
- Zeyu Huang, Wenge Rong, Xiaofeng Zhang, Yuanxin Ouyang, Chenghua Lin, and Zhang Xiong. 2022. Token Relation Aware Chinese Named Entity Recognition. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 22, 1, Article 24 (January 2023), 21 pages. https://doi.org/10.1145/3531534
- Xiaofeng Zhang*, Yikang Shen*, Zeyu Huang, Jie Zhou, Wenge Rong, Zhang Xiong: Mixture of Attention Heads: Selecting Attention Heads Per Token. EMNLP 2022: 4150-4162