Zhou, Ej

Spelt Yijie · read Ej

About

I am a second-year PhD student at the Language Technology Lab, University of Cambridge, advised by Prof. Anna Korhonen.

My PhD focuses on Natural Language Processing — in particular, multilingual interpretability of large language models.

Publications

Beyond the Final Layer: Intermediate Representations for Better Multilingual Calibration in Large Language Models
Ej Zhou, Caiqi Zhang, Tiancheng Hu, Chengzu Li, Nigel Collier, Ivan Vulić, Anna Korhonen
In submission — EMNLP 2026 · Spotlight · COLM 2025 MELT · NeurIPS 2025 MechInterp
Dial HEALTHDIAL for Advice: A Multilingual and Multi-Parallel Spoken Dialogue Dataset for Evidence-Based Public Health Communication
Songbo Hu*, Yinhong Liu*, Ej Zhou*, Evgeniia Razumovskaia, Xiaobin Wang, Alexander Fraser, Ivan Vulić**, Anna Korhonen**
ACL 2026 Findings
Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencoder Interpretation Framework
Jiaqi Weng, Han Zheng, Hanyu Zhang, Ej Zhou, Qinqin He, Jialing Tao, Hui Xue, Zhixuan Chu, Xiting Wang
ACL 2026 Findings
Bias Beyond English: Evaluating Social Bias and Debiasing Methods in a Low-Resource Setting
Ej Zhou, Weiming Lu
NLPCC 2025 Main · ACL 2025 GeBNLP
Revisiting Cross-Lingual Summarization: A Corpus-based Study and a New Benchmark with Improvement Annotation
Yulong Chen, Huajian Zhang, Yijie Zhou, Xuefeng Bai, Yueguan Wang, Ming Zhong, Jianhao Yan, Yafu Li, Judy Li, Michael Zhu, Yue Zhang
ACL 2023 Long
ODSUM: New Benchmarks for Open Domain Multi-Document Summarization
Yijie Zhou, Kejian Shi, Wencai Zhang, Yixin Liu, Yilun Zhao, Arman Cohan
arXiv 2309.08960

Education

Before my PhD, I completed a B.Eng. (Honours) in Computer Science and Engineering at Zhejiang University (Chu Kochen Honors College). During my undergraduate studies I was supervised by Prof. Yue Zhang as a research intern at Westlake University, spent a semester as an exchange student at Cornell University working as a research assistant in CommCollabTech, and completed a research internship at Yale University under Prof. Arman Cohan. I was also a visiting student at the University of Oxford.

University of Cambridge
Sept 2024 — Jun 2028 (expected)
PhD in Computation, Cognition and Language · Hughes Hall · advised by Prof. Anna Korhonen
Zhejiang University
Sept 2020 — Jun 2024
B.Eng. (Honours), Computer Science · GPA 3.99 / 4 · Chu Kochen Honors College
Cornell University
Jan 2023 — Jun 2023
Exchange student — CommCollabTech
University of Oxford
Sept 2023 — Jun 2024
Visiting student — St Hilda's College

Experience

Research Intern — Yale
Apr 2023 — Oct 2023
Advised by Prof. Arman Cohan. Benchmarked open-domain multi-document summarization and robustness; first-author paper: arXiv 2309.08960.
Research Assistant — Cornell
Jan 2023 — Jun 2023
In the CommCollabTech Lab. Built a persuasive agent for misinformation detection and contributed to the Cornell Language Expansion Program.
Research Intern — Westlake University
Sept 2022 — Jan 2023
Advised by Prof. Yue Zhang. Contributed to a cross-lingual summarization benchmark (ACL 2023 Long).

Language et al.

Wu is my Muttersprache, but my education was in Mandarin. Apart from that, I speak some French, Japanese, Russian, English and German to varying degrees. I am learning Polish and Portuguese this year. I am passionate about languages, and that passion has always motivated my work on NLP.

Some other things: I am a cinema enthusiast and I was at the 76th–78th Festival de Cannes (and I am going again this year, for the 79th!); I consider myself a Buddhist; I trained in Guqin at the Xihu Qinshe and taught it at ZJU for a year.

Link to my blog ↗

गते गते पारगते पारसंगते बोधि स्वाहा