Rohit Saxena

rohit.saxena@ed.ac.uk
University of Edinburgh
I am a final-year PhD student at the University of Edinburgh, advised by Prof Frank Keller. I also collaborate with Prof Pasquale Minervini and Prof Hao Tang. I am affiliated with EdinburghNLP and CDT NLP.
My research interests broadly lie in natural language understanding and multimodal learning, with a focus on long-context modeling and the intersection of vision and language.
Previously, I worked as a Researcher at Tata Research Development and Design Center (TRDDC), TCS Research, in Pune, India. At TRDDC, I was part of the Media and Entertainment Research group and was advised by Niranjan Pedanekar. My work there involved emotion detection in dialogues, analysing TV viewers’ attention, and style transfer for advertisements. I also interned as an Applied Scientist at Amazon Web Services (AWS) in Seattle, where I worked on mitigating hallucinations in large language models.
In my free time, I enjoy photography.
news
Date | News |
---|---|
Mar 13, 2025 | Paper accepted at ICLR 2025 Workshop. |
Jan 22, 2025 | Paper accepted at NAACL 2025. |
Aug 14, 2024 | Presented work at ACL 2024. |
Jun 19, 2024 | Presented work at NAACL 2024. |
May 16, 2024 | Paper accepted at ACL Findings 2024. |
selected publications
- [Under Review] End-to-End Long Document Summarization using Gradient Caching. 2025.
- [ICLR 2025 Workshop] Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs. 2025.
- [Under Review] PosterSum: A Multimodal Benchmark for Scientific Poster Summarization. 2025.
- [ACL 2024] MovieSum: An Abstractive Summarization Dataset for Movie Screenplays. In Findings of the Association for Computational Linguistics: ACL 2024, Aug 2024.