Rohit Saxena

prof_pic.jpg

rohit.saxena@ed.ac.uk

University of Edinburgh

I am a final year PhD student at the University of Edinburgh. I am advised by Prof Frank Keller. I also collaborate with Prof Pasquale Minervini and Prof Hao Tang. I am affiliated with EdinburghNLP and CDT NLP.

My research interests broadly lie in natural language understanding and multimodal learning, with a focus on long-context modeling and the intersection of vision and language.

Previously, I worked as a Researcher at Tata Research Development and Design Center (TRDDC), TCS Research, Pune India. At TRDDC, I was part of Media and Entertainment Research group and was advised by Niranjan Pedanekar. My work there involved emotion detection in dialogues, analysing TV viewers’ attention, and style transfer for advertisements. Additionally, I interned as an Applied Scientist at Amazon Web Services (AWS) in Seattle, where I worked on mitigating hallucinations in large language models.

In my free time, I enjoy photography.

news

Mar 13, 2025 Paper accepted at ICLR 2025 Workshop.
Jan 22, 2025 Paper accepted at NAACL 2025.
Aug 14, 2024 Presented work at ACL 2024.
Jun 19, 2024 Presented work at NAACL 2024
May 16, 2024 Paper accepted at ACL Findings 2024.

selected publications

  1. Under Review
    End-to-End Long Document Summarization using Gradient Caching
    Rohit Saxena, Hao Tang, and Frank Keller
    2025
  2. ICLR 2025 Workshop
    Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs
    Rohit Saxena, Aryo Pradipta Gema, and Pasquale Minervini
    2025
  3. Under Review
    PosterSum: A Multimodal Benchmark for Scientific Poster Summarization
    Rohit Saxena, Pasquale Minervini, and Frank Keller
    2025
  4. ACL 2024
    MovieSum: An Abstractive Summarization Dataset for Movie Screenplays
    Rohit Saxena, and Frank Keller
    In Findings of the Association for Computational Linguistics ACL 2024, Aug 2024