Anwoy Chatterjee

I am a third-year PhD student at IIT Delhi, advised by Prof. Tanmoy Chakraborty. My research focuses on making Large Language Models (LLMs) more robust, reliable, and interpretable, part of a broader mission to build safe and transparent AI systems.

I am grateful to be supported by the Google PhD Fellowship. I also collaborate closely with Dr. Sumit Bhatia and have previously interned (twice) at the Media and Data Science Research (MDSR) Lab at Adobe Inc.

Before my PhD, I earned my Bachelor's in Computer Science and Engineering from IIT(BHU), Varanasi where I worked on Computer Vision and Graph Neural Networks with Prof. Rajeev Srivastava and Prof. Pratik Chattopadhyay.

Email  |  CV  |  DBLP  |  Google Scholar  |  LinkedIn  |  X  |  Bluesky  |  GitHub

profile photo

Recent Updates

  • August 2025: Our paper on cultural evaluation of multilingual language models got accepted to EMNLP 2025. This was a nice collaboration with the UCSB NLP Group.
  • July 2025: Our paper on the effect of instruction tuning loss on the generalization capability of language models got accepted to the Transactions of the Association for Computational Linguistics (TACL). This work is yet another outcome of my internship at Adobe!
  • January 2025: Started my (second) internship at Adobe's Media and Data Science Research (MDSR) Lab.
  • November 2024: Delivered an invited talk at Google Deepmind, Bangalore.
  • November 2024: Presented our work at Amazon Research Days 2024.
  • September 2024: Our paper on a novel prompt sensitivity index got accepted to EMNLP (Findings) 2024. This work was done during my internship at Adobe.
  • August 2024: Awarded Google PhD Fellowship 2024 in NLP.
  • May 2024: Our paper on cross-task in-context learning got accepted to ACL 2024.

Education

IIT Delhi Logo Doctor of Philosophy in Artificial Intelligence
Indian Institute of Technology, Delhi
2022 – Present
IIT BHU Logo Bachelor of Technology in Computer Science and Engineering
Indian Institute of Technology (BHU), Varanasi
2018 – 2022

Experience

Adobe Logo Research Intern
Media and Data Science Research (MDSR) Lab, Adobe Inc.
Mentor: Dr. Sumit Bhatia
January 2025 – July 2025
Adobe Logo Research Intern
Media and Data Science Research (MDSR) Lab, Adobe Inc.
Mentor: Dr. Sumit Bhatia
May 2024 – August 2024

Research

I'm interested in natural language processing, deep learning, generative AI, and model interpretability. My current research revolves around understanding the workings of large language models, making them more robust and reliable, and enabling their effective in-context adaptation in low-resource settings.

Publications

HIDE and Seek: Detecting Hallucinations in Language Models via Decoupled Representations
Anwoy Chatterjee, Yash Goel, Tanmoy Chakraborty
Preprint (Under Review), 2025
paper  /  code  /  bibtex
Do You Know About My Nation? Investigating Multilingual Language Models’ Cultural Literacy Through Factual Knowledge
Eshaan Tanwar, Anwoy Chatterjee, Michael Saxon, Alon Albalak, William Yang Wang, Tanmoy Chakraborty
EMNLP, 2025
paper  /  code  /  bibtex
On the Effect of Instruction Tuning Loss on Generalization
Anwoy Chatterjee*, H S V N S Kowndinya Renduchintala*, Sumit Bhatia, Tanmoy Chakraborty
Transactions of the Association for Computational Linguistics (TACL), 2025
paper  /  code  /  bibtex
POSIX: A Prompt Sensitivity Index For Large Language Models
Anwoy Chatterjee*, H S V N S Kowndinya Renduchintala*, Sumit Bhatia, Tanmoy Chakraborty
EMNLP (Findings), 2024
paper  /  code  /  video  /  bibtex
Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks
Anwoy Chatterjee*, Eshaan Tanwar*, Subhabrata Dutta, Tanmoy Chakraborty
ACL, 2024
paper  /  code  /  video  /  bibtex

Miscellanea

Recorded
Lectures/
Tutorials/
Talks

Panel discussion on the current state of LLM research in academia and industry.
Lecture on "Interpretability: Demystifying the Black-Box LMs" as part of ELL881/AIL821 at IIT Delhi.
Virtual presentation of our EMNLP'24 paper - "POSIX: A Prompt Sensitivity Index For Large Language Models".
Virtual presentation of our ACL'24 paper - "Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks".


Teaching

Graduate Teaching Assistant, ELL8299/ELL881/AIL861 (Advanced Large Language Models), Fall 2025
Teaching Assistant, Introduction to Large Language Models, NPTEL (January 2025, July 2025)
Graduate Teaching Assistant, ELL884 (Deep Learning for NLP), Spring 2025
Graduate Teaching Assistant, ELL881/AIL821 (Large Language Models: Introduction and Recent Advances), Fall 2024
Graduate Teaching Assistant, ELL880 (Social Network Analysis), Fall 2023

Talks/
Posters

Invited talk at Google Deepmind, Bangalore, India, Nov 22, 2024.
Poster presentation at Amazon Research Days 2024, Bangalore, India.
Poster presentation at ACL 2024, Bangkok, Thailand.

Design and source code from Jon Barron's website