I am an
ELLIS Ph.D. student at
INSAIT where I am advised by
Prof. Luc Van Gool and currently a Machine Learning Researcher
Intern at Netflix.
From May to November 2024, I was a Student Researcher at Google DeepMind in Toronto,
working with Robert Geirhos.
Before my PhD journey began, I was a visiting researcher from 2021 to 2023 at CMU's
Human Sensing Lab working with the amazing
Fernando De La Torre. I also spent 7 wonderful years
at the University of Toronto's
Computer Science department
where I earned my HBSc and MS degrees.
I'm broadly interested in Generative Vision models for content creation,
and currently focused on Video synthesis. My research aims to gain a better
understanding of how to enable user-intuitive control over Generative models.
I am also interested in bias mitigation and harnessing the power of large vision
and language models by adapting them to solve personalized tasks using limited data.
Relevant work is highlighted here.
Research Projects
Generative Vision for Video Synthesis: Developing user-intuitive controls for generative video
models to enable interactive and personalized content creation.
Bias Mitigation in Vision Models: Investigating strategies to mitigate bias in generative models
while preserving their generalization capabilities.
Adaptation of Large Models: Adapting large-scale vision and language models to solve personalized
tasks efficiently using limited data.
A framework for defining control over latent-based generative models.
Talks
Invited talks and presentations.
Dec 16, 2025
Improving VLM's understanding of physically implausible scenes. Invited Talk • Vector Institute, Toronto Invited by Dr. Babak Taati • Journal
Club
Jan 30, 2025
How to benchmark video generative models for their physics understanding? Invited Talk • Stability AI Reading Group Invited by Rahim Entezari
Somewhat related to computer vision and content creation, I enjoy film photography on 35 mm and medium format film. You can
view some of my photos below.