I am a Research Scientist at Google (Mountain View, CA) in the Creative Camera group. Previously, I was a Postdoctoral Researcher at Stanford University. I hold a PhD in Computer Science from the Weizmann Institute of Science, supervised by Prof. Shimon Ullman. I studied how vision-language models function. Exploring their core mechanisms, strengths, and limitations - mainly by developing new data and training approaches
I earned my Masterās degree in Electrical Engineering from Tel Aviv University and my Bachelorās degree in Electrical Engineering from Ben-Gurion University of the Negev (BGU). In parallel with my academic journey, I have also worked at Google, Applied Materials and IBM Research.
I am actively looking for student collaborators in the area of multi-modal learning.
Contact: sdoveh [at] gmail [dot] com
Meta-Prompting for Automating Zero-shot Visual Recognition with LLMs
ECCV 2024
[ Paper | Project Page | Code ]