Abhishek Kar

I am currently an applied research lead at Google AR. where we work on problems at the intersection of 3D computer vision, computer graphics, computational photography and machine learning. Some features I have worked on and shipped at Google include the ARCore Depth API, Cinematic Memories for Google Photos and Pixel and portrait mode for Google Pixel.

Prior to Google, I was the Director of Machine Learning at Fyusion Inc., a spatial photography startup based in San Francisco where we shipped multiple 3D technologies including casual light field capture, AI-driven damage estimation, creation of user generated AR/VR content and real-time style transfer on mobile devices. I graduated from UC Berkeley in 2017 from Jitendra Malik's group working on learnt 3D object reconstruction. I have also spent time at Microsoft Research and Adobe Research.

Email / CV / Google Scholar / LinkedIn

Publications

My primary research interests lie in 3D computer vision and computational photography. Specifically, I am excited about applied research problems with the potential to scale to billions of users.

[NEW] NeRFiller: Completing Scenes via Generative 3D Inpainting
Ethan Weber, Aleksander Hołyński, Varun Jampani, Saurabh Saxena, Noah Snavely, Abhishek Kar, Angjoo Kanazawa
Computer Vision and Pattern Recognition (CVPR), 2024

project / paper / abstract / bibtex

@inproceedings{weber2023nerfiller,
  title = {NeRFiller: Completing Scenes 
    via Generative 3D Inpainting},
  author = {Ethan Weber and 
    Aleksander Holynski and 
    Varun Jampani and 
    Saurabh Saxena and
    Noah Snavely and 
    Abhishek Kar and 
    Angjoo Kanazawa},
  booktitle = {CVPR},
  year = {2024},
}
}
                  

[NEW] Probing the 3D Awareness of Visual Foundation Models
Mohamed El Banani, Amit Raj, Kevis-Kokitsi Maninis, Abhishek Kar, Yuanzhen Li, Michael Rubinstein, Deqing Sun, Leonidas Guibas, Justin Johnson, Varun Jampani
Computer Vision and Pattern Recognition (CVPR), 2024

paper / abstract / bibtex

@inproceedings{elbanani2024probe3d,
title = {{Probing the 3D Awareness of 
  Visual Foundation Models}},
author = {El Banani, Mohamed and 
  Raj, Amit and 
  Maninis, Kevis-Kokitsi and 
  Kar, Abhishek and 
  Li, Yuanzhen and 
  Rubinstein, Michael and 
  Sun, Deqing and 
  Guibas, Leonidas and 
  Johnson, Justin and 
  Jampani, Varun},
booktitle = {CVPR},
year = {2024},
}
                  

[NEW] SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Andreas Engelhardt, Amit Raj, Mark Boss, Yunzhi Zhang, Abhishek Kar, Yuanzhen Li, Ricardo Martin Brualla, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani
Computer Vision and Pattern Recognition (CVPR), 2024

project / video / paper / abstract / bibtex

@inproceedings{engelhardt2024-shinobi,
  author = {Engelhardt, Andreas and 
    Raj, Amit and
    Boss, Mark and
    Zhang, Yunzhi and
    Kar, Abhishek and 
    Li, Yuanzhen and 
    Sun, Deqing and 
    Martin Brualla, Ricardo and
    Barron, Jonathan T. and 
    Lensch, Hendrik P.A. and 
    Jampani, Varun},
    title = {SHINOBI: Shape and Illumination
      using Neural Object Decomposition 
      via BRDF Optimization In-the-wild},
    venue={Computer Vision and Pattern Recognition (CVPR)},
  year = {2024}
}
                  

[NEW] Accelerating Neural Field Training via Soft Mining
Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi
Computer Vision and Pattern Recognition (CVPR), 2024

project / paper / abstract / bibtex

@inproceedings{kheradmand2024softmining,
  title={Accelerating Neural Field Training via Soft Mining},
  year={2024},
  venue={Computer Vision and Pattern Recognition (CVPR)},
  arxiv={https://arxiv.org/abs/2312.00075},
  authors={Shakiba Kheradmand and 
    Daniel Rebain and 
    Gopal Sharma and 
    Hossam Isack and 
    Abhishek Kar and 
    Andrea Tagliasacchi and 
    Kwang Moo Yi}
}
}
      
                  

[NEW] Unsupervised Keypoints from Pretrained Diffusion Models
Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Helge Rhodin, Andrea Tagliasacchi, Kwang Moo Yi
Computer Vision and Pattern Recognition (CVPR), 2024

project / paper / abstract / bibtex

@inproceedings{hedlin2024keypoints,
  title={Unsupervised Keypoints from 
    Pretrained Diffusion Models},
  year={2024},
  venue={Computer Vision and Pattern Recognition (CVPR)},
  arxiv={https://arxiv.org/abs/2312.00065},
  authors={Eric Hedlin and 
    Gopal Sharma and 
    Shweta Mahajan and 
    Hossam Isack and 
    Abhishek Kar and 
    Helge Rhodin and 
    Andrea Tagliasacchi and 
    Kwang Moo Yi}
}
}
      
                  

Unsupervised Semantic Correspondence Using Stable Diffusion
Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi
Neural Information Processing Systems (NeurIPS), 2023

project / paper / abstract / bibtex

@inproceedings{hedlin2023unsupervised,
  title={Unsupervised Semantic Correspondence
    Using Stable Diffusion},
  author={Eric Hedlin and 
    Gopal Sharma and 
    Shweta Mahajan and 
    Hossam Isack and 
    Abhishek Kar and 
    Andrea Tagliasacchi and 
    Kwang Moo Yi},
  booktitle={arXiv preprint},
  year={2023},
  publisher_page={https://arxiv.org/abs/2305.15581},
  homepage={https://ubc-vision.github.io/LDM_correspondences/}
}
      
                  

The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation
Saurabh Saxena, Charles Hermnann, Junwa Hur, Abhishek Kar, Mohammad Norouzi, Deqing Sun, David J. Fleet
Neural Information Processing Systems (NeurIPS), 2023

project / paper / previous version / abstract / bibtex

@misc{saxena2023surprising,
title={The Surprising Effectiveness of Diffusion 
  Models for Optical Flow and Monocular Depth 
  Estimation},
author={Saurabh Saxena and 
  Charles Herrmann and 
  Junhwa Hur and 
  Abhishek Kar and 
  Mohammad Norouzi and 
  Deqing Sun and 
  David J. Fleet},
year={2023},
eprint={2306.01923},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
      
                  

ASIC: Aligning Sparse in-the-wild Image Collections
Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar
International Conference on Computer Vision (ICCV), 2023

project / arxiv / abstract / video / bibtex

@inproceedings{gupta2023asic,
author ={Gupta, Kamal and 
  Jampani, Varun and 
  Esteves, Carlos and 
  Shrivastava, Abhinav and 
  Makadia, Abhinav and 
  Snavely, Noah and 
  Kar, Abhishek},
  title = {ASIC: Aligning Sparse 
    in-the-wild Image Collections},
booktitle={Proceedings of the IEEE 
International Conference on Computer Vision},
  year = {2023},
}
                

LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs
Zezhou Cheng, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia
International Conference on Computer Vision (ICCV), 2023

project / paper / abstract / bibtex

@inproceedings{cheng2023lunerf,
title={LU-NeRF: Scene and Pose Estimation 
  by Synchronizing Local Unposed NeRFs},
author={Cheng, Zezhou and 
  Esteves, Carlos and 
  Jampani, Varun and 
  Kar, Abhishek and 
  Maji, Subhransu and 
  Makadia, Ameesh},
booktitle={Proceedings of the IEEE 
International Conference on Computer Vision},
year={2023}
}
      
                  

DC^2: Dual-Camera Defocus Control by Learning to Refocus
Hadi Alzayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar
Computer Vision and Pattern Recognition (CVPR), 2023

project / arxiv / abstract / video / two minute papers / bibtex

@inproceedings{alzayer2023defocuscontrol,
title={DC2: Dual-Camera Defocus Control by 
  Learning to Refocus},
author={Alzayer, Hadi and 
  Abuolaim, Abdullah and 
  Chun Chan, Leung and 
  Yang, Yang and 
  Chen Lou, Ying and 
  Huang, Jia-Bin and 
  Kar, Abhishek},
booktitle={Proceedings of the IEEE/CVF 
  Conference on Computer Vision and 
  Pattern Recognition},
pages={--},
year={2023}
}
              

Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming
Ruofei Du, Na Li, Jing Jin, Michelle Carney, Scott Miles, Maria Kleiner, Xiuxiu Yuan, Yinda Zhang, Anuva Kulkarni, Xingyu "Bruce" Liu, Ahmed Sabie, Sergio Escolano, Abhishek Kar, Ping Yu, Ram Iyengar, Adarsh Kowdle, and Alex Olwal Proceedings of the Conference on Human Factors in Computing Systems (CHI), 2023
Best Paper Honorable Mention

project / blog / demo / video / paper / bibtex
@inproceedings{Du2023Rapsai,
  title = {{Rapsai: Accelerating Machine Learning Prototyping
    of Multimedia Applications Through Visual Programming}},
  author = {Du, Ruofei and Li, Na and 
    Jin, Jing and Carney, Michelle and 
    Miles, Scott and Kleiner, Maria and 
    Yuan, Xiuxiu and Zhang, Yinda and
    Kulkarni, Anuva and Liu, Xingyu and 
    Sabie, Ahmed and Escolano, Sergio and 
    Kar, Abhishek and Yu, Ping and 
    Iyengar, Ram and Kowdle, Adarsh and 
    Olwal, Alex},
  booktitle = {Proceedings of the 2023 CHI 
    Conference on Human Factors in Computing Systems},
  year = {2023},
  publisher = {ACM},
  series = {CHI},
  doi = {10.1145/3544548.3581338},
}

                  

SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections
Mark Boss, Andreas Engelhardt, Abhishek Kar, Yuanzhen Li, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani
Neural Information Processing Systems (NeurIPS), 2022

project / video / paper / abstract / bibtex

@inproceedings{boss2022-samurai,
    author = {Boss, Mark and 
      Engelhardt, Andreas and 
      Kar, Abhishek and 
      Li, Yuanzhen and 
      Sun, Deqing and 
      Barron, Jonathan T. and 
      Lensch, Hendrik P.A. and 
      Jampani, Varun},
    title = {{SAMURAI}: {S}hape {A}nd {M}aterial 
    from {U}nconstrained {R}eal-world {A}rbitrary 
    {I}mage collections},
    booktitle = {Advances in Neural Information 
      Processing Systems (NeurIPS)},
    year = {2022}
}
                  

Learned Monocular Depth Priors in Visual-Inertial Initialization
Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, Konstantine Tsotsos
European Conference on Computer Vision (ECCV), 2022

project / video / paper / abstract / bibtex

@inproceedings{verse:monodepth-vio-init-eccv2022,
author = {Yunwen Zhou, 
  Abhishek Kar, 
  Eric Turner, 
  Adarsh Kowdle,
  Chao X. Guo, 
  Ryan C. DuToit, 
  and Konstantine Tsotsos},
title = {Learned Monocular Depth Priors 
  in Visual-Inertial Initialization},
booktitle = {European Conference on Computer Vision},
year = {2022},
}
                  

SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting
Varun Jampani* , Huiwen Chang*, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu
International Conference on Computer Vision (ICCV), 2021 (Oral)

paper / project / supplementary / video / abstract / bibtex

@inproceedings{jampani:ICCV:2021,
	title = {SLIDE: Single Image 3D Photography with 
  Soft Layering and Depth-aware Inpainting},
	author = {Jampani, Varun and 
  Chang, Huiwen and 
  Sargent, Kyle and 
  Kar, Abhishek and 
  Tucker, Richard and 
  Krainin, Michael and 
  Kaeser, Dominik and 
  Freeman, William T and 
  Salesin, David and 
  Curless, Brian and 
  Liu, Ce},
	booktitle={Proceedings of the IEEE 
  International Conference on Computer Vision},
  year={2021}
}
                  

Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines
Ben Mildenhall*, Pratul Srinivasan*, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari, Ravi Ramamoorthi, Ren Ng, Abhishek Kar
SIGGRAPH, 2019

paper / project / code / video / abstract / bibtex / p r e s s

@article{mildenhall2019llff,
  title={Local Light Field Fusion: Practical View 
  Synthesis with Prescriptive Sampling Guidelines},
  author={Ben Mildenhall and 
  Pratul P. Srinivasan and 
  Rodrigo Ortiz-Cayon and 
  Nima Khademi Kalantari and 
  Ravi Ramamoorthi and 
  Ren Ng and 
  Abhishek Kar},
  journal={ACM Transactions on Graphics (TOG)},
  year={2019}
}

                  

Learning Independent Object Motion from Unlabelled Stereoscopic Videos
Zhe Cao, Abhishek Kar, Christian Häne, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2019

project / abstract / bibtex / arxiv

@incollection{sfCaoKHM2019,
author = {Zhe Cao and
Abhishek Kar and
Christian H\"ane and
Jitendra Malik},
title = {Learning Independent Object Motion 
from Unlabelled Stereoscopic Videos},
booktitle = CVPR,
year = {2019},
}
  
                  

Learning a Multi-View Stereo Machine
Abhishek Kar, Christian Häne, Jitendra Malik
Neural Information Processing Systems(NIPS), 2017

abstract / bibtex / supplementary / arxiv / blog / code

@incollection{lsmKarHM2017,
  author = {Abhishek Kar and
  Christian H\"ane and
  Jitendra Malik},
  title = {Learning a Multi-View Stereo Machine},
  booktitle = NIPS,
  year = {2017},
  }
  
                  

Learning Category-Specific Deformable 3D Models for Object Reconstruction
Shubham Tulsiani*, Abhishek Kar*, João Carreira, Jitendra Malik
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017

abstract / bibtex / project

@article{pamishapeTulsianiKCM15,
author = {Shubham Tulsiani and
Abhishek Kar and
Jo{\~{a}}o Carreira and
Jitendra Malik},
title = {Learning Category-Specific Deformable 3D
Models for Object Reconstruction},
journal = {TPAMI},
year = {2016},
}
                

The three R's of computer vision: Recognition, reconstruction and reorganization
Jitendra Malik, Pablo Arbelaez, João Carreira, Katerina Fragkiadaki,
Ross Girshick, Georgia Gkioxari, Saurabh Gupta, Bharath Hariharan, Abhishek Kar, Shubham Tulsiani
Pattern Recognition Letters, 2016

paper / abstract / bibtex

@article{malik2016three,
title={The three R's of computer vision:
  Recognition, reconstruction and reorganization},
author={Malik, Jitendra and
  Arbel{\'a}ez, Pablo and
  Carreira, Jo{\~a}o and
Fragkiadaki, Katerina and
Girshick, Ross and
Gkioxari, Georgia and
Gupta, Saurabh and
Hariharan, Bharath and
Kar, Abhishek and
Tulsiani, Shubham},
journal={Pattern Recognition Letters},
volume={72},
pages={4--14},
year={2016},
publisher={North-Holland}
}

                
sym

Shape and Symmetry Induction for 3D Objects
Shubham Tulsiani, Abhishek Kar, Qixing Huang, João Carreira, Jitendra Malik
arXiv:1511.07845, 2015

abstract / bibtex / arxiv

@incollection{shapeSymTulsianiKHCM15,
author = {Shubham Tulsiani and
Abhishek Kar and
Qixing Huang and
Jo{\~{a}}o Carreira and
Jitendra Malik},
title = {Shape and Symmetry Induction
for 3D Objects},
booktitle = arxiv:1511.07845,
year = {2015},
}
                
amodal

Amodal Completion and Size Constancy in Natural Scenes
Abhishek Kar, Shubham Tulsiani, João Carreira, Jitendra Malik
International Conference on Computer Vision (ICCV), 2015

abstract / supplementary / bibtex

@incollection{amodalKarTCM15,
author = {Abhishek Kar and
Shubham Tulsiani and
Jo{\~{a}}o Carreira and
Jitendra Malik},
title = {Amodal Completion and
Size Constancy in Natural Scenes},
booktitle = ICCV,
year = {2015},
}
                
basisshapes

Category-Specific Object Reconstruction from a Single Image
Abhishek Kar*, Shubham Tulsiani*, João Carreira, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2015 (Oral)
Best Student Paper Award

project page / abstract / bibtex / supplementary / code / arxiv

@incollection{categoryShapesKar15,
author = {Abhishek Kar and
Shubham Tulsiani and
Jo{\~{a}}o Carreira and
Jitendra Malik},
title = {Category-Specific Object
Reconstruction from a Single Image},
booktitle = CVPR,
year = {2015},
}
                

Virtual View Networks for Object Reconstruction
João Carreira, Abhishek Kar, Shubham Tulsiani, Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2015

abstract / bibtex / videos / arxiv

@incollection{vvnCarreira14,
author = {Jo{\~{a}}o Carreira and
Abhishek Kar and
Shubham Tulsiani and
Jitendra Malik},
title = {Virtual View Networks
for Object Reconstruction},
booktitle = CVPR,
year = {2015},
}
                

Looking At You: Fused Gyro and Face Tracking for Viewing Large Imagery on Mobile Devices
Neel Joshi, Abhishek Kar, Michael F. Cohen
ACM SIGCHI Conference on Human Factors in Computing Systems (CHI), 2012

abstract / bibtex / website / video

@inproceedings{joshi2012looking,
title={Looking at you: fused gyro and face
tracking for viewing large imagery on mobile devices},
author={Joshi, Neel and Kar, Abhishek and Cohen, Michael},
booktitle={Proceedings of the SIGCHI Conference
on Human Factors in Computing Systems},
pages={2211--2220},
year={2012},
organization={ACM}
}
                

Other Projects

Chemistry Studio: An Intelligent Tutoring System for the Periodic Table
Abhishek Kar*, Ankit Kumar*, Sumit Gulwani, Ashish Tiwari, Amey Karkare
Undergraduate Thesis, IIT Kanpur, 2012

slides / talk 1 / talk 2

Teaching

pacman

CS189: Introduction to Machine Learning - Spring 2013 (GSI)
Instructor: Prof. Jitendra Malik
Awarded the Outstanding GSI Award

CS188: Introduction to Artificial Intelligence - Spring 2014 (GSI)
Instructor: Prof. Pieter Abbeel


yet another Jon Barron website