Abhishek Kar

I am currently an applied research lead at Google AR. where we work on problems at the intersection of 3D computer vision, computer graphics, computational photography and machine learning. Some features I have worked on and shipped at Google include the ARCore Depth API, Cinematic Memories for Google Photos and Pixel and portrait mode for Google Pixel.

Prior to Google, I was the Director of Machine Learning at Fyusion Inc., a spatial photography startup based in San Francisco where we shipped multiple 3D technologies including casual light field capture, AI-driven damage estimation, creation of user generated AR/VR content and real-time style transfer on mobile devices. I graduated from UC Berkeley in 2017 from Jitendra Malik's group working on learnt 3D object reconstruction. I have also spent time at Microsoft Research and Adobe Research.

Email / CV / Google Scholar / LinkedIn

Publications

My primary research interests lie in 3D computer vision and computational photography. Specifically, I am excited about applied research problems with the potential to scale to billions of users.

	NeRFiller: Completing Scenes via Generative 3D Inpainting Ethan Weber, Aleksander Hołyński, Varun Jampani, Saurabh Saxena, Noah Snavely, Abhishek Kar, Angjoo Kanazawa Computer Vision and Pattern Recognition (CVPR), 2024 project / paper / abstract / bibtex @inproceedings{weber2023nerfiller, title = {NeRFiller: Completing Scenes via Generative 3D Inpainting}, author = {Ethan Weber and Aleksander Holynski and Varun Jampani and Saurabh Saxena and Noah Snavely and Abhishek Kar and Angjoo Kanazawa}, booktitle = {CVPR}, year = {2024}, } }
	Probing the 3D Awareness of Visual Foundation Models Mohamed El Banani, Amit Raj, Kevis-Kokitsi Maninis, Abhishek Kar, Yuanzhen Li, Michael Rubinstein, Deqing Sun, Leonidas Guibas, Justin Johnson, Varun Jampani Computer Vision and Pattern Recognition (CVPR), 2024 paper / abstract / bibtex @inproceedings{elbanani2024probe3d, title = {{Probing the 3D Awareness of Visual Foundation Models}}, author = {El Banani, Mohamed and Raj, Amit and Maninis, Kevis-Kokitsi and Kar, Abhishek and Li, Yuanzhen and Rubinstein, Michael and Sun, Deqing and Guibas, Leonidas and Johnson, Justin and Jampani, Varun}, booktitle = {CVPR}, year = {2024}, }
	SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild Andreas Engelhardt, Amit Raj, Mark Boss, Yunzhi Zhang, Abhishek Kar, Yuanzhen Li, Ricardo Martin Brualla, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani Computer Vision and Pattern Recognition (CVPR), 2024 project / video / paper / abstract / bibtex @inproceedings{engelhardt2024-shinobi, author = {Engelhardt, Andreas and Raj, Amit and Boss, Mark and Zhang, Yunzhi and Kar, Abhishek and Li, Yuanzhen and Sun, Deqing and Martin Brualla, Ricardo and Barron, Jonathan T. and Lensch, Hendrik P.A. and Jampani, Varun}, title = {SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild}, venue={Computer Vision and Pattern Recognition (CVPR)}, year = {2024} }
	Accelerating Neural Field Training via Soft Mining Shakiba Kheradmand, Daniel Rebain, Gopal Sharma, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi Computer Vision and Pattern Recognition (CVPR), 2024 project / paper / abstract / bibtex @inproceedings{kheradmand2024softmining, title={Accelerating Neural Field Training via Soft Mining}, year={2024}, venue={Computer Vision and Pattern Recognition (CVPR)}, arxiv={https://arxiv.org/abs/2312.00075}, authors={Shakiba Kheradmand and Daniel Rebain and Gopal Sharma and Hossam Isack and Abhishek Kar and Andrea Tagliasacchi and Kwang Moo Yi} } }
	Unsupervised Keypoints from Pretrained Diffusion Models Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Helge Rhodin, Andrea Tagliasacchi, Kwang Moo Yi Computer Vision and Pattern Recognition (CVPR), 2024 project / paper / abstract / bibtex @inproceedings{hedlin2024keypoints, title={Unsupervised Keypoints from Pretrained Diffusion Models}, year={2024}, venue={Computer Vision and Pattern Recognition (CVPR)}, arxiv={https://arxiv.org/abs/2312.00065}, authors={Eric Hedlin and Gopal Sharma and Shweta Mahajan and Hossam Isack and Abhishek Kar and Helge Rhodin and Andrea Tagliasacchi and Kwang Moo Yi} } }
	Unsupervised Semantic Correspondence Using Stable Diffusion Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi Neural Information Processing Systems (NeurIPS), 2023 project / paper / abstract / bibtex @inproceedings{hedlin2023unsupervised, title={Unsupervised Semantic Correspondence Using Stable Diffusion}, author={Eric Hedlin and Gopal Sharma and Shweta Mahajan and Hossam Isack and Abhishek Kar and Andrea Tagliasacchi and Kwang Moo Yi}, booktitle={arXiv preprint}, year={2023}, publisher_page={https://arxiv.org/abs/2305.15581}, homepage={https://ubc-vision.github.io/LDM_correspondences/} }
	The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation Saurabh Saxena, Charles Hermnann, Junwa Hur, Abhishek Kar, Mohammad Norouzi, Deqing Sun, David J. Fleet Neural Information Processing Systems (NeurIPS), 2023 project / paper / previous version / abstract / bibtex @misc{saxena2023surprising, title={The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation}, author={Saurabh Saxena and Charles Herrmann and Junhwa Hur and Abhishek Kar and Mohammad Norouzi and Deqing Sun and David J. Fleet}, year={2023}, eprint={2306.01923}, archivePrefix={arXiv}, primaryClass={cs.CV} }
	ASIC: Aligning Sparse in-the-wild Image Collections Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar International Conference on Computer Vision (ICCV), 2023 project / arxiv / abstract / video / bibtex @inproceedings{gupta2023asic, author ={Gupta, Kamal and Jampani, Varun and Esteves, Carlos and Shrivastava, Abhinav and Makadia, Abhinav and Snavely, Noah and Kar, Abhishek}, title = {ASIC: Aligning Sparse in-the-wild Image Collections}, booktitle={Proceedings of the IEEE International Conference on Computer Vision}, year = {2023}, }
	LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs Zezhou Cheng, Carlos Esteves, Varun Jampani, Abhishek Kar, Subhransu Maji, Ameesh Makadia International Conference on Computer Vision (ICCV), 2023 project / paper / abstract / bibtex @inproceedings{cheng2023lunerf, title={LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs}, author={Cheng, Zezhou and Esteves, Carlos and Jampani, Varun and Kar, Abhishek and Maji, Subhransu and Makadia, Ameesh}, booktitle={Proceedings of the IEEE International Conference on Computer Vision}, year={2023} }
	DC^2: Dual-Camera Defocus Control by Learning to Refocus Hadi Alzayer, Abdullah Abuolaim, Leung Chun Chan, Yang Yang, Ying Chen Lou, Jia-Bin Huang, Abhishek Kar Computer Vision and Pattern Recognition (CVPR), 2023 project / arxiv / abstract / video / two minute papers / bibtex @inproceedings{alzayer2023defocuscontrol, title={DC2: Dual-Camera Defocus Control by Learning to Refocus}, author={Alzayer, Hadi and Abuolaim, Abdullah and Chun Chan, Leung and Yang, Yang and Chen Lou, Ying and Huang, Jia-Bin and Kar, Abhishek}, booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition}, pages={--}, year={2023} }
	Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming Ruofei Du, Na Li, Jing Jin, Michelle Carney, Scott Miles, Maria Kleiner, Xiuxiu Yuan, Yinda Zhang, Anuva Kulkarni, Xingyu "Bruce" Liu, Ahmed Sabie, Sergio Escolano, Abhishek Kar, Ping Yu, Ram Iyengar, Adarsh Kowdle, and Alex Olwal Proceedings of the Conference on Human Factors in Computing Systems (CHI), 2023 Best Paper Honorable Mention project / blog / demo / video / paper / bibtex @inproceedings{Du2023Rapsai, title = {{Rapsai: Accelerating Machine Learning Prototyping of Multimedia Applications Through Visual Programming}}, author = {Du, Ruofei and Li, Na and Jin, Jing and Carney, Michelle and Miles, Scott and Kleiner, Maria and Yuan, Xiuxiu and Zhang, Yinda and Kulkarni, Anuva and Liu, Xingyu and Sabie, Ahmed and Escolano, Sergio and Kar, Abhishek and Yu, Ping and Iyengar, Ram and Kowdle, Adarsh and Olwal, Alex}, booktitle = {Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems}, year = {2023}, publisher = {ACM}, series = {CHI}, doi = {10.1145/3544548.3581338}, }
	SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections Mark Boss, Andreas Engelhardt, Abhishek Kar, Yuanzhen Li, Deqing Sun, Jonathan T. Barron, Hendrik P. A. Lensch, Varun Jampani Neural Information Processing Systems (NeurIPS), 2022 project / video / paper / abstract / bibtex @inproceedings{boss2022-samurai, author = {Boss, Mark and Engelhardt, Andreas and Kar, Abhishek and Li, Yuanzhen and Sun, Deqing and Barron, Jonathan T. and Lensch, Hendrik P.A. and Jampani, Varun}, title = {{SAMURAI}: {S}hape {A}nd {M}aterial from {U}nconstrained {R}eal-world {A}rbitrary {I}mage collections}, booktitle = {Advances in Neural Information Processing Systems (NeurIPS)}, year = {2022} }
	Learned Monocular Depth Priors in Visual-Inertial Initialization Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, Konstantine Tsotsos European Conference on Computer Vision (ECCV), 2022 project / video / paper / abstract / bibtex @inproceedings{verse:monodepth-vio-init-eccv2022, author = {Yunwen Zhou, Abhishek Kar, Eric Turner, Adarsh Kowdle, Chao X. Guo, Ryan C. DuToit, and Konstantine Tsotsos}, title = {Learned Monocular Depth Priors in Visual-Inertial Initialization}, booktitle = {European Conference on Computer Vision}, year = {2022}, }
	SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting Varun Jampani* , Huiwen Chang, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu International Conference on Computer Vision (ICCV), 2021 (Oral)* paper / project / supplementary / video / abstract / bibtex @inproceedings{jampani:ICCV:2021, title = {SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting}, author = {Jampani, Varun and Chang, Huiwen and Sargent, Kyle and Kar, Abhishek and Tucker, Richard and Krainin, Michael and Kaeser, Dominik and Freeman, William T and Salesin, David and Curless, Brian and Liu, Ce}, booktitle={Proceedings of the IEEE International Conference on Computer Vision}, year={2021} }
	Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines Ben Mildenhall, Pratul Srinivasan, Rodrigo Ortiz-Cayon, Nima Khademi Kalantari, Ravi Ramamoorthi, Ren Ng, Abhishek Kar SIGGRAPH, 2019 paper / project / code / video / abstract / bibtex / p r e s s @article{mildenhall2019llff, title={Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines}, author={Ben Mildenhall and Pratul P. Srinivasan and Rodrigo Ortiz-Cayon and Nima Khademi Kalantari and Ravi Ramamoorthi and Ren Ng and Abhishek Kar}, journal={ACM Transactions on Graphics (TOG)}, year={2019} }
	Learning Independent Object Motion from Unlabelled Stereoscopic Videos Zhe Cao, Abhishek Kar, Christian Häne, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2019 project / abstract / bibtex / arxiv @incollection{sfCaoKHM2019, author = {Zhe Cao and Abhishek Kar and Christian H\"ane and Jitendra Malik}, title = {Learning Independent Object Motion from Unlabelled Stereoscopic Videos}, booktitle = CVPR, year = {2019}, }
	Learning a Multi-View Stereo Machine Abhishek Kar, Christian Häne, Jitendra Malik Neural Information Processing Systems(NIPS), 2017 abstract / bibtex / supplementary / arxiv / blog / code @incollection{lsmKarHM2017, author = {Abhishek Kar and Christian H\"ane and Jitendra Malik}, title = {Learning a Multi-View Stereo Machine}, booktitle = NIPS, year = {2017}, }
	Learning Category-Specific Deformable 3D Models for Object Reconstruction Shubham Tulsiani, Abhishek Kar, João Carreira, Jitendra Malik IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2017 abstract / bibtex / project @article{pamishapeTulsianiKCM15, author = {Shubham Tulsiani and Abhishek Kar and Jo{\~{a}}o Carreira and Jitendra Malik}, title = {Learning Category-Specific Deformable 3D Models for Object Reconstruction}, journal = {TPAMI}, year = {2016}, }
	The three R's of computer vision: Recognition, reconstruction and reorganization Jitendra Malik, Pablo Arbelaez, João Carreira, Katerina Fragkiadaki, Ross Girshick, Georgia Gkioxari, Saurabh Gupta, Bharath Hariharan, Abhishek Kar, Shubham Tulsiani Pattern Recognition Letters, 2016 paper / abstract / bibtex @article{malik2016three, title={The three R's of computer vision: Recognition, reconstruction and reorganization}, author={Malik, Jitendra and Arbel{\'a}ez, Pablo and Carreira, Jo{\~a}o and Fragkiadaki, Katerina and Girshick, Ross and Gkioxari, Georgia and Gupta, Saurabh and Hariharan, Bharath and Kar, Abhishek and Tulsiani, Shubham}, journal={Pattern Recognition Letters}, volume={72}, pages={4--14}, year={2016}, publisher={North-Holland} }
	Shape and Symmetry Induction for 3D Objects Shubham Tulsiani, Abhishek Kar, Qixing Huang, João Carreira, Jitendra Malik arXiv:1511.07845, 2015 abstract / bibtex / arxiv @incollection{shapeSymTulsianiKHCM15, author = {Shubham Tulsiani and Abhishek Kar and Qixing Huang and Jo{\~{a}}o Carreira and Jitendra Malik}, title = {Shape and Symmetry Induction for 3D Objects}, booktitle = arxiv:1511.07845, year = {2015}, }
	Amodal Completion and Size Constancy in Natural Scenes Abhishek Kar, Shubham Tulsiani, João Carreira, Jitendra Malik International Conference on Computer Vision (ICCV), 2015 abstract / supplementary / bibtex @incollection{amodalKarTCM15, author = {Abhishek Kar and Shubham Tulsiani and Jo{\~{a}}o Carreira and Jitendra Malik}, title = {Amodal Completion and Size Constancy in Natural Scenes}, booktitle = ICCV, year = {2015}, }
	Category-Specific Object Reconstruction from a Single Image Abhishek Kar, Shubham Tulsiani, João Carreira, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2015 (Oral) Best Student Paper Award project page / abstract / bibtex / supplementary / code / arxiv @incollection{categoryShapesKar15, author = {Abhishek Kar and Shubham Tulsiani and Jo{\~{a}}o Carreira and Jitendra Malik}, title = {Category-Specific Object Reconstruction from a Single Image}, booktitle = CVPR, year = {2015}, }
	Virtual View Networks for Object Reconstruction João Carreira, Abhishek Kar, Shubham Tulsiani, Jitendra Malik Computer Vision and Pattern Recognition (CVPR), 2015 abstract / bibtex / videos / arxiv @incollection{vvnCarreira14, author = {Jo{\~{a}}o Carreira and Abhishek Kar and Shubham Tulsiani and Jitendra Malik}, title = {Virtual View Networks for Object Reconstruction}, booktitle = CVPR, year = {2015}, }
	Looking At You: Fused Gyro and Face Tracking for Viewing Large Imagery on Mobile Devices Neel Joshi, Abhishek Kar, Michael F. Cohen ACM SIGCHI Conference on Human Factors in Computing Systems (CHI), 2012 abstract / bibtex / website / video @inproceedings{joshi2012looking, title={Looking at you: fused gyro and face tracking for viewing large imagery on mobile devices}, author={Joshi, Neel and Kar, Abhishek and Cohen, Michael}, booktitle={Proceedings of the SIGCHI Conference on Human Factors in Computing Systems}, pages={2211--2220}, year={2012}, organization={ACM} }