Skip to: Site menu | Main content


Alexey Tumanov


PostDoc, UC Berkeley
PhD, Carnegie Mellon
atumanov[at]berkeley[dot]edu
atumanov[at]cmu[dot]edu

Brief Bio

I am a PostDoc at the University of California Berkeley, working with Ion Stoica. I completed my PhD at Carnegie Mellon, advised by Greg Ganger and collaborating closely with Mor Harchol-Balter and Onur Mutlu. At CMU, I was honored by the prestigious NSERC Alexander Graham Bell Canada Graduate Scholarship (NSERC CGS-D3) and partially funded by the Intel Science and Technology Centre for Cloud Computing and Parallel Data Lab. Prior to Carnegie Mellon, I worked on agile stateful VM replication with para-virtualization at the University of Toronto. My interest in cloud computing brought me to the University of Toronto from industry, where I had worked on the development of cluster middleware responsible for distributed datacenter resource management.

Research Synopsis

At a high level, my research interests revolve around systems support and resource management for distributed machine learning frameworks and applications. My recent research focuses on modeling, design, and development of abstractions, primitives, algorithms and systems artifacts for a general resource management framework with support for static and dynamic heterogeneity, hard and soft placement constraints, time-varying resource capacity guarantees, and combinatorial constraints in heterogeneous datacenters. For more detail, please refer to my publication list below.

Publications

Real-Time Machine Learning: the Missing Pieces
Robert Nishihara, Philipp Moritz, Stephanie Wang, Alexey Tumanov, William Paul, Johann Schleier-Smith, Richard Liaw, Michael I. Jordan, Ion Stoica
In Proc. of HotOS XVI, May 2017.
[Abstract] [PDF] [BibTeX]

@inproceedings{ray-hotos17,
  title={Real-Time Machine Learning: The Missing Pieces},
  author={Robert Nishihara and Philipp Moritz and Stephanie Wang and Alexey Tumanov 
          and William Paul and Johann Schleier-Smith and Richard Liaw 
          and Michael I. Jordan and Ion Stoica},
  booktitle={Workshop on Hot Topics in Operating Systems},
  journal={arXiv preprint arXiv:1703.03924},
  year={2017}
}

Proteus: agile ML elasticity through tiered reliability in dynamic resource markets
Aaron Harlap, Alexey Tumanov, Andrew Chung, Gregory R. Ganger, Phil Gibbons
In Proc. of EuroSys'17, Apr 2017.
[Abstract] [PDF] [BibTeX]

@inproceedings{proteus-eurosys17,
 author = {Harlap, Aaron and Tumanov, Alexey and Chung, Andrew and 
           Ganger, Gregory R. and Gibbons, Phillip B.},
 title = {Proteus: Agile ML Elasticity Through Tiered Reliability 
          in Dynamic Resource Markets},
 booktitle = {Proceedings of the Twelfth European Conference on Computer Systems},
 series = {EuroSys '17},
 year = {2017},
 isbn = {978-1-4503-4938-3},
 location = {Belgrade, Serbia},
 pages = {589--604},
 numpages = {16},
 url = {http://doi.acm.org/10.1145/3064176.3064182},
 doi = {10.1145/3064176.3064182},
 acmid = {3064182},
 publisher = {ACM},
 address = {New York, NY, USA},
}

Morpheus: Towards Automated SLOs for Enterprise Clusters
S. Jyothi, C. Curino, I. Menache, S. Narayanamurthy, A. Tumanov, J. Yaniv, R. Mavlyutov, I. Goiri, S. Krishnan, J. Kulkarni, S. Rao
In Proc. of USENIX OSDI'16, Nov 2016.
[Abstract] [PDF] [Slides] [BibTeX]

@inproceedings{morpheus-osdi16,
  author = {Sangeetha Abdu Jyothi and Carlo Curino 
and Ishai Menache and Shravan Matthur Narayanamurthy 
and Alexey Tumanov and Jonathan Yaniv and Ruslan Mavlyutov 
and Inigo Goiri and Subru Krishnan and Janardhan Kulkarni and Sriram Rao},
  title = {Morpheus: Towards Automated SLAs for Enterprise Clusters},
  booktitle = {Proc. of the 12th USENIX OSDI (OSDI'16)},
  year = {2016},
  address = {GA},
  url = {https://www.usenix.org/conference/osdi16/technical-sessions/presentation/jyothi},
  publisher = {USENIX Association},
}

TetriSched: global rescheduling with adaptive plan-ahead in dynamic heterogeneous clusters. [best student paper]
Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger.
In Proc. of EuroSys'16, London, UK, April 2016.
[Abstract] [PDF] [Slides] [BibTeX]

@inproceedings{tetrisched,
 author = {Alexey Tumanov and Timothy Zhu and Jun Woo Park and Michael A. Kozuch
and Mor Harchol-Balter and Gregory R. Ganger},
 title = {{T}etri{S}ched: global rescheduling with adaptive plan-ahead in dynamic
heterogeneous clusters},
 booktitle = {Proc. of the 11th European Conference on Computer Systems},
 series = {EuroSys '16},
 year = {2016},
 month = {Apr},
 location = {London, UK},
 Publisher = {ACM},
}

PriorityMeister: Tail Latency QoS for Shared Networked Storage.
Timothy Zhu, Alexey Tumanov, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger.
In Proc. of the 5th ACM Symposium on Cloud Computing, SoCC'14, Nov 2014.
[Abstract] [PDF] [BibTeX]

@inproceedings{pm-socc14,
    author = {Timothy Zhu and Alexey Tumanov and Michael A. Kozuch and
              Mor Harchol-Balter and Gregory R. Ganger},
    title = {{P}riority{M}eister: Tail Latency QoS for Shared Networked Storage},
    booktitle = {Proc. of the 5th ACM Symposium on Cloud Computing},
    series = {SOCC '14},
    year = {2014},
    location = {Seattle, WA},
    Publisher = {ACM},
}

Exploiting iterative-ness for parallel ML computations.
Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Gregory R. Ganger, Phil B. Gibbons, Garth A. Gibson, Eric P. Xing.
In Proc. of the 5th ACM Symposium on Cloud Computing, SoCC'14, Nov 2014.
[Abstract] [PDF] [BibTeX]


        

Agility and performance in elastic distributed storage.
Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, and Gregory R. Ganger.
Trans. Storage, 10(4):16:1– 16:27, October 2014.
[Abstract] [PDF] [BibTeX]

@article{Xu:2014,
 author = {Xu, Lianghong and Cipar, James and Krevat, Elie and Tumanov, Alexey 
           and Gupta, Nitin and Kozuch, Michael A. and Ganger, Gregory R.},
 title = {Agility and Performance in Elastic Distributed Storage},
 journal = {Trans. Storage},
 issue_date = {October 2014},
 volume = {10},
 number = {4},
 month = oct,
 year = {2014},
 issn = {1553-3077},
 pages = {16:1--16:27},
 articleno = {16},
 numpages = {27},
 url = {http://doi.acm.org/10.1145/2668129},
 doi = {10.1145/2668129},
 acmid = {2668129},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {Cloud storage, agility, distributed file systems, elastic storage, power, write offloading},
}

SpringFS: Bridging Agility and Performance in Elastic Distributed Storage
Lianghong Xu, James Cipar, Elie Krevat, Alexey Tumanov, Nitin Gupta, Michael A. Kozuch, Gregory R. Ganger.
In Proc. of Usenix FAST’14, Feb 2014.
[Abstract] [PDF] [BibTeX][acceptance: 18%]

@inproceedings{springfs,
    author = {Lianghong Xu and James Cipar and Elie Krevat and Alexey Tumanov
              and Nitin Gupta and Michael A. Kozuch and Gregory R. Ganger},
    title = {SpringFS: Bridging Agility and Performance in Elastic Distributed Storage},
    booktitle = {Proc. of the 12th USENIX FAST},
    year = {2014},
    isbn = {ISBN 978-1-931971-08-9},
    location = {Santa Clara, CA},
    pages = {243-255},
    publisher = {USENIX},
    address = {Berkeley, CA}
}

TetriSched: Space-Time Scheduling for Heterogeneous Datacenters.
Alexey Tumanov, Timothy Zhu, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger.
Carnegie Mellon University PDL Technical Report CMU-PDL-13-112, Dec 2013.
[Abstract] [PDF] [BibTeX]

@techreport{tetrischedTR,
    Author = { Alexey Tumanov and Timothy Zhu and Michael A. Kozuch and 
               Mor Harchol-Balter and Gregory R. Ganger },
    Title = {{T}etri{S}ched: Space-Time Scheduling for Heterogeneous Datacenters},
    Institution = {Carnegie Mellon University},
    Year = {2013},
    Month = {Dec},
    URL = {http://www.pdl.cmu.edu/PDL-FTP/CloudComputing/CMU-PDL-13-112_abs.shtml},
    Number = {CMU-PDL-13-112},
}

Asymmetry-aware execution placement on manycore chips.
Alexey Tumanov, Joshua Wise, Onur Mutlu, Gregory R. Ganger.
In Proc. of the 3rd Workshop on Systems for Future Multicore Architectures (SFMA'13), EuroSys'13, April 2013.
[Abstract] [PDF] [BibTeX]

@inproceedings{atumanov-sfma13,
    author = {Alexey Tumanov and Joshua Wise and Onur Mutlu and Gregory R. Ganger},
    title = {Asymmetry-aware execution placement on manycore chips},
    booktitle = {Proc. of the 3rd Workshop on Systems for 
                 Future Multicore Architectures (SFMA'13)},
    series = {SFMA '13},
    year = {2013},
    location = {Prague, Czech Republic},
}

alsched: Algebraic Scheduling of Mixed Workloads in Heterogeneous Clouds"
Alexey Tumanov, James Cipar, Michael A. Kozuch, Gregory R. Ganger.
In Proc. of the 3rd ACM Symposium on Cloud Computing, SoCC'12, Oct 2012.
[Abstract] [PDF] [BibTeX] [acceptance: 15%]

@inproceedings{alsched-socc12,
    author = {Alexey Tumanov and James Cipar and Michael A. Kozuch and 
              Gregory R. Ganger},
    title = {{a}lsched: algebraic scheduling of mixed workloads in heterogeneous clouds},
    booktitle = {Proc. of the 3rd ACM Symposium on Cloud Computing},
    series = {SOCC '12},
    year = {2012},
    location = {San Jose, CA},
    Publisher = {ACM},
}

Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis
Charles Reiss, Alexey Tumanov, Gregory R. Ganger, Randy H. Katz, Michael A. Kozuch.
In Proc. of the 3rd ACM Symposium on Cloud Computing, SoCC'12, Oct 2012.
[Abstract] [PDF] [BibTeX] [acceptance: 15%]

@inproceedings{gtrace-socc12,
 author     = {Charles Reiss and Alexey Tumanov and Gregory R. Ganger and
            Randy H. Katz and Michael A. Kozuch},
 title      = {Heterogeneity and Dynamicity of Clouds at Scale: {G}oogle Trace
          Analysis},
 booktitle  = {Proc. of the 3nd ACM Symposium on Cloud Computing},
 series     = {SOCC '12},
 year       = {2012},
 location   = {San Jose, CA},
}

Kaleidoscope: Cloud Micro-Elasticity via VM State Coloring
Roy Bryant, Alexey Tumanov, Olga Irzak, Adin Scannell, Kaustubh Joshi, Matti Hiltunen, H. Andrés Lagar-Cavilla, Eyal de Lara.
In Proc. of EuroSys'11, Salzburg, Austria, April 2011.
[Abstract][PDF][BibTeX][acceptance: 15%]

@inproceedings{BryantEurosys11,
  author =       "Roy Bryant and Alexey Tumanov and Olga Irzak and 
                  Adin Scannell and Kaustubh Joshi and Matti Hiltunen
                  and H. Andr\'es Lagar-Cavilla and Eyal de Lara",
  title =        "{Kaleidoscope: Cloud Micro-Elasticity via VM State Coloring}",
  booktitle =    "{Proc. of Eurosys 2011}",
  address =      "{Salzburg, Austria}",
  month =        apr,
  year =         2011
}

Variability-Aware Latency Amelioration in Distributed Environments
Alexey Tumanov, Robert Allison, Wolfgang Stuerzlinger.
In Proc. of IEEE Virtual Reality Conference, 2007, pp. 123-130, March 2007.
[Abstract][PDF][BibTeX][acceptance: 20%]

@INPROCEEDINGS{Tumanov-ieeevr07,
  author={Alexey Tumanov and Robert Allison and Wolfgang Stuerzlinger},
  booktitle={Proc. of IEEE Virtual Reality Conference, 2007.},
  series = {IEEE VR'2007},
  title={Variability-Aware Latency Amelioration in Distributed Environments},
  year={2007},
  month={March},
  volume={},
  number={},
  pages={123--130},
  location={Charlotte, NC},
  doi={10.1109/VR.2007.352472},
}

Variability-Aware Latency Amelioration in Distributed Interactive Virtual Environments
Alexey Tumanov
M.Sc. Thesis, York University, Toronto, Canada, April 2006.
[PDF][BibTeX]

@mastersthesis{Tumanov-mscthesis,
  author = {Alexey Tumanov},
  title = {Variability-aware latency amelioration in distributed interactive
           virtual environments},
  school = {York University},
  address = {Toronto, Canada},
  year = {2006},
  month = {April},
}

Unpublished Manuscripts/Work in Progress

IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang, Yujia Luo, Daniel Crankshaw, Alexey Tumanov, Fisher Yu, Joseph E. Gonzalez. CoRR abs/1706.00885, Jun 3, 2017. [ArXiV][BibTeX]

@article{DBLP:journals/corr/WangLCTG17,
  author    = {Xin Wang and
               Yujia Luo and
               Daniel Crankshaw and
               Alexey Tumanov and
               Joseph E. Gonzalez},
  title     = {{IDK} Cascades: Fast Deep Learning by Learning not to Overthink},
  journal   = {CoRR},
  volume    = {abs/1706.00885},
  year      = {2017},
  url       = {http://arxiv.org/abs/1706.00885},
  archivePrefix = {arXiv},
  eprint    = {1706.00885},
  timestamp = {Mon, 03 Jul 2017 13:29:02 +0200},
  biburl    = {http://dblp.org/rec/bib/journals/corr/WangLCTG17},
  bibsource = {dblp computer science bibliography, http://dblp.org}
}

JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes
Alexey Tumanov, Angela Jiang, Jun Woo Park, Michael A. Kozuch, Gregory R. Ganger.
Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-104. September 2016. [Abstract] [PDF] [BibTeX]

@techreport{jamaisvu-tr201609,
    author = {Alexey Tumanov and Angela Jiang and Jun Woo Park and 
              Michael A. Kozuch and Gregory R. Ganger},
    title = {{JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes}},
    institution = {{Carnegie Mellon University}},
    year = {2016},
    month = {September},
    number = {CMU-PDL-16-104}
}

Patents

Methods and Apparatus to Provision Virtual Machine Resources
Horacio Andres Lagar-cavilla, Roy Bryant, Matti Hiltunen, Olga Irzak, Kaustubh Joshi, Adin Matthew Scannell, Alexey Tumanov, Eyal De Lara.
Patent Number 20130055252. February 2013.

Teaching

Courses I TA'ed:

Courses I lectured for:

Select Fellowships and Awards

Academic Service

Reviewer for: ACM SoCC 2013, ACM SIGMETRICS 2014, IEEE/ACM MICRO 2014, ACM SoCC 2017