I am postdoc researcher with a joint appointment with AMPLab and BIDS (Berkeley Institute of Data Science) at University of California, Berkeley. I am working with Prof. Michael J. Franklin.

I received my Ph.D from Department of Computer Science at University of Chicago in June, 2014 under the supervision of Prof. Ian Foster. Prior to that, I received my master degree from Department of Computer Science at the University of Chicago in Dec, 2007. And I got my bachelor degree in School of Software Engineering from Beijing University of Posts and Telecommunications in June, 2006.

Research Interest

I am interseted in distributed computing, high performance computing, and applying the computing techniques to solve big data problems. I am also intertested in data management systems for domain science research and discovery.

My thesis research was to enable concise, fast and scalable execution of parallel scripting applications on large scale computers through proper design and implementations of programming model, runtime system and file system.

Research Projects


KIRA: where Astronomy meets Big Data
Tachyon: A Memory-centric distribued file system
AMFORA: Parallel scripting on large scale computers


AIMES:Abstractions and integrated middleware for extreme scales
ExM: System support for extreme-scale, many-task applications
Swift: Fast easy parallel scripting - on multicores, clusters, clouds and supercomputers
Falkon: A fast and light-weight task execution framework


My publication list can be found at google scholar.
  1. D. S. Katz, A. Merzky, Z. Zhang, S. Jha. Application Skeletons: Construction and Use in eScience, Future Generation Computer Systems, 2015.
  2. Z. Zhang, K. Barbary, F. A. Nothaft, E. Sparks, O. Zahn, M. J. Franklin, D. A. Patterson, S. Perlmutter. Scientific Computing Meets Big Data Technology: An Astronomy Use Case, 2015 IEEE International Conference on Big Data, 2015.
  3. F. A. Nothaft, M. Massie, T. Danford, Z. Zhang, U. Laserson, Carl Yeksigian, J. Kattalam, A. Ahuja, J. Hammerbacher, M. Linderman, M. J. Franklin, A. D. Joseph, D. A. Patterson. Rethinking Data-Intensive Science Using Scalable Analytics Systems, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data. ACM, 2015.
  4. D. Crankshaw, P. Bailis, J. E. Gonzalez, H. Li, Z. Zhang, M. J. Franklin, A. Ghodsi, M. I. Jordan. The Missing Piece in Complex Analytics: Low Latency, Scalable Model Management and Serving with Velox, 7th Biennial Conference on Innovative Data Systems Research (CIDR), 2015.
  5. Z. Zhang, D. S. Katz. Using Application Skeletons to Improve eScience Infrastructure, 2014 IEEE 10th International Conference on e-Science (e-Science), 2014.
  6. Z. Zhang, D. S. Katz, T. G. Armstrong, J. Wozniak, I. Foster. Parallelizing the Execution of Sequential Scripts, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC13), 2013.
  7. Z. Zhang, D. S. Katz, M. Wilde, J. Wozniak, I. Foster. MTC Envelope: Defining the Capability of Large Scale Computers in the Context of Parallel Scripting Applications, Proceedings of 22nd International ACM Symposium on High-Performance Parallel and Distributed Computing (HPDC'13), 2013
  8. T. Li, X. Zhou, K. Brandstatter, D. Zhao, K. Wang, A. Rajendran, Z. Zhang, I. Raicu. ZHT: A Light-weight Reliable Persistent Dynamic Scalable Zero-hop Distributed Hash Table, IEEE International Parallel & Distributed Processing Symposium (IPDPS) 2013
  9. Z. Zhang, D. S. Katz, J. Wozniak, A. Espinosa, I. Foster. Design and Analysis of Data Management in Scalable Parallel Scripting, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC12), 2012.
  10. K. Maheshwari, A. Espinosa, D. S. Katz, M. Wilde, Z. Zhang, I. Foster, S. Callaghan, and P. Maechling. Job and Data Clustering for Aggregate Use of Multiple Production Cyberinfrastructures, Proceedings of Fifth International Workshop on Data Intensive Distributed Computting (DIDC'12), pp. 3-11, 2012.
  11. Emalayan Vairavanathan, Samer Al-Kiswany, Lauro Costa, Matei Ripeanu, Zhao Zhang, Daniel S. Katz, Michael Wilde. A Workflow-Aware Storage System: An Opportunity Study, Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), 2012.
  12. Zhao Zhang, Daniel S. Katz, Matei Ripeanu, Michael Wilde, Ian Foster. AME: An Anyscale Many-Task Computing Engine, Proceedings of the 6th Workshop on Workflows in Support of Large-Scale Science, 2011.
  13. Tim Armstrong, Zhao Zhang, Daniel S. Katz, Michael Wilde, Ian Foster. Scheduling Many-Task Workloads on Supercomputers: Dealing with Trailing Tasks, Proceedings of 3rd IEEE Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS10), (Best Paper,) 2010.
  14. Michael Wilde, Ian Foster, Kamil Iskra, Pete Beckman, Zhao Zhang, Allan Espinosa, Mihael Hategan, Ben Clifford, Ioan Raicu. Parallel Scripting for App lications at the Petascale and Beyond IEEE Computer Nov. 2009 Special Issue on Extreme Scale Computing, 2009.
  15. Ioan Raicu, Ian Foster, Mike Wilde, Zhao Zhang, Yong Zhao, Alex Szalay, Pete Beckman, Kamil Iskra, Philip Little, Christopher Moretti, Amitabh Chaudhary, Douglas Thain. Middleware Support for Many-Task Computing, to appear in Cluster Computing, The Journal of Networks, Software Tools and Applications, 2009.
  16. Z. Zhang, A. Espinosa, K. Iskra, I. Raicu, I. Foster and M. Wilde, Design and evaluation of a collective I/O model for loosely-coupled petascale programming, presented at the IEEE Workshop on Many-Task Computing on Grids and Supercomputers, Austin, TX, Nov. 2008.
  17. Ioan Raicu, Zhao Zhang, Mike Wilde, Ian Foster, Pete Beckman, Kamil Iskra, Ben Clifford, Toward Loosely Coupled Programming on Petascale Systems,IEEE/ACM Supercomputing 2008.


  1. Guest Co-editor: Journal of Future Generation Computing Systems (FGCS), Special Issue: eScience Applications and Infrastructure, March 2014.
  2. Publicity Chair: 5th Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS) 2012, Salt Lake City, UT, Novement 2012.
  3. Proceedings Chair: IEEE International Conference on eScience, Chicago, IL, October 2012.
  4. Organizer: 1st Greater Chicago Area System Research Workshop, Chicago, IL, May 2012.
  5. Organizer: Weekly Systems Research Paper Seminar, UC Systems Group, 2010-present.

Contact Information:

465 Soda Hall, MC-1776
Berkeley, CA, 94720
zhaozhang (at) eecs.berkeley.edu