Dixin Tang (CV)
University of California, Berkeley
Soda Hall 465, Berkeley
totemtang [at] berkeley [dot] edu


I am a Postdoc researcher at UC Berkeley working with Prof. Aditya Parameswaran. I am part of Data Systems and Foundations group and affiliated with RISELab.

Before joining UC Berkeley, I obained my PhD degree in the database group of UChicago, advised by Prof. Aaron Elmore. At UChicago, I'm also fortunate to work with Prof. Sanjay Krishnan and Prof. Michael Franklin.

Research Interest

My current research interest is to build data analysis tools and systems for end users with three research goals:

  • Improving interactivity for analyzing large-scale datasets
  • Enabling easy and robust programming
  • Correctly modeling users' intent and providing consistent semantics
I am also interested in designing new query processing methods that unify batch processing and continuous query processing, providing query service in the cloud, and supporting high-performance transaction processing on multicore machines.

I am currently working on several exciting projects, including Dataspread, a data analysis tool that combines the intuitiveness and flexibility of spreadsheets with the scalability and power of databases, Modin, a scalable dataframe system, and Lux, a visualization recommendation library for data scientists to perform easy data exploration in dataframe workflow. At UChicago, I work on CrocodileDB, a new database architecture that exploits time slackness to enable new resource-efficient query execution.


2021-11: I gave a talk about Modin at RISECamp 2021

2021-10: Two papers from Modin and Lux projects are accepted by VLDB 2022


Professional Services

Program Committee: SIGMOD'22
Reviewer: IEEE VIS'21