Dixin Tang (CV)
University of California, Berkeley
Address:
Soda Hall 465, Berkeley
Email:
totemtang [at] berkeley [dot] edu

About

I am a postdoctoral scholar at UC Berkeley working with Prof. Aditya Parameswaran. I am part of Data Systems and Foundations group, and affiliated with EPIC Data and SKY Computing labs.

Before joining UC Berkeley, I obained my PhD degree in the database group of UChicago, advised by Prof. Aaron Elmore. At UChicago, I'm also fortunate to work with Prof. Sanjay Krishnan and Prof. Michael Franklin.

Research Interest

My current research is supporting user-centered analytical interfaces at scale by reshaping modern data analytical stacks on three aspects:

  • Usability: how do we help end-users easily consume and make sense of the visual results?
  • Scalability: how do we improve interactivity and reduce response time for analyzing a large-scale dataset?
  • Cost: how do we reduce resource usage for scalable data systems while not sacrificing performance?
I am also generally interested in user-centered data management, query processing and optimizations, and cloud databases.

Projects

I am currently working on several exciting projects. The major projects include:

  • FormS: a Python library for scalable spreadsheet formula execution
  • Transactional Panorama: a conceptual framework for user perception in analytical visual interfaces
  • Taco: efficient and compact spreadsheet formula graphs
  • Modin: a scalable dataframe system
  • Lux: a visualization recommendation library for data scientists to perform easy data exploration in dataframe workflow
In addition, I have also worked on several other projects, including explaining data shift and cleaning police roster data. At UChicago, I work on CrocodileDB, a new database architecture that exploits time slackness to enable new resource-efficient query execution. Please check out this video for an overview of my research on CrocodileDB.

News

2022-11: I am invited to be a PC member of EDBT 2024

2022-10: I gave a talk about Lux at EPIC Advance

2022-09: I gave a talk at VLDB'22 about our paper "Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System". Please check out the video here.

2022-06: I will attend SIGMOD'22 in person and chair Session 2: Query Processing and Optimization

2021-12: I am invited to be a PC member of SIGMOD 2023

2021-11: I gave a talk about Modin at RISECamp 2021

2021-10: Two papers from Modin and Lux projects are accepted by VLDB 2022

Publications

Professional Services

Conference Program Committee: SIGMOD'22, SIGMOD'22 (Demo Track), SIGMOD'23, EDBT'24
Conference Reviewer: IEEE VIS'21
Conference Session Chair: SIGMOD'22
Journal Reviewer: VLDB Journal (2022), Distributed and Parallel Databases Journal (2022)