Publications by Date
2025
- PAPER RequestAtlas: Supporting the Slow and Iterative Process of Requesting Public Records.
Rachel Warren, Aditya G. Parameswaran, Lisa Pickoff-White, Niloufar Salehi. 28th Int'l Conference on Computer-Supported Cooperative Work (CSCW), Bergen, Norway. November 2025
(Used by journalists for managing 1000s of public record requests.)
- PAPER Towards Accurate and Efficient Document Analytics with Large Language Models.
Yiming Lin, Madelon Hulsebos, Ruiying Ma, Shreya Shankar, Sepanta Zeighami, Aditya G. Parameswaran, Eugene Wu. 41st IEEE Int’l Conf on Data Engineering (ICDE), Hong Kong. May 2025
- PAPER PromptEvals: A Dataset of Assertions and Guardrails for Custom Production Large Language Model Pipelines.
Reya Vir*, Shreya Shankar*, Harrison Chase, William Hinthorn, Aditya G. Parameswaran. 20th Conf. of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), Albuquerque, USA. April 2025
(Done in collaboration with LangChain, a leading LLM workflow company.)
- PRE-PRINT RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines.
Quentin Romero Lauro*, Shreya Shankar*, Sepanta Zeighami, Aditya G. Parameswaran. Technical Report. April 2025
- PRE-PRINT TWIX: Automatically Reconstructing Structured Data from Templatized Documents.
Yiming Lin, Mawil Hasan, Rohan Kosalge, Alvin Cheung, Aditya G. Parameswaran. Technical Report. April 2025
- PRE-PRINT Steering Semantic Data Processing with DocWrangler.
Shreya Shankar*, Bhavya Chopra*, Mawil Hasan, Stephen Lee, Björn Hartmann, Joseph M. Hellerstein, Aditya G. Parameswaran, Eugene Wu. Technical Report. April 2025
(The deployed version of DocWrangler has been used over 1500 times.)
- PRE-PRINT The Cambridge Report on Database Research.
Anastasia Ailamaki, ..., Aditya Parameswaran, .... Technical Report. April 2025
(This is a once-every-five-years report on data management research, written by experts in the field.)
- PRE-PRINT Why Do Multi-Agent LLM Systems Fail?.
Mert Cemri, Melissa Z Pan, Shuyi Yang, Lakshya A Agrawal, Bhavya Chopra, Rishabh Tiwari, Kurt Keutzer, Aditya Parameswaran, Dan Klein, Kannan Ramchandran, Matei Zaharia, Joseph E Gonzalez, Ion Stoica. Technical Report. March 2025
- PAPER NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval.
Sepanta Zeighami, Zac Wellmer, Aditya G. Parameswaran. 13th Int’l Conf on Learning Representations (ICLR), Singapore. March 2025
(Deployed by LlamaIndex as part of their open-source.)
- PAPER LLM-Powered Proactive Data Systems.
Sepanta Zeighami, Yiming Lin, Shreya Shankar, Aditya G. Parameswaran. IEEE Data Engineering Bulletin, Issue on LLMs-meets-data. March 2025
- PAPER Flow with FlorDB: Incremental Context Maintenance for the Machine Learning Lifecycle.
Rolando Garcia, Pragya Kallanagoudar, Chithra Anand, Sarah E. Chasins, Joseph M. Hellerstein, Aditya G. Parameswaran. 25th Conference on Innovative Data Systems Research (CIDR), Amsterdam, Netherlands. January 2025
2024
- PAPER Benchmarking Table Retrieval for Generative Tasks.
Carl Ji, Aditya Parameswaran, Madelon Hulsebos. TRL Workshop @ NeurIPS 2024, Vancouver, Canada. November 2024
- PAPER 'We Have No Idea How Models will Behave in Production until Production': How Engineers Operationalize Machine Learning.
Shreya Shankar, Rolando Garcia, Joseph M. Hellerstein, Aditya G. Parameswaran. 27th Int'l Conference on Computer-Supported Cooperative Work (CSCW), San Jose, Costa Rica. November 2024
(The 3 V's of MLOps, coined by this paper, was covered in a number of industry blogs and podcasts.)
- PAPER Inferring Visualization Intent from Conversation.
Haotian Li, Nithin Chalapathi, Huamin Qu, Alvin Cheung, Aditya G. Parameswaran. 33rd Int’l Conf on Information and Knowledge Management (CIKM), Boise, USA. October 2024
- PRE-PRINT DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing.
Shreya Shankar, Aditya G. Parameswaran, Eugene Wu. Technical Report. October 2024
(Over 1.4K Github stars and multiple users across domains.)
- PAPER Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences.
Shreya Shankar, J.D. Zamfirescu-Pereira, Björn Hartmann, Aditya G. Parameswaran, Ian Arawjo. 39th ACM Symposium on User Interface Software and Technology (UIST), Pittsburgh, USA. October 2024
(Deployed by LangChain as part of their LangChain Hub.)
- PAPER Quilt: Custom UIs for Linking Unstructured Documents to Structured Datasets (Demo).
Pragya Kallanagoudar, Chithra Anand, Rolando Garcia, Rebecca M. M. Hicke, Aditya G. Parameswaran, Eunice Jun, Sarah E. Chasins. 39th ACM Symposium on User Interface Software and Technology (Adjunct Volume), Pittsburgh. October 2024
- PAPER SPADE: Synthesizing Assertions for Large Language Model Pipelines.
Shreya Shankar, Haotian Li, Parth Asawa, Madelon Hulsebos, Yiming Lin, J. D. Zamfirescu-Pereira, Harrison Chase, Will Fu-Hinthorn, Aditya G. Parameswaran, Eugene Wu. 50th International Conference on Very Large Data Bases (VLDB), Guangzhou, China. August 2024
(Deployed by LangChain as part of their LangChain Hub.)
- PAPER Dealing with Acronyms, Abbreviations, and Typos in Real-World Entity Matching..
Joshua Wu, Dixin Tang, Nithin Chalapathi, Tristan Chambers, Julie Ciccolini, Cheryl Philips, Lisa Pickoff-White, Aditya G. Parameswaran. 50th International Conference on Very Large Data Bases (VLDB), Guangzhou, China. August 2024
- PAPER 'It Took Longer than I was Expecting': Why is Dataset Search Still so Hard?.
Madelon Hulsebos, Wenjing Lin, Shreya Shankar, Aditya G. Parameswaran. Workshop on Human-in-the-Loop Data Analytics (HILDA) at the ACM SIGMOD Int'l Conf. on Management of Data, Santiago, Chile. June 2024
- PAPER Building Reactive Large Language Model Pipelines with Motion (Demo).
Shreya Shankar, Aditya G. Parameswaran. ACM SIGMOD Int'l Conf. on Management of Data , Santiago, Chile. June 2024
- PAPER Revisiting Prompt Engineering via Declarative Crowdsourcing.
Aditya G. Parameswaran, Shreya Shankar, Parth Asawa, Naman Jain, Yujie Wang. Conference on Innovative Database Research (CIDR), Chaminade, USA. January 2024
2023
- PAPER Automatic and Precise Data Valitation for Machine Learning.
Shreya Shankar, Labib Fawaz, Karl Gyllstrom, Aditya G. Parameswaran. 32nd Int’l Conf on Information and Knowledge Management (CIKM), Birmingham, UK. October 2023
- PAPER Transactional Panorama: A Conceptual Framework for User Perception in Analytical Visual Interfaces.
Dixin Tang, Alan D. Fekete, Indranil Gupta, Aditya G. Parameswaran. 49th International Conference on Very Large Data Bases (VLDB), Vancouver, Canada. September 2023
- PAPER Towards Observability for Production Machine Learning Pipelines.
Shreya Shankar, Aditya G. Parameswaran. 49th International Conference on Very Large Data Bases (VLDB), Vancouver, Canada. September 2023
- PAPER Bolt-on, Compact, and Rapid Program Slicing for Notebooks.
Shreya Shankar*, Stephen Macke*, Sarah Chasins, Andrew Head, Aditya G. Parameswaran. 49th International Conference on Very Large Data Bases (VLDB), Vancouver, Canada. September 2023
(The underlying open-source IPyflow tool has 275K Downloads, 1000 GitHub Stars as of October 2023.)
- PAPER Visualizing Spreadsheet Formula Graphs Compactly.
Fanchao Chen, Dixin Tang, Haotian Li, Aditya G. Parameswaran. 49th International Conference on Very Large Data Bases (VLDB), Vancouver, Canada. September 2023
- PAPER Efficient and Compact Spreadsheet Formula Graphs.
Dixin Tang, Fanchao Chen, Christopher De Leon, Tana Wattanawaroon, Jeaseok Yun, Srinivasan Seshadri, Aditya G. Parameswaran. 39th International Conf. on Data Engineering (ICDE), Anaheim, CA, USA. April 2023
2022
- PAPER Lux: Always-on Visualization Recommendations for Exploratory Data Science.
Doris Jung-Lin Lee, Dixin Tang, Kunal Agarwal, Thyne Boonmark, Caitlyn Chen, Jake Kang, Ujjaini Mukhopadhyay, Jerry Song, Micah Yong, Marti A. Hearst, Aditya G. Parameswaran. 48th International Conference on Very Large Data Bases (VLDB), Sydney, Australia and Zoom. September 2022
(Downloaded over 675K times as of October 2023, and used in a variety of industries.)
- PAPER Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System.
Devin Petersohn*, Dixin Tang*, Rehan Durrani, Areg Melik-Adamyan, Joseph E. Gonzalez, Anthony D. Joseph, Aditya G. Parameswaran. 48th International Conference on Very Large Data Bases (VLDB), Sydney, Australia and Zoom. September 2022
(Downloaded over 14M times as of October 2023, and used in a variety of industries.)
- PAPER Expressive Visual Querying for Accelerating Insight.
Tarique Siddiqui, Paul Luh, Zesheng Wang, Karrie Karahalios, Aditya G. Parameswaran. CACM, Volume 65 No. 7. 2022
(Invited Paper due to SIGMOD Best Paper Award.)
- PAPER Leveraging Analysis History for Improved In Situ Visualization Recommendation.
Will Epperson, Doris Lee, Leijie Wang, Kunal Agarwal, Aditya Parameswaran, Dominik Moritz, Adam Perer. EuroVis’22: 24th Eurographics Conference on Visualization, Rome, Italy. 2022
- PAPER Piloting Data Engineering at Berkeley.
Joe Hellerstein, Aditya G. Parameswaran. Data Ed Workshop at SIGMOD’22: SIGMOD International Conference on Data Management, Zoom. June 2022
- PAPER Rethinking Streaming Machine Learning Evaluation.
Shreya Shankar, Bernease Herman, Aditya G. Parameswaran. ML Evaluation Standards Workshop at ICLR’22: 11th International Conference on Learning Representations, Zoom. Apr 2022
2021
- PAPER Deconstructing Categorization in Visualization Recommendation: A Taxonomy and Comparative Study.
Doris Jung-Lin Lee, Vidya Setlur, Melanie Tory, Karrie Karahalios, Aditya Parameswaran. IEEE Int'l Conf. on Information Visualization (InfoVis), Zoom. October 2021
- PAPER Fine-Grained Lineage for Safer Notebook Interactions.
Stephen Macke, Hongpu Gong, Doris Jung-Lin Lee, Andrew Head, Doris Xin, Aditya Parameswaran. 47th International Conference on Very Large Data Bases (VLDB), Copenhagen, Denmark and Zoom. September 2021
(Downloaded over 240K times as of June 2023.)
- PAPER NOAH: Interactive Spreadsheet Exploration with Dynamic Hierarchical Overviews.
Sajjadur Rahman, Mangesh Bendre, Yuyang Liu, Shichu Zhu, Zhaoyuan Su, Karrie Karahalios, Aditya Parameswaran. 47th International Conference on Very Large Data Bases (VLDB), Copenhagen, Denmark and Zoom. September 2021
- PAPER Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities..
Doris Xin, Hui Miao, Aditya Parameswaran, Neoklis Polyzotis. SIGMOD Int'l Conf. on Management of Data, Xi'an, China. June 2021
- PAPER Enhancing the Interactivity of Dataframe Queries by Leveraging Think Time..
Doris Xin, Devin Petersohn, Dixin Tang, Yifan Wu, Joseph E. Gonzalez, Joseph M. Hellerstein, Anthony D. Joseph, Aditya G. Parameswaran. IEEE Data Engineering Bulletin, Issue on Data Validation for Machine Learning. May 2021
- PAPER Whither AutoML? Understanding the Role of Automation in Machine Learning Workflows.
Doris Xin, Eva Wu, Doris Lee, Niloufar Salehi, Aditya Parameswaran. International Conference on Human Factors in Computing Systems (CHI), Yokohama, Japan and Zoom. May 2021
- PAPER From Sketching to Natural Language: Expressive Visual Querying for Accelerating Insight..
Tarique Siddiqui, Paul Luh, Zesheng Wang, Karrie Karahalios, Aditya G. Parameswaran. SIGMOD Record, 50(1): 51-58. May 2021
(Invited Paper due to SIGMOD Best Paper Award.)
- PAPER Rapid Approximate Aggregation with Distribution-Sensitive Interval Guarantees.
Stephen Macke, Maryam Aliakbarpour, Ilias Diakonikolas, Aditya Parameswaran, Ronitt Rubinfeld. 37th International Conf. on Data Engineering (ICDE), Chania, Greece and Zoom. April 2021
2020
- PAPER Three Lessons from Accelerating Scientific Insight Discovery via Visual Querying.
Doris Lee, Tarique Siddiqui, Karrie Karahalios, Aditya Parameswaran. Patterns, Cell Press, Volume 1, Issue 7, 100126. October, 2020
- PAPER Uncovering Effective Explanations for Interactive Genomic Data Analysis.
Silu Huang, Charles Blatti, Saurabh Sinha, Aditya Parameswaran. Patterns, Cell Press, Volume 1, Issue 6, 100093. September, 2020
- PAPER Towards Scalable Dataframe Systems.
Devin Petersohn, Stephen Macke, Doris Xin, William Ma, Doris Lee, Xiangxi Mo, Joseph E. Gonzalez, Joseph M. Hellerstein, Anthony D. Joseph, Aditya Parameswaran. 46th Int'l Conf. on Very Large Data Bases, Tokyo, Japan. August 2020
- PAPER ShapeSearch: A Flexible and Efficient System for Shape-based Exploration of Trendlines.
Tarique Siddiqui, Zesheng Wang, Paul Luh, Karrie Karahalios, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Portland, USA. June 2020
(Best Paper Award: 2 our of 450+ submissions.)
- PAPER Benchmarking Spreadsheet Systems.
Sajjadur Rahman, Kelly Mack, Mangesh Bendre, Ruilin Zhang, Karrie Karahalios, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Portland, USA. June 2020
(Covered in the Morning Paper, a popular industry blog)
- PAPER Demystifying a Dark Art: Understanding Real-World Machine Learning Model Development.
Angela Lee, Doris Xin, Doris Lee, Aditya Parameswaran. Workshop on Human-in-the-Loop Data Analytics (HILDA) at the ACM SIGMOD Int'l Conf. on Management of Data, Portland, USA. June 2020
- PAPER Understanding Data Analysis Workflows on Spreadsheets: Roadblocks and Opportunities.
Pingjing Yang, Ti-Chung Cheng, Sajjadur Rahman, Mangesh Bendre, Karrie Karahalios, Aditya Parameswaran. Workshop on Human-in-the-Loop Data Analytics (HILDA) at the ACM SIGMOD Int'l Conf. on Management of Data, Portland, USA. June 2020
- PAPER OrpheusDB: Bolt-on Versioning for Relational Databases (Extended Version).
Silu Huang, Liqi Xu, Jialin Liu, Aaron Elmore, Aditya Parameswaran. VLDB Journal, Volume 29 (509-538). January 2020
(Extended Version of VLDB 2017 Best Paper)
2019
- PAPER CRUX: Adaptive Querying for Efficient Crowdsourced Data Extraction.
Theodoros Rekatsinas, Amol Deshpande, Aditya Parameswaran. 28th ACM International Conference on Information and Knowledge Management (CIKM ’19), Beijing, China. November 2019
- PAPER You can't always sketch what you want: Understanding Sensemaking in Visual Query Systems.
Doris Jung-Lin Lee, John Lee, Tarique Siddiqui, Jaewoo Kim, Karrie Karahalios, Aditya Parameswaran. IEEE Int’l Conf. on Visual Analytics Science & Technology (TVCG Track at VAST’19 at VIS), Vancouver, Canada. October 2019
- PAPER ScatterSearch: Visual Querying of Scatterplot Visualizations (Poster).
Doris Jung-Lin Lee, Jaewoo Kim, Renxuan Wang, Aditya Parameswaran. IEEE Int'l Conf. on Information Visualization (InfoVis), Vancouver, Canada. October 2019
- PAPER Enabling Data Science for the Majority.
Aditya Parameswaran. 45th International Conference on Very Large Data Bases (VLDB), Los Angeles, USA. September 2019
(Invited Paper for the VLDB Early Career Research Contributions Award.)
- PAPER Helix: Holistic Optimization for Accelerating Iterative Machine Learning.
Doris Xin, Stephen Macke, Litian Ma, Jialin Liu, Shuchen Song, Aditya Parameswaran. 45th International Conference on Very Large Data Bases (VLDB), Los Angeles, USA. September 2019
- PAPER A Human-in-the-loop Perspective on AutoML: Milestones and the Road Ahead.
Doris Jung-Lin Lee, Stephen Macke, Doris Xin, Angela Lee, Silu Huang, Aditya Parameswaran. IEEE Data Engineering Bulletin, Issue on DB4AI and AI4DB. June 2019
- PAPER Anti-Freeze for Large and Complex Spreadsheets: Asynchronous Formula Computation.
Mangesh Bendre, Tana Wattanawaroon, Kelly Mack, Kevin Chang, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Amsterdam, The Netherlands. June 2019
- PAPER An Exploratory User Study of Visual Causality Analysis.
Chi-Hsien Yen, Aditya Parameswaran, Wai-Tat Fu. 21st Eurographics Conference on Visualization (EuroVis), Porto, Portugal. June 2019
- PAPER Faster, Higher, Stronger: Redesigning Spreadsheets for Scale (Demo).
Mangesh Bendre, Tana Wattanawaroon, Sajjadur Rahman, Kelly Mack, Yuyang Liu, Shichu Zhu, Yu Lu, Ping-Jing Yang, Xinyan Zhou, Kevin Chang, Karrie Karahalios, Aditya Parameswaran. 35th International Conf. on Data Engineering (ICDE), Macau. April 2019
(Best Demo Award: Given to two out of 24 papers.)
- PAPER Avoiding drill-down fallacies with VisPilot: assisted exploration of data subsets.
Doris Jung-Lin Lee, Himel Dev, Huizi Hu, Hazem Elmeleegy, Aditya Parameswaran. 24th International Conference on Intelligent User Interfaces (IUI), Los Angeles, USA. March 2019
2018
- PAPER Holistic Crowd-Powered Sorting via AID: Optimizing for Accuracies, Inconsistencies, and Difficulties.
Shreya Rajpal and Aditya Parameswaran. 27th International Conference on Information and Knowledge Management (CIKM), Lingotto, Italy. October 2018
- PAPER The Case for a Visual Discovery Assistant: A Holistic Solution for Accelerating Visual Data Exploration.
Doris Jung-Lin Lee and Aditya Parameswaran. IEEE Data Engineering Bulletin, Issue on Insights and Explanations in Data Analysis. September 2018
- PAPER Adaptive Sampling for Rapidly Matching Histograms.
Stephen Macke, Yiming Zhang, Silu Huang, Aditya Parameswaran. 44th International Conference on Very Large Data Bases (VLDB), Rio de Janeiro, Brazil. September 2018
- PAPER ShapeSearch: Flexible Pattern-based Querying of Trend Line Visualizations (Demo).
Tarique Siddiqui, Paul Luh, Zesheng Wang, Karrie Karahalios, Aditya Parameswaran. 44th International Conference on Very Large Data Bases (VLDB), Rio de Janeiro, Brazil. September 2018
- PAPER Helix: Accelerating Human-in-the-loop Machine Learning (Demo).
Doris Xin, Litian Ma, Jialin Liu, Stephen Macke, Shuchen Song, Aditya Parameswaran. 44th International Conference on Very Large Data Bases (VLDB), Rio de Janeiro, Brazil. September 2018
- PAPER How Developers Iterate on Machine Learning Workflows -- A Survey of the Applied Machine Learning Literature.
Doris Xin, Litian Ma, Shuchen Song, Aditya Parameswaran. IDEA Workshop at KDD Int'l Conf. on Knowledge Discovery and Data Mining, London, UK. August 2018
- PRE-PRINT Directed Data Management: A New Frontier in Database Usability.
Mangesh Bendre, Sajjadur Rahman, Tana Wattanawaroon, Kelly Mack, Yu Lu, Kevin Chang, Karrie Karahalios, Aditya Parameswaran. Technical Report. August 2018
- PAPER Quality Evaluation Methods for Crowdsourced Image Segmentation.
Doris Jung-Lin Lee, Akash Das Sarma, Aditya Parameswaran. 6th International Conference on Human Computation and Crowdsourcing (HCOMP), Zurich, Switzerland. July 2018
- PAPER Navigating the Data Lake with Datamaran: Automatically Extracting Structure from Log Datasets.
Yihan Gao, Silu Huang, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Houston, USA. June 2018
- PAPER Optimally Leveraging Density and Locality to Support LIMIT Queries.
Albert Kim, Liqi Xu, Tarique Siddiqui, Silu Huang, Sam Madden, Aditya Parameswaran. HILDA Workshop at SIGMOD Int'l Conf. on Management of Data, Houston, USA. June 2018
- PAPER Accelerating Human-in-the-loop Machine Learning: Challenges and Opportunities.
Doris Xin, Litian Ma, Jialin Liu, Stephen Macke, Shuchen Song, Aditya Parameswaran. DEEM Workshop at SIGMOD Int'l Conf. on Management of Data, Houston, USA. June 2018
- PAPER DataDiff: User-Interpretable Data Transformation Summaries for Collaborative Data Analysis (Demo).
Gunce Yilmaz, Tana Wattanawaroon, Liqi Xu, Abhishek Nigam, Aaron Elmore, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Houston, USA. June 2018
- PAPER Towards a Holistic Integration of Spreadsheets with Databases: A Scalable Storage Engine for Presentational Data Management.
Mangesh Bendre, Vipul Venkataraman, Xinyan Zhou, Kevin Chang, Aditya Parameswaran. 34th International Conf. on Data Engineering (ICDE), Paris, France. April 2018
- PAPER Characterizing Scalability Issues in Spreadsheet Software using Online Forums (Case Study Paper).
Kelly Mack, John Lee, Kevin Chang, Karrie Karahalios, Aditya Parameswaran. International Conference on Human Factors in Computing Systems (CHI), Montreal, Canada. April 2018
- PAPER On the Interpretability of Conditional Probability Estimates in the Agnostic Setting.
Yihan Gao, Aditya Parameswaran, Jian Peng. Electronic Journal of Statistics, Volume 11, Number 2, (Special Issue of AISTATS 2017 Best Papers). January 2018
2017
- PRE-PRINT Towards a Theory of Data-Diff: Optimal Synthesis of Succinct Data Modification Scripts.
Tana Wattanawaroon, Stephen Macke, Aditya Parameswaran. Technical Report. December 2017
- PAPER I've Seen Enough: Incrementally Improving Visualizations to Support Rapid Decision Making.
Sajjadur Rahman, Maryam Aliakbarpour, Ha Kyung Kong, Eric Blais, Karrie Karahalios, Aditya Parameswaran, Ronitt Rubinfeld. 43rd International Conference on Very Large Data Bases (VLDB), Munich, Germany. September 2017
- PAPER OrpheusDB: Bolt-on Versioning for Relational Databases.
Silu Huang, Liqi Xu, Jialin Liu, Aaron Elmore, Aditya Parameswaran. 43rd International Conference on Very Large Data Bases (VLDB), Munich, Germany. September 2017
(Invited to: Special Issue of VLDB Journal for VLDB 2017 Best Papers)
- PAPER Effortless Visual Data Exploration with Zenvisage: An Interactive and Expressive Visual Analytics System.
Tarique Siddiqui, Albert Kim, John Lee, Karrie Karahalios, Aditya Parameswaran. 43rd International Conference on Very Large Data Bases (VLDB), Munich, Germany. September 2017
- PAPER Understanding Workers, Developing Effective Tasks, and Enhancing Marketplace Dynamics: A Study of a Large Crowdsourcing Marketplace.
Ayush Jain, Akash Das Sarma, Jennifer Widom, Aditya Parameswaran. 43rd International Conference on Very Large Data Bases (VLDB), Munich, Germany. September 2017
- PAPER OrpheusDB: A Light-weight Approach to Relational Dataset Versioning (Demo).
Liqi Xu, Silu Huang, Sili Hui, Aaron Elmore, Aditya Parameswaran. SIGMOD Int'l Conf. on Management of Data, Chicago, USA. June 2017
(Best Demo Honorable Mention)
- PAPER SLiMFast: Guaranteed Results for Data Fusion and Source Reliability.
Theo Rekatsinas, Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran, and Chris Re. SIGMOD Int'l Conf. on Management of Data, Chicago, USA. June 2017
- PAPER On the Interpretability of Conditional Probability Estimates in the Agnostic Setting (ORAL Presentation).
Yihan Gao, Aditya Parameswaran, Jian Peng. 20th Intl. Conf. on Artificial Intelligence and Statistics (AISTATS), Ft. Lauderdale, USA. April 2017
(Invited to: Special Issue of the Electronic Journal of Statistics)
- PAPER Interactive Data Exploration with Smart Drill-Down.
Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran. TKDE Journal, (Special Issue of ICDE 2016 Best Papers). March 2017
- PAPER Fast-forwarding to Desired Visualizations with Zenvisage.
Tarique Siddiqui, John Lee, Albert Kim, Edward Xue, Xiaofo Yu, Sean Zou, Lijin Guo, Changfeng Liu, Chaoran Wang, Karrie Karahalios, Aditya Parameswaran. Conference on Innovative Database Research (CIDR), Chaminade, USA. January 2017
2016
- PAPER Optimizing Open-Ended Crowdsourcing: The Next Frontier in Crowdsourced Data Management.
Aditya Parameswaran, Akash Das Sarma, Vipul Venkataraman. IEEE Data Engineering Bulletin, Issue on Human-in-the-loop Data Management. December 2016
- PAPER Towards Visualization Recommendation Systems.
Manasi Vartak, Silu Huang, Tarique Siddiqui, Samuel Madden, and Aditya Parameswaran. SIGMOD Record, Chicago, USA. December 2016
- PRE-PRINT It's a Matter of Perspective(s): Crowd-Powered Consensus Organization of Corpora.
Ayush Jain, Karan Goel, Joon Young Seo, Andrew Kuznetsov, Aditya Parameswaran, Hari Sundaram. Technical Report. November 2016
- PAPER FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.
Tarique Siddiqui, Xiang Ren, Aditya Parameswaran, and Jiawei Han. 25th International Conference on Information and Knowledge Management (CIKM), Indianapolis, USA. October 2016
- PAPER SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics.
Manasi Vartak, Sajjadur Rahman, Samuel Madden, Aditya Parameswaran, and Neoklis Polyzotis. 42nd International Conference on Very Large Data Bases (VLDB), New Delhi, India. September 2016
- PAPER Decibel: The Relational Dataset Branching System.
Michael Maddox, David Goehring, Aaron Elmore, Sam Madden, Aditya Parameswaran, and Amol Deshpande. 42nd International Conference on Very Large Data Bases (VLDB), New Delhi, India. September 2016
- PAPER Squish: Near-optimal Compression for Archival of Relational Datasets.
Yihan Gao, and Aditya Parameswaran. 22nd International Conf. on Knowledge Discovery and Data Mining (KDD), San Francisco, USA. August 2016
- PAPER Towards Globally Optimal Crowdsourcing Quality Management.
Akash Das Sarma, Aditya Parameswaran, Jennifer Widom. SIGMOD International Conf. on Management of Data, San Francisco, USA. June 2016
- PAPER Interactive Data Exploration with Smart Drill-Down.
Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran. 32nd International Conf. on Data Engineering (ICDE), Helsinki, Finland. May 2016
(Invited to: Special Issue of TKDE Journal for ICDE 2016 Best Papers)
- PAPER Challenges in Data Crowdsourcing.
Hector Garcia-Molina, Manas Joglekar, Adam Marcus, Aditya Parameswaran, and Vasilis Verios. IEEE TKDE: Transactions on Knowledge and Data Engineering, (Pages XX-YY). January 2016
2015
- PAPER Crowdsourced Data Management: Industry and Academic Perspectives (Book).
Adam Marcus and Aditya Parameswaran. Foundations and Trends® in Databases, Vol. 6: No. 1-2, pp 1-161. December 2015
- PAPER Surpassing Humans and Computers with JellyBean: Crowd-Vision-Hybrid Image Counting Algorithms.
Akash Das Sarma, Ayush Jain, Arnab Nandi, Aditya Parameswaran and Jennifer Widom. 3rd International Conference on Human Computation and Crowdsourcing (HCOMP), San Diego, USA. November 2015
- PAPER Towards Visualization Recommendation Systems.
Manasi Vartak, Silu Huang, Tarique Siddiqui, Samuel Madden, and Aditya Parameswaran. 1st Workshop on Data Systems for Interactive Analytics (DSIA), Chicago, USA. October 2015
- PAPER Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff.
Souvik Bhattacherjee, Amit Chavan, Silu Huang, Amol Deshpande, and Aditya Parameswaran. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Finish Them!: Pricing Algorithms for Human Computation.
Yihan Gao, Aditya Parameswaran. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Rapid Sampling for Visualizations with Ordering Guarantees.
Albert Kim, Eric Blais, Aditya Parameswaran, Piotr Indyk, Samuel Maddem, Ronitt Rubinfeld. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Data-Spread: Unifying Databases and Spreadsheets (Demo).
Mangesh Bendre, Bofan Sun, Xinyan Zhou, Ding Zhang, Shy-Yauer Lin, Kevin Chang, and Aditya Parameswaran. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Smart Drill-down: A New Data Exploration Operator (Demo).
Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Collaborative Data Analytics with Datahub (Demo).
Anant Bhardwaj, Amol Deshpande, Aaron Elmore, David Karger, Sam Madden, Aditya Parameswaran, Harihar Subramanyam, Eugene Wu, and Rebecca Zhang. 41st International Conference on Very Large Data Bases (VLDB), Kohala Coast, Hawaii, USA. September 2015
- PAPER Debiasing Crowdsourced Batches.
Honglei Zhuang, Aditya Parameswaran, Dan Roth, and Jiawei Han. 21th International Conf. on Knowledge Discovery and Data Mining (KDD), Sydney, Australia. August 2015
- PAPER Towards a Unified Query Language for Provenance and Versioning.
Amit Chavan, Silu Huang, Amol Deshpande, Aaron Elmore, Sam Madden, and Aditya Parameswaran. 7th International Workshop on Theory and Practice of Provenance (TaPP), Edinburgh, Scotland. July 2015
- PAPER Exploiting Correlations for Evaluating Complex Queries.
Manas Joglekar, Hector Garcia-Molina, Aditya Parameswaran and Christopher Re. SIGMOD International Conf. on Management of Data, Melbourne, Australia. May 2015
- PAPER Comprehensive and Reliable Crowd Assessment Algorithms.
Manas Joglekar, Hector Garcia-Molina, and Aditya Parameswaran . 31st International Conf. on Data Engineering (ICDE), Seoul, Korea. April 2015
- PAPER DataHub: Collaborative Data Science & Dataset Version Management at Scale.
Anant Bhardwaj, Souvik Bhattacherjee, Amit Chavan, Amol Deshpande, Aaron J. Elmore, Samuel Madden, Aditya Parameswaran. Conference on Innovative Database Research (CIDR), Asilomar, USA. January 2015
- PAPER GeoHashViz: Interactive Analytics for Mapping Spatiotemporal Diffusion of Twitter Hashtags.
Kiumars Soltani, Shaowen Wang, and Aditya Parameswaran. XSEDE, Miami, USA. 2015
2014
- PAPER Optimal Worker Quality and Answer Estimates in Crowd-Powered Filtering and Rating.
Akash Das Sarma, Aditya Parameswaran, Jennifer Widom. 2nd International Conference on Human Computation and Crowdsourcing (HCOMP), Pittsburgh, USA. November 2014
- PAPER SeeDB: Automatically Generating Query Visualizations (Demo).
Manasi Vartak, Samuel Madden, Aditya Parameswaran, Neoklis Polyzotis. 40th International Conf. on Very Large Data Bases (VLDB), Hangzhou, China. September 2014
- PAPER Optimal Crowd-Powered Rating and Filtering Algorithms.
Aditya Parameswaran, Stephen Boyd, Hector Garcia-Molina, Ashish Gupta, Neoklis Polyzotis, Jennifer Widom. 40th International Conf. on Very Large Data Bases (VLDB), Hangzhou, China. September 2014
- PAPER SeeDB: Visualizing Database Queries Efficiently (Vision Paper).
Aditya Parameswaran, Neoklis Polyzotis, and Hector Garcia-Molina. 40th International Conf. on Very Large Data Bases (VLDB), Hangzhou, China. September 2014
- PRE-PRINT Indexing Cost-Sensitive Prediction.
Leilani Battle, Edward Benson, Aditya Parameswaran, Eugene Wu. Technical Report. August 2014
- PAPER DataSift: A Crowd-Powered Search Toolkit (Demo).
Aditya Parameswaran, Ming Han Teh, Hector Garcia-Molina and Jennifer Widom. SIGMOD International Conf. on Management of Data, Snowbird, Utah, USA. June 2014
- PRE-PRINT NeedleTail: A System for Browsing Queries (Demo).
Albert Kim, Samuel Madden and Aditya Parameswaran . Technical Report. April 2014
- PAPER Crowd-Powered Find Algorithms.
Anish Das Sarma, Aditya Parameswaran, Hector Garcia-Molina and Alon Halevy. 30th International Conf. on Data Engineering (ICDE), Chicago, USA. April 2014
(Invited to: Special Issue of TKDE Journal for ICDE 2014 Best Papers)
2013
- PAPER Efficient Parsing-based Keyword Search over Databases.
Aditya Parameswaran, Raghav Kaushik and Arvind Arasu. 22th International Conf. on Information and Knowledge Management (CIKM), Burlingame, USA. November 2013
- PAPER An Expressive and Accurate Crowd-Powered Search Toolkit.
Aditya Parameswaran, Ming Han Teh, Hector Garcia-Molina and Jennifer Widom. 1st Conf. on Human Computation and Crowdsourcing (HCOMP), Palm Springs, USA. November 2013
- PAPER Human-Powered Data Management.
Aditya Parameswaran. Doctoral Dissertation, Stanford University. September 2013
(Thesis awards: Stanford U., SIGMOD's Jim Gray award, and SIGKDD's thesis award Runner-up)
- PAPER Active Sampling for Entity Matching with Guarantees.
Kedar Bellare, Suresh Iyengar, Aditya Parameswaran and Vibhor Rastogi. ACM Transactions on Knowledge Discovery from Data (TKDD) - Special Issue on ACM SIGKDD 2012, Volume 7(3). September 2013
- PAPER Evaluating the Crowd with Confidence.
Manas Joglekar, Hector Garcia-Molina and Aditya Parameswaran. 19th International Conf. on Knowledge Discovery and Data Mining (KDD), Chicago, USA. August 2013
2012
- PAPER An Overview of the Deco System: Data Model and Query Language; Query Processing and Optimization.
Hyunjung Park, Richard Pang, Aditya Parameswaran, Hector Garcia-Molina, Neoklis Polyzotis, and Jennifer Widom. SIGMOD Record, Volume 41. December 2012
- PAPER Human-Powered Debugging of Large Data Pipelines.
Nilesh Dalvi, Aditya Parameswaran and Vibhor Rastogi. 25th International Conf. on Neural Information Processing Systems (NIPS), Tahoe, Nevada, USA. December 2012
- PAPER Deco: Declarative Crowdsourcing.
Aditya Parameswaran, Hyunjung Park, Hector Garcia-Molina, Neoklis Polyzotis and Jennifer Widom. 21th International Conf. on Information and Knowledge Management (CIKM), Maui, Hawaii, USA. November 2012
- PAPER Deco: A System for Declarative Crowdsourcing (Demo).
Hyunjung Park, Richard Pang, Aditya Parameswaran, Hector Garcia-Molina, Neoklis Polyzotis and Jennifer Widom. 38th International Conf. on Very Large Data Bases (VLDB), Istanbul, Turkey. September 2012
- PRE-PRINT Query Processing over Crowdsourced Data.
Hyunjung Park, Aditya Parameswaran and Jennifer Widom. Infolab Technical Report. August 2012
- PAPER Active Sampling for Entity Matching.
Kedar Bellare, Suresh Iyengar, Aditya Parameswaran and Vibhor Rastogi. 18th International Conf. on Knowledge Discovery and Data Mining (KDD), Beijing, China. August 2012
(Invited to: Special Issue of TKDD Journal for KDD 2012 Best Papers)
- PRE-PRINT Identifying Reliable Workers Swiftly.
Aditya Ramesh, Aditya Parameswaran, Hector Garcia-Molina and Neoklis Polyzotis. Infolab Technical Report. June 2012
- PAPER So Who Won? Dynamic Max Discovery with the Crowd.
Stephen Guo, Aditya Parameswaran and Hector Garcia-Molina. SIGMOD International Conf. on Management of Data, Scottsdale, Arizona, USA. June 2012
- PAPER CrowdScreen: Algorithms for Filtering Data with Humans.
Aditya Parameswaran, Hector Garcia-Molina, Hyunjung Park, Neoklis Polyzotis, Aditya Ramesh and Jennifer Widom. SIGMOD International Conf. on Management of Data, Scottsdale, Arizona, USA. June 2012
- PAPER Fuzzy Joins using MapReduce.
Foto Afrati, Anish Das Sarma, David Menestrina, Aditya Parameswaran and Jeffrey D. Ullman. 28th International Conf. on Data Engineering (ICDE), Washington DC, USA. April 2012
2011
- PAPER Information Seeking: Convergence of Search, Recommendations and Advertising.
Hector Garcia-Molina, Georgia Koutrika and Aditya Parameswaran. Communications of the ACM, Viewpoint Article. November 2011
- PAPER Recommendation Systems with Complex Constraints: A CourseRank Perspective.
Aditya Parameswaran, Petros Venetis and Hector Garcia-Molina. ACM Transactions on Information Systems, Volume 29(4). November 2011
- PAPER Optimal Schemes for Robust Web Extraction.
Aditya Parameswaran, Nilesh Dalvi, Hector Garcia-Molina and Rajeev Rastogi. 37th International Conf. on Very Large Data Bases (VLDB), Seattle, USA. September 2011
- PAPER Human-assisted Graph Search: It's Okay to Ask Questions.
Aditya Parameswaran, Anish Das Sarma, Hector Garcia-Molina, Neoklis Polyzotis and Jennifer Widom. 37th International Conf. on Very Large Data Bases (VLDB), Seattle, USA. September 2011
- PAPER Answering Queries using Humans, Algorithms and Databases.
Aditya Parameswaran and Neoklis Polyzotis. Conference on Innovative Database Research (CIDR), Asilomar, USA. January 2011
2010
- PAPER Evaluating, Combining and Generalizing Recommendations with Prerequisites.
Aditya Parameswaran, Hector Garcia-Molina and Jeffrey D. Ullman,. 19th International Conf. on Information and Knowledge Management (CIKM), Toronto, Canada. October 2010
- PAPER Towards the Web of Concepts: Extracting Concepts from Large Datasets.
Aditya Parameswaran, Hector Garcia-Molina and Anand Rajaraman. 36th International Conf. on Very Large Data Bases (VLDB), Singapore. September 2010
(Invited to: Special Issue of VLDB Journal for VLDB 2010 Best Papers.)
- PAPER Recsplorer: Recommendation Algorithms Based on Precedence Mining.
Aditya Parameswaran, Georgia Koutrika, Benjamin Berkovitz and Hector Garcia-Molina,. SIGMOD International Conf. on Management of Data, Indianapolis, USA. June 2010
- PAPER Synthesizing View Definitions from Data.
Anish Das Sarma, Aditya Parameswaran, Hector Garcia-Molina and Jennifer Widom. 13th International Conf. on Database Theory (ICDT), Lausanne, Switzerland. March 2010
2009
- PAPER Social Sites Research Through CourseRank.
Benjamin Berkovitz, Filip Kaliszan, Georgia Koutrika, Henry Liou, Aditya Parameswaran, Petros Venetis, Zahra Mohammadi Zadeh and Hector Garcia-Molina,. SIGMOD Record, Volume XXX. December 2009
- PAPER Recommendations with Prerequisites (Short Paper).
Aditya Parameswaran and Hector Garcia-Molina. 3rd ACM Conference on Recommender Systems, New York, USA. October 2009
- PAPER Blogs as Predictors of Movie Success (Short Paper).
Eldar Sadikov, Aditya Parameswaran and Petros Venetis,. AAAI Conf. on Weblogs and Social Media (ICWSM) 2009, San Jose, USA. May 2009
2008
- PAPER Robust Construction of the Three-dimensional Flow Complex.
Frederic Cazals, Aditya Parameswaran and Sylvain Pion,. ACM Symposium on Computational Geometry (SOCG) 2008, Maryland, USA. June 2008