We research methods and tools that let users at any level of technical expertise clean, manipulate, and wrangle their datasets painlessly. Check out our latest publications!

News & Updates

Data-Driven Domain Discovery for Structured Datasets.

Masayo Ota, Heiko Mueller, Juliana Freire, Divesh Srivastava.

Proceedings of the VLDB Endowment, 13(7), 2020.

Your notebook is not crumby enough, REPLace it.

Mike Brachmann, William Spoth, Oliver Kennedy, Boris Glavic, Heiko Mueller, Sonia Castelo, Carlos Bautista, Juliana Freire.

Conference on Innovative Data Systems Research (CIDR), 2020.

Data Debugging and Exploration with Vizier.

Mike Brachmann, Carlos Bautista, Sonia Castelo, Su Feng, Juliana Freire, Boris Glavic, Oliver Kennedy, Heiko Mueller, Remi Rampin, William Spoth, Ying Yang.

ACM International Conference on Management of Data (SIGMOD), Demo Track, 2019.

Data Quality: The Role of Empiricism.

Shazia Wasim Sadiq, Tamraparni Dasu, Xin Luna Dong, Juliana Freire, Ihab F. Ilyas, Sebastian Link, Renée J. Miller, Felix Naumann, Xiaofang Zhou, Divesh Srivastava.

SIGMOD Rec. 46(4): 35-43, 2017.

The exception that improves the rule.

Juliana Freire, Boris Glavic, Oliver Kennedy, Heiko Mueller.

Workshop on Human-In-the-Loop Data Analytics (HILDA), 2016.

Exploring What not to Clean in Urban Data: A Study Using New York City Taxi Trips.

Juliana Freire, Aline Bessa, Fernando Chirigati, Huy T. Vo, Kai Zhao.

IEEE Data Eng. Bull. 39(2): 63-77, 2016.

RioBusData: Outlier Detection in Bus Routes of Rio de Janeiro.

Aline Bessa, Fernando de Mesentier Silva, Rodrigo Frassetto Nogueira, Enrico Bertini, Juliana Freire.

Symposium on Visualization in Data Science (VDS at IEEE VIS), 2015.