skip to main content
research-article
Artifacts Available / v1.1

Erica: Query Refinement for Diversity Constraint Satisfaction

Published:01 August 2023Publication History
Skip Abstract Section

Abstract

Relational queries are commonly used to support decision making in critical domains like hiring and college admissions. For example, a college admissions officer may need to select a subset of the applicants for in-person interviews, who individually meet the qualification requirements (e.g., have a sufficiently high GPA) and are collectively demographically diverse (e.g., include a sufficient number of candidates of each gender and of each race). However, traditional relational queries only support selection conditions checked against each input tuple, and they do not support diversity conditions checked against multiple, possibly overlapping, groups of output tuples. To address this shortcoming, we present Erica, an interactive system that proposes minimal modifications for selection queries to have them satisfy constraints on the cardinalities of multiple groups in the result. We demonstrate the effectiveness of Erica using several real-life datasets and diversity requirements.

References

  1. Wesley W. Chu and Qiming Chen. 1994. A structured approach for cooperative query answering. ACM TKDE 6, 5 (1994), 738--749.Google ScholarGoogle Scholar
  2. Daniel Deutch, Zachary G. Ives, Tova Milo, and Val Tannen. 2013. Caravan: Provisioning for What-If Analysis. In CIDR. www.cidrdb.org.Google ScholarGoogle Scholar
  3. Frances Ding, Moritz Hardt, John Miller, and Ludwig Schmidt. 2021. Retiring Adult: New Datasets for Fair Machine Learning. In Proceedings of NeurIPS.Google ScholarGoogle Scholar
  4. Marina Drosou, HV Jagadish, Evaggelia Pitoura, and Julia Stoyanovich. 2017. Diversity in big data: A review. Big data 5, 2 (2017), 73--84.Google ScholarGoogle Scholar
  5. Todd J. Green, Gregory Karvounarakis, and Val Tannen. 2007. Provenance semirings. In Proceedings of PODS.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Tomasz Imielinski and Witold Lipski Jr. 1984. Incomplete Information in Relational Databases. J. ACM 31, 4 (1984).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Jinyang Li, Yuval Moskovitch, Julia Stoyanovich, and H. V. Jagadish. 2023. Query Refinement for Diversity Constraint Satisfaction. https://github.com/JinyangLi01/Query_refinement/blob/master/FullPaper/Query_Refinement.pdf.Google ScholarGoogle Scholar
  8. Chaitanya Mishra and Nick Koudas. 2009. Interactive query refinement. In EDBT.Google ScholarGoogle Scholar
  9. Suraj Shetiya, Ian P Swift, Abolfazl Asudeh, and Gautam Das. 2022. Fairnessaware range queries for selecting unbiased data. In Proceedings of IEEE ICDE.Google ScholarGoogle Scholar
  10. Quoc Trung Tran and Chee-Yong Chan. 2010. How to conquer why-not questions. In Proceedings of ACM SIGMOD. 15--26.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Quoc Trung Tran, Chee-Yong Chan, and Srinivasan Parthasarathy. 2009. Query by output. In Proceedings of ACM SIGMOD. 535--548.Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in

Full Access

  • Article Metrics

    • Downloads (Last 12 months)56
    • Downloads (Last 6 weeks)4

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader