Skip to main content

Techniques for Discrimination-Free Predictive Models

  • Chapter

Part of the book series: Studies in Applied Philosophy, Epistemology and Rational Ethics ((SAPERE,volume 3))

Abstract

In this chapter, we give an overview of the techniques developed ourselves for constructing discrimination-free classifiers. In discrimination-free classification the goal is to learn a predictive model that classifies future data objects as accurately as possible, yet the predicted labels should be uncorrelated to a given sensitive attribute. For example, the task could be to learn a gender-neutral model that predicts whether a potential client of a bank has a high income or not. The techniques we developed for discrimination-aware classification can be divided into three categories: (1) removing the discrimination directly from the historical dataset before an off-the-shelf classification technique is applied; (2) changing the learning procedures themselves by restricting the search space to non-discriminatory models; and (3) adjusting the discriminatory models, learnt by off-the-shelf classifiers on discriminatory historical data, in a post-processing phase. Experiments show that even with such a strong constraint as discrimination-freeness, still very accurate models can be learnt. In particular,we study a case of income prediction,where the available historical data exhibits a wage gap between the genders. Due to legal restrictions, however, our predictions should be gender-neutral. The discrimination-aware techniques succeed in significantly reducing gender discrimination without impairing too much the accuracy.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Australian Law. Australian sex discimination act 1984 (1984), http://www.comlaw.gov.au/Details/C2010C00056

  • Calders, T., Kamiran, F., Pechenizkiy, M.: Building classifiers with independency constraints. In: Saygin, Y., et al. (eds.) ICDM Workshops 2009, IEEE International Conference on Data Mining Workshops, Miami, Florida, USA, December 6, pp. 13–18. IEEE Computer Socity (2009)

    Google Scholar 

  • Calders, T., Verwer, S.: Three naive bayes approaches for discrimination-free classification. Data Mining and Knowledge Discovery 21, 277–292 (2010)

    Article  MathSciNet  Google Scholar 

  • Frank, A., Asuncion, A.: UCI machine learning repository (2010), http://archive.ics.uci.edu/ml

  • Hajian, S., Domingo-Ferrer, J., Martinez-Balleste, A.: Discrimination prevention in data mining for intrusion and crime detection. In: IEEE Symposium on Computational Intelligence in Cyber Security (CICS), pp. 47–54 (2011)

    Google Scholar 

  • Hajian, S., Domingo-Ferrer, J., Martínez-Ballesté, A.: Rule Protection for Indirect Discrimination Prevention in Data Mining. In: Torra, V., Narakawa, Y., Yin, J., Long, J. (eds.) MDAI 2011. LNCS, vol. 6820, pp. 211–222. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  • Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The WEKA data mining software: An update. ACM SIGKDD Explorations Newsletter 11(1), 110–118 (2009)

    Article  Google Scholar 

  • Heilman, M., Smith, N.A.: Tree edit models for recognizing textual entailments, paraphrases, and answers to questions. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 1011–1019. Association for Computational Linguistics, Stroudsburg (2010)

    Google Scholar 

  • Johnson, M.: PCFG models of linguistic tree representations. Comput. Linguist. 24, 613–632 (1998)

    Google Scholar 

  • Kamiran, F.: Discrimination-aware Classification. Doctoral dissertation, Eindhoven University of Technology, The Netherlands (2011)

    Google Scholar 

  • Kamiran, F., Calders, T.: Classifying without discriminating. In: 2nd IEEE International Conference on Computer, Control and Communication (IC4), pp. 1–6 (2009a)

    Google Scholar 

  • Kamiran, F., Calders, T.: Discrimination-aware classification. In: 21st Benelux Conference on Artificial Intelligence (BNAIC), pp. 333–334 (2009b)

    Google Scholar 

  • Kamiran, F., Calders, T.: Classification with No discrimination by preferential sampling. In: Proceedings Machine Learning Conference of Belgium and The Netherlands, BENELEARN (2010)

    Google Scholar 

  • Kamiran, F., Calders, T.: Data preprocessing techniques for classification without discrimination. Knowledge and Information Systems (to Appear, 2012)

    Google Scholar 

  • Kamiran, F., Calders, T., Pechenizkiy, M.: Discrimination aware decision tree learning (Tech. Rep. No. CS 10-13). Eindhoven University of Technolgy (2010a)

    Google Scholar 

  • Kamiran, F., Calders, T., Pechenizkiy, M.: Discrimination aware decision tree learning. In: IEEE International Conference on Data Mining, pp. 869–874 (2010b)

    Google Scholar 

  • Lerman, R., Yitzhaki, S.: A note on the calculation and interpretation of the gini index. Economics Letters 15(3-4), 363–368 (1984)

    Article  Google Scholar 

  • Levit, M., Alshawi, H., Gorin, A.L., Nöth, E.: Context-sensitive evaluation and correction of phone recognition output. In: Proc. of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003. ISCA (2003)

    Google Scholar 

  • Luong, B., Ruggieri, S., Turini, F.: k-NN as an Implementation of Situation Testing for Discrimination Discovery and Prevention. In: Proc. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 502–510 (2011)

    Google Scholar 

  • Mazhelis, O., Žliobaitė, I., Pechenizkiy, M.: Context-Aware Personal Route Recognition. In: Elomaa, T., Hollmén, J., Mannila, H. (eds.) DS 2011. LNCS, vol. 6926, pp. 221–235. Springer, Heidelberg (2011)

    Google Scholar 

  • Morris, A., Misra, H.: Confusion matrix based posterior probabilities correction (Idiap-RR No. Idiap-RR-53-2002). IDIAP (2002)

    Google Scholar 

  • Quinlan, J.: C4. 5: programs for machine learning. Morgan Kaufmann (1993)

    Google Scholar 

  • US Law. The US equal credit opportunity act (1968), http://www.fdic.gov/regulations/laws/rules/6500-1200.html

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Faisal Kamiran .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Kamiran, F., Calders, T., Pechenizkiy, M. (2013). Techniques for Discrimination-Free Predictive Models. In: Custers, B., Calders, T., Schermer, B., Zarsky, T. (eds) Discrimination and Privacy in the Information Society. Studies in Applied Philosophy, Epistemology and Rational Ethics, vol 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30487-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-30487-3_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-30486-6

  • Online ISBN: 978-3-642-30487-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics