Elsevier

Fuzzy Sets and Systems

Volume 292, 1 June 2016, Pages 85-97
Fuzzy Sets and Systems

Factorization of matrices with grades

Dedicated to Francesc Esteva on the occasion of his 70th birthday
https://doi.org/10.1016/j.fss.2015.03.020Get rights and content

Abstract

We present an approach to decomposition and factor analysis of matrices with ordinal data. The matrix entries are grades to which objects represented by rows satisfy attributes represented by columns, e.g. grades to which an image is red, a product has a given feature, or a person performs well in a test. We assume that the grades are taken from bounded scales equipped with certain aggregation operators that are involved in the decompositions. Particular cases of the decompositions include the well-known Boolean matrix decomposition, and the sup-t-norm and inf-residuum decompositions. We consider the problem of decomposition of a given matrix into a product of two matrices with grades such that the number of factors, i.e. the inner dimension, be as small as possible. We observe that computing such decompositions is NP-hard and present a greedy approximation algorithm. Our algorithm is based on a geometric insight provided by a theorem identifying particular rectangular-shaped submatrices as optimal factors for the decompositions. These factors correspond to fixpoints of certain Galois connections associated with the input matrix, which are called formal concepts, and allow an easy interpretation of the decomposition. We present illustrative examples and experimental evaluation of the algorithm.

Introduction

In traditional approaches to dimensionality reduction, such as factor analysis, a decomposition (factorization) of an object–variable matrix is sought into an object–factor matrix and a factor-variable matrix with the number of factors reasonably small. The factors are considered as new variables, hidden in the data and likely more fundamental than the original variables. Computing the factors and interpreting them is the central topic of this paper.

We consider decompositions of matrices I with a particular type of ordinal data. Namely, each entry Iij of I represents a grade which the object corresponding to the ith row has, or is incident with, the attribute corresponding to the jth column. Examples of such data are results of questionnaires where respondents (rows) rate services, products, etc., according to various criteria (columns); results of performance evaluation of people (rows) by various tests (columns); or binary data in which case there are only two grades, 0 (no, failure) and 1 (yes, success). Our goal is to decompose an n×m object–attribute matrix I into a productI=AB of an n×k object–factor matrix A and a k×m factor–attribute matrix B with the number k of factors as small as possible.

The scenario is thus similar to ordinary matrix decomposition problems but there are important differences. First, we assume that the entries of I, i.e. the grades, as well as the entries of A and B are taken from bounded scales L of grades, such as the real unit interval L=[0,1] or the Likert scale L={1,,5} of degrees of satisfaction. Second, the matrix composition operation ∘ used in our decompositions is not the usual matrix product. Instead, we use a general product based on supremum-preserving aggregation operators introduced in [5], [6], see also [12]. Two important, well-known [2], [11] cases of this product are the sup-t-norm-product defined by(AB)ij=l=1kAilBlj, and the inf-residuum-product (denoted also by ◃) defined by(AB)ij=l=1kAilBlj, where ⊗ and → denote a (left-)continuous t-norm and its residuum [11], [15], and ⋁ and ⋀ denote the supremum and infimum. The ordinary Boolean matrix product is a particular case of the sup-t-norm product in which the scale L has 0 and 1 as the only grades and ab=min(a,b). It is to be emphasized that we attempt to treat graded incidence data in a way which is compatible with its semantics. This need has been recognized long ago in mathematical psychology, in particular in measurement theory [16]. For example, even if we represent the grades by numbers such as 0strongly disagree, 14disagree,,1strongly agree, addition, multiplication by real numbers, and linear combination of graded incidence data may not have natural meaning. Consequently, decomposition of a matrix I with grades into the ordinary matrix product of arbitrary real-valued matrices A and B may suffer from a difficulty to interpret A and B, see [20], [28]. In this paper, we present an algorithm which is based on a theorem from [6] regarding the role of fixpoints of certain Galois connections associated with I as factors for decomposition of I. This is important both from the technical viewpoint, since due to [6] optimal decompositions may be obtained this way, and the knowledge discovery viewpoint, since the fixpoints, called formal concepts may naturally be interpreted. The algorithm runs in polynomial time and delivers suboptimal decompositions. This is a necessity because, as we show, computing optimal decompositions is an NP-hard optimization problem. In addition, we present an illustrative example demonstrating the usefulness of such decompositions, and an experimental evaluation of the algorithm.

Recently, new methods of matrix decomposition and dimensionality reduction have been developed. One aim is to have methods which are capable of discovering possibly non-linear relationships between the original space and the lower dimensional space [23], [29]. Another is driven by the need to take into account constraints imposed by the semantics of the data. Examples include nonnegative matrix factorization, in which the matrices are constrained to those with nonnegative entries and which leads to additive parts-based discovery of features in data [17]. Another example, relevant to this paper, is Boolean matrix decomposition. Early work on this problem was done in [22], [26]. Recent work on this topic includes [7], [9], [19], [20], [22]. As was mentioned above, Boolean matrix decomposition is a particular case of the problem considered in this paper. Note also that partly related to this paper are methods for decomposition of binary matrices into non-binary ones such as [18], [24], [25], [27], [31], see also [28] for further references.

Section snippets

Decomposition and the factor model

As was mentioned above, we assume that the matrix entries contain elements from scales (grades) equipped with certain aggregation operators. In particular, we assume a general model of (1) in which the entries of A, B, and I are elements of three complete lattices L1,1, L2,2, and L3,3. That is, we assume AilL1, BljL2, and IijL3. We denote the operations as usual, adding subscript i; for instance, the infima, suprema, the least, and the greatest element in L2 are denoted by 2, 2, 02

Algorithm and complexity of decompositions

Let us define our problem precisely. For a given (that is, constant for the problem) aggregation structure L consisting of complete lattices L1, L2, L3, and an aggregation operation ⎕, the problem we discuss is a minimization (optimization) problem [1] specified as follows:

Problem:DECOMP(L)
Input:n × m matrix I with entries in L3;
Feasible Solution:n × k and k × m matrices A and B with entries
in L1 and L2, respectively, for which I = AB;
Cost of Solution:k.

Throughout this section, we assume that Lis are

Examples and experiments

In Section 4.1, we examine in detail a factor analysis of 2004 Olympic Decathlon data. We include this example to illustrate that the algorithm developed in this paper can be used to obtain reasonable factors from data with grades. In Section 4.2, we present an experimental evaluation of our algorithm.

Conclusions

We presented an approach to decomposition and factor analysis of matrices with grades, i.e. of a particular form of ordinal data. The factors in this approach correspond to fixpoints of certain Galois connections associated with the input matrix, which are known as formal concepts. The approach is justified by a theorem according to which optimal decompositions are attained by using formal concepts as factors. The relationship between the factors and original attributes is a non-linear one. An

Acknowledgements

R. Belohlavek acknowledges support by grant No. P202/10/0262, and during revision by No. 15-17899S of the Czech Science Foundation. V. Vychodil acknowledges support by the ESF project No. CZ.1.07/2.3.00/20.0059, the project is co-financed by the European Social Fund and the state budget of the Czech Republic. This paper is an extended version of the authors' paper “Factor analysis of incidence data via novel decomposition of matrices,” Proc. ICFCA 2009, 83–97.

References (31)

  • B. Ganter et al.

    Formal Concept Analysis. Mathematical Foundations

    (1999)
  • S. Gottwald

    A Treatise on Many-Valued Logic

    (2001)
  • M. Grabisch et al.

    Aggregation Functions

    (2009)
  • P. Hájek

    Metamathematics of Fuzzy Logic

    (1998)
  • K.H. Kim

    Boolean Matrix Theory and Applications

    (1982)
  • Cited by (9)

    • Heterogeneous formal context and its decomposition by heterogeneous fuzzy subsets

      2022, Fuzzy Sets and Systems
      Citation Excerpt :

      First, we present the overview of generalizations of Formal concept analysis in a fuzzy setting in Table 1. We remind that L-fuzzy formal concepts as the factors of L-fuzzy formal context were initially investigated by Bělohlávek [9] and the important properties of decomposition of matrices with grades from a complete residuated lattice L were thoroughly studied in [3,11,12,14,17–20]. Moreover, the factorization of L-fuzzy concept lattices by congruence relation was explored in [10].

    • Factorization of matrices with grades via essential entries

      2019, Fuzzy Sets and Systems
      Citation Excerpt :

      Matrices with grades in partially ordered sets L (matrices over L), and fuzzy relations with truth degrees in L, are examined in many papers; for those with L being residuated structures of truth degrees we refer e.g. to [1,11]. As far as factorization of matrices over L is concerned, we refer to [3,8], in which both the fundamental properties of formal concepts of I as optimal factors as well as the first decomposition algorithm are presented. [5] provides factor analyses of various sports datasets by means of this algorithm and examines additional theoretical problems inspired by the analyses.

    • Ranks of fuzzy matrices. Applications in state reduction of fuzzy automata

      2018, Fuzzy Sets and Systems
      Citation Excerpt :

      Later, in [5] fixed points of fuzzy Galois connections were proven to be special cases of maximal subdecompositions of fuzzy matrices. Through a series of papers, while studying formal concept analysis, Bělohlávek and his coworkers provided significant results regarding properties of row and column spaces associated to fuzzy matrices [11,12], optimal decompositions of fuzzy matrices [8,9], and efficient computation of suboptimal decompositions of fuzzy matrices [14]. It is worth noting that Ćirić et al. [17–21,23,30,31] made a great contribution to the theory of fuzzy matrices over complete residuated lattices, and offered a comprehensive survey on the theory of fuzzy automata, particularly relevant to problems of state reduction, determinization and of bisimulation of fuzzy automata.

    View all citing articles on Scopus
    View full text