Factorization of matrices with grades

doi:10.1016/j.fss.2015.03.020

Fuzzy Sets and Systems

Volume 292, 1 June 2016, Pages 85-97

Dedicated to Francesc Esteva on the occasion of his 70th birthday

https://doi.org/10.1016/j.fss.2015.03.020

Abstract

We present an approach to decomposition and factor analysis of matrices with ordinal data. The matrix entries are grades to which objects represented by rows satisfy attributes represented by columns, e.g. grades to which an image is red, a product has a given feature, or a person performs well in a test. We assume that the grades are taken from bounded scales equipped with certain aggregation operators that are involved in the decompositions. Particular cases of the decompositions include the well-known Boolean matrix decomposition, and the sup-t-norm and inf-residuum decompositions. We consider the problem of decomposing a given matrix into a product of two matrices with grades such that the number of factors, i.e. the inner dimension, is as small as possible. We observe that computing such decompositions is NP-hard and present a greedy approximation algorithm. Our algorithm is based on a geometric insight provided by a theorem identifying particular rectangular-shaped submatrices as optimal factors for the decompositions. These factors correspond to fixpoints of certain Galois connections associated with the input matrix, which are called formal concepts, and allow an easy interpretation of the decomposition. We present illustrative examples and an experimental evaluation of the algorithm.

Introduction

In traditional approaches to dimensionality reduction, such as factor analysis, one seeks a decomposition (factorization) of an object–variable matrix into an object–factor matrix and a factor–variable matrix with a reasonably small number of factors. The factors are regarded as new variables, hidden in the data and likely more fundamental than the original variables. Computing the factors and interpreting them is the central topic of this paper.

We consider decompositions of matrices I with a particular type of ordinal data. Namely, each entry $I_{ij}$ of I represents the grade to which the object corresponding to the ith row has, or is incident with, the attribute corresponding to the jth column. Examples of such data are results of questionnaires where respondents (rows) rate services, products, etc., according to various criteria (columns); results of performance evaluation of people (rows) by various tests (columns); or binary data, in which case there are only two grades, 0 (no, failure) and 1 (yes, success). Our goal is to decompose an $n \times m$ object–attribute matrix I into a product $I = A \circ B$ of an $n \times k$ object–factor matrix A and a $k \times m$ factor–attribute matrix B with the number k of factors as small as possible.

The scenario is thus similar to ordinary matrix decomposition problems, but there are important differences. First, we assume that the entries of I, i.e. the grades, as well as the entries of A and B are taken from bounded scales L of grades, such as the real unit interval $L = [0, 1]$ or the Likert scale $L = \{1, \dots, 5\}$ of degrees of satisfaction. Second, the matrix composition operation ∘ used in our decompositions is not the usual matrix product. Instead, we use a general product based on supremum-preserving aggregation operators introduced in [5], [6], see also [12]. Two important, well-known [2], [11] cases of this product are the sup-t-norm-product defined by $(A \circ B)_{ij} = \bigvee_{l=1}^{k} A_{il} \otimes B_{lj},$ and the inf-residuum-product (denoted also by ◃) defined by $(A \circ B)_{ij} = \bigwedge_{l=1}^{k} A_{il} \to B_{lj},$ where ⊗ and → denote a (left-)continuous t-norm and its residuum [11], [15], and ⋁ and ⋀ denote the supremum and infimum. The ordinary Boolean matrix product is a particular case of the sup-t-norm product in which the scale L has 0 and 1 as the only grades and $a \otimes b = \min(a, b)$. It is to be emphasized that we attempt to treat graded incidence data in a way which is compatible with its semantics. This need was recognized long ago in mathematical psychology, in particular in measurement theory [16]. For example, even if we represent the grades by numbers such as $0 \sim \text{strongly disagree}$, $\frac{1}{4} \sim \text{disagree}$, …, $1 \sim \text{strongly agree}$, addition, multiplication by real numbers, and linear combination of graded incidence data may not have a natural meaning. Consequently, when a matrix I with grades is decomposed into the ordinary matrix product of arbitrary real-valued matrices A and B, the matrices A and B may be difficult to interpret, see [20], [28].

In this paper, we present an algorithm which is based on a theorem from [6] regarding the role of fixpoints of certain Galois connections associated with I as factors for decomposition of I. This is important both from the technical viewpoint, since by [6] optimal decompositions may be obtained this way, and from the knowledge discovery viewpoint, since the fixpoints, called formal concepts, can be interpreted naturally. The algorithm runs in polynomial time and delivers suboptimal decompositions. This is a necessity because, as we show, computing optimal decompositions is an NP-hard optimization problem. In addition, we present an illustrative example demonstrating the usefulness of such decompositions, and an experimental evaluation of the algorithm.
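
The sup-t-norm and inf-residuum products defined earlier in this Introduction can be made concrete with a small example. The following is a minimal sketch, not the authors' implementation. It assumes the scale $L = [0, 1]$ with the Łukasiewicz t-norm and its residuum as one possible choice of ⊗ and →; the function names are hypothetical and chosen for illustration only.

```python
from typing import Callable, List

Matrix = List[List[float]]
BinaryOp = Callable[[float, float], float]


def luk_tnorm(a: float, b: float) -> float:
    """Lukasiewicz t-norm on [0, 1] (assumed choice): max(0, a + b - 1)."""
    return max(0.0, a + b - 1.0)


def luk_residuum(a: float, b: float) -> float:
    """Residuum of the Lukasiewicz t-norm: min(1, 1 - a + b)."""
    return min(1.0, 1.0 - a + b)


def sup_tnorm_product(A: Matrix, B: Matrix, tnorm: BinaryOp = luk_tnorm) -> Matrix:
    """(A o B)_ij = supremum over l of A_il (x) B_lj."""
    k, m = len(B), len(B[0])
    return [[max(tnorm(row[l], B[l][j]) for l in range(k)) for j in range(m)]
            for row in A]


def inf_residuum_product(A: Matrix, B: Matrix,
                         residuum: BinaryOp = luk_residuum) -> Matrix:
    """(A o B)_ij = infimum over l of A_il -> B_lj."""
    k, m = len(B), len(B[0])
    return [[min(residuum(row[l], B[l][j]) for l in range(k)) for j in range(m)]
            for row in A]


# Grades are multiples of 1/4, so the float arithmetic below is exact.
A = [[1.0, 0.5], [0.25, 0.75]]
B = [[0.5, 1.0], [1.0, 0.0]]

print(sup_tnorm_product(A, B))     # [[0.5, 1.0], [0.75, 0.25]]
print(inf_residuum_product(A, B))  # [[0.5, 0.5], [1.0, 0.25]]

# Boolean special case: only grades 0 and 1, with min as the t-norm.
print(sup_tnorm_product([[1, 0], [1, 1]], [[0, 1], [1, 0]], tnorm=min))
# [[0, 1], [1, 1]]
```

With grades restricted to 0 and 1 and `min` as the t-norm, `sup_tnorm_product` coincides with the ordinary Boolean matrix product, in line with the remark above.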

Recently, new methods of matrix decomposition and dimensionality reduction have been developed. One aim is to have methods which are capable of discovering possibly non-linear relationships between the original space and the lower dimensional space [23], [29]. Another is driven by the need to take into account constraints imposed by the semantics of the data. Examples include nonnegative matrix factorization, in which the matrices are constrained to those with nonnegative entries and which leads to additive parts-based discovery of features in data [17]. Another example, relevant to this paper, is Boolean matrix decomposition. Early work on this problem was done in [22], [26]. Recent work on this topic includes [7], [9], [19], [20], [22]. As was mentioned above, Boolean matrix decomposition is a particular case of the problem considered in this paper. Note also that partly related to this paper are methods for decomposition of binary matrices into non-binary ones such as [18], [24], [25], [27], [31], see also [28] for further references.

Section snippets

Decomposition and the factor model

As was mentioned above, we assume that the matrix entries contain elements from scales (grades) equipped with certain aggregation operators. In particular, we assume a general model of (1) in which the entries of A, B, and I are elements of three complete lattices $\langle L_{1}, \leq_{1} \rangle$, $\langle L_{2}, \leq_{2} \rangle$, and $\langle L_{3}, \leq_{3} \rangle$. That is, we assume $A_{il} \in L_{1}$, $B_{lj} \in L_{2}$, and $I_{ij} \in L_{3}$. We denote the operations as usual, adding subscript i; for instance, the infima, suprema, the least, and the greatest element in $L_{2}$ are denoted by $\bigwedge_{2}$, $\bigvee_{2}$, 0₂

Algorithm and complexity of decompositions

Let us define our problem precisely. For a given (that is, constant for the problem) aggregation structure L consisting of complete lattices $L_{1}$, $L_{2}$, $L_{3}$, and an aggregation operation □, the problem we discuss is a minimization (optimization) problem [1] specified as follows:

Problem:	DECOMP(L)
Input:	n × m matrix I with entries in L₃;
Feasible Solution:	n × k and k × m matrices A and B with entries
	in L₁ and L₂, respectively, for which I = A∘B;
Cost of Solution:	k.
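
Whether a pair of matrices is a feasible solution of DECOMP(L) can be checked mechanically. The sketch below is an illustration under stated assumptions: it fixes the sup-t-norm instance of ∘ with the Łukasiewicz t-norm on $[0, 1]$, whereas the problem above is stated for a general aggregation structure, and all function names are hypothetical. It also shows that in this instance a feasible solution with k = m always exists (take A = I and B with 1 on the diagonal and 0 elsewhere, since a ⊗ 1 = a and a ⊗ 0 = 0), so the difficulty lies in minimizing k, not in finding some decomposition.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def tnorm(a: float, b: float) -> float:
    """Lukasiewicz t-norm (assumed choice of the aggregation)."""
    return max(0.0, a + b - 1.0)


def compose(A: Matrix, B: Matrix) -> Matrix:
    """Sup-t-norm product: (A o B)_ij = max over l of A_il (x) B_lj."""
    k, m = len(B), len(B[0])
    return [[max(tnorm(row[l], B[l][j]) for l in range(k)) for j in range(m)]
            for row in A]


def is_feasible(I: Matrix, A: Matrix, B: Matrix) -> bool:
    """A, B form a feasible solution iff I = A o B holds exactly."""
    return compose(A, B) == I


def cost(A: Matrix) -> int:
    """Cost of a solution: the number k of factors (columns of A)."""
    return len(A[0])


def trivial_decomposition(I: Matrix) -> Tuple[Matrix, Matrix]:
    """Feasible but usually far from optimal: A = I, B = m x m identity."""
    m = len(I[0])
    A = [row[:] for row in I]
    B = [[1.0 if l == j else 0.0 for j in range(m)] for l in range(m)]
    return A, B


# Example matrix with grades that are multiples of 1/2 (exact in floats).
I = [[1.0, 0.5, 0.0],
     [0.5, 0.0, 0.0]]

A, B = trivial_decomposition(I)
print(is_feasible(I, A, B), cost(A))  # True 3

# A cheaper feasible solution for this particular I uses a single factor.
A1 = [[1.0], [0.5]]
B1 = [[1.0, 0.5, 0.0]]
print(is_feasible(I, A1, B1), cost(A1))  # True 1
```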

Throughout this section, we assume that $L_{i}$ s are

Examples and experiments

In Section 4.1, we examine in detail a factor analysis of 2004 Olympic Decathlon data. We include this example to illustrate that the algorithm developed in this paper can be used to obtain reasonable factors from data with grades. In Section 4.2, we present an experimental evaluation of our algorithm.

Conclusions

We presented an approach to decomposition and factor analysis of matrices with grades, i.e. of a particular form of ordinal data. The factors in this approach correspond to fixpoints of certain Galois connections associated with the input matrix, which are known as formal concepts. The approach is justified by a theorem according to which optimal decompositions are attained by using formal concepts as factors. The relationship between the factors and original attributes is a non-linear one. An
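
The notion of a formal concept as a fixpoint of a Galois connection, and the rectangle it determines, can be illustrated as follows. This is a minimal sketch under assumptions not spelled out in this excerpt: it uses the concept-forming operators commonly used in fuzzy formal concept analysis, $C^{\uparrow}(j) = \bigwedge_{i} (C(i) \to I_{ij})$ and $D^{\downarrow}(i) = \bigwedge_{j} (D(j) \to I_{ij})$, with the Łukasiewicz t-norm and residuum on $[0, 1]$. The function names are hypothetical.

```python
from typing import List

Matrix = List[List[float]]
Vector = List[float]


def tnorm(a: float, b: float) -> float:
    """Lukasiewicz t-norm (assumed choice)."""
    return max(0.0, a + b - 1.0)


def residuum(a: float, b: float) -> float:
    """Residuum of the Lukasiewicz t-norm."""
    return min(1.0, 1.0 - a + b)


def up(C: Vector, I: Matrix) -> Vector:
    """Grades to which each attribute is shared by all objects in C."""
    n, m = len(I), len(I[0])
    return [min(residuum(C[i], I[i][j]) for i in range(n)) for j in range(m)]


def down(D: Vector, I: Matrix) -> Vector:
    """Grades to which each object has all attributes in D."""
    n, m = len(I), len(I[0])
    return [min(residuum(D[j], I[i][j]) for j in range(m)) for i in range(n)]


def is_formal_concept(C: Vector, D: Vector, I: Matrix) -> bool:
    """<C, D> is a fixpoint: up(C) = D and down(D) = C."""
    return up(C, I) == D and down(D, I) == C


def rectangle(C: Vector, D: Vector) -> Matrix:
    """Rectangle given by <C, D>: entry (i, j) is C(i) (x) D(j)."""
    return [[tnorm(c, d) for d in D] for c in C]


I = [[1.0, 0.5, 0.0],
     [0.5, 0.0, 0.0]]

D = [1.0, 0.5, 0.0]   # a fuzzy set of attributes
C = down(D, I)        # objects having all attributes in D
print(C)                          # [1.0, 0.5]
print(is_formal_concept(C, D, I))  # True

# The rectangle of a formal concept never exceeds I entrywise;
# for this small I, one rectangle already reproduces I.
R = rectangle(C, D)
print(R == I)  # True
```

In this toy case a single formal concept suffices as a factor; in general several rectangles are combined by the supremum to cover I.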

Acknowledgements

R. Belohlavek acknowledges support by grant No. P202/10/0262, and during revision by No. 15-17899S of the Czech Science Foundation. V. Vychodil acknowledges support by the ESF project No. CZ.1.07/2.3.00/20.0059, which is co-financed by the European Social Fund and the state budget of the Czech Republic. This paper is an extended version of the authors' paper “Factor analysis of incidence data via novel decomposition of matrices,” Proc. ICFCA 2009, 83–97.

References (31)

W. Bandler et al., Semantics of implication operators and fuzzy relational products, Int. J. Man-Mach. Stud. (1980)
R. Belohlavek, Concept lattices and order in fuzzy logic, Ann. Pure Appl. Log. (2004)
R. Belohlavek, Sup-t-norm and inf-residuum are one type of relational product: unifying framework and consequences, Fuzzy Sets Syst. (2012)
R. Belohlavek et al., Discovery of optimal factors in binary data via a novel method of matrix decomposition, J. Comput. Syst. Sci. (2010)
D.S. Nau et al., A mathematical analysis of human leukocyte antigen serology, Math. Biosci. (1978)
G. Ausiello, Complexity and Approximation. Combinatorial Optimization Problems and Their Approximability Properties (2003)
R. Belohlavek, Fuzzy Galois connections, Math. Log. Q. (1999)
R. Belohlavek, Optimal decompositions of matrices with entries from residuated lattices, J. Log. Comput. (2012)
P. Cortez et al., A data mining approach to predict forest fires using meteorological data
A.A. Frolov et al., Boolean factor analysis by Hopfield-like autoassociative memory, IEEE Trans. Neural Netw. (2007)
B. Ganter et al., Formal Concept Analysis. Mathematical Foundations (1999)
S. Gottwald, A Treatise on Many-Valued Logic (2001)
M. Grabisch et al., Aggregation Functions (2009)
P. Hájek, Metamathematics of Fuzzy Logic (1998)
K.H. Kim, Boolean Matrix Theory and Applications (1982)

Cited by (9)

Avoiding flatness in factoring ordinal data
2023, Information Sciences
Factorization of classical, two-valued Boolean data became a widely studied topic in the past decade due to its role in analyzing relational data as well as its significance for other fields. Recently, various extensions to factorization of ordinal data, or data with graded (fuzzy) attributes, have been proposed. We identify and describe a fundamental problem regarding quality of factors, which is non-existent in the Boolean case, but naturally appears in the more general setting of ordinal data. As we demonstrate, the problem gets more significant with growing size of the factorized data. We analyze the problem, propose a method to alleviate it, and evaluate experimentally our solution to the problem. We also provide a discussion regarding ramifications of our findings for the concept of cardinality of fuzzy sets.
Heterogeneous formal context and its decomposition by heterogeneous fuzzy subsets
2022, Fuzzy Sets and Systems
Citation Excerpt :
First, we present the overview of generalizations of Formal concept analysis in a fuzzy setting in Table 1. We remind that L-fuzzy formal concepts as the factors of L-fuzzy formal context were initially investigated by Bělohlávek [9] and the important properties of decomposition of matrices with grades from a complete residuated lattice L were thoroughly studied in [3,11,12,14,17–20]. Moreover, the factorization of L-fuzzy concept lattices by congruence relation was explored in [10].
Fuzzy relations are the mappings which are intensively studied in decision, reasoning, or data analysis. We propose a method for decomposition of heterogeneous formal context by heterogeneous fuzzy subsets, which allows representing the heterogeneous structure of truth degrees for each object, heterogeneous structure of truth degrees for each attribute and heterogeneous structure of truth degrees for relationship between each pair of object and attribute. We formulate and prove the properties of decomposition in heterogeneous extension of Formal concept analysis. We put emphasis on the role of special heterogeneous formal concepts which are generated from heterogeneous singleton fuzzy subsets. We illustrate our novel results in our running example of personal requirements for accommodation as the natural continuation of our previous studies.
Selection of appropriate bonds between L-fuzzy formal contexts for recommendation tasks
2022, Information Sciences
The bond between $L$-fuzzy formal contexts can be defined as a (Galois) connection between $L$-fuzzy concept lattices of $L$-fuzzy formal contexts. The selection of appropriate bond from the set of all bonds between $L$-fuzzy formal contexts is an important challenge to apply it in recommendation tasks. We propose the general method for the selection of bonds regarding external information given by $L$-fuzzy relation. The alternative versions of direct products of $L$-fuzzy formal contexts are formulated and explored since we demonstrate that the extent of direct product is a bond between input $L$-fuzzy formal contexts. We present examples of the benevolent and rigorous recommendations in several application domains including the real dataset about music genres. Finally, the connections with factorization of $L$-fuzzy formal contexts and Sugeno integral are thoroughly studied in our paper.
Factorization of matrices with grades via essential entries
2019, Fuzzy Sets and Systems
Citation Excerpt :
Matrices with grades in partially ordered sets L (matrices over L), and fuzzy relations with truth degrees in L, are examined in many papers; for those with L being residuated structures of truth degrees we refer e.g. to [1,11]. As far as factorization of matrices over L is concerned, we refer to [3,8], in which both the fundamental properties of formal concepts of I as optimal factors as well as the first decomposition algorithm are presented. [5] provides factor analyses of various sports datasets by means of this algorithm and examines additional theoretical problems inspired by the analyses.
We present new results regarding the problem of factorization of matrices with grades, or, equivalently, decomposition of fuzzy relations. In particular, we examine geometry of factorizations and the role of fuzzy concept lattices in factorizations of matrices with grades. The results make it possible to reduce input data and enable a more focused search for factors in the search space, and are intended to guide the design of greedy and other approximation algorithms for the decomposition problem, which itself is NP-hard. To demonstrate usefulness of these results, we propose a new factorization algorithm based on these results. Our experiments demonstrate improvements in the quality of factorizations due to the new approach. We conclude by presenting further research topics implied by our findings.
Ranks of fuzzy matrices. Applications in state reduction of fuzzy automata
2018, Fuzzy Sets and Systems
Citation Excerpt :
Later, in [5] fixed points of fuzzy Galois connections were proven to be special cases of maximal subdecompositions of fuzzy matrices. Through a series of papers, while studying formal concept analysis, Bělohlávek and his coworkers provided significant results regarding properties of row and column spaces associated to fuzzy matrices [11,12], optimal decompositions of fuzzy matrices [8,9], and efficient computation of suboptimal decompositions of fuzzy matrices [14]. It is worth noting that Ćirić et al. [17–21,23,30,31] made a great contribution to the theory of fuzzy matrices over complete residuated lattices, and offered a comprehensive survey on the theory of fuzzy automata, particularly relevant to problems of state reduction, determinization and of bisimulation of fuzzy automata.
In this paper we consider different types of ranks of fuzzy matrices over residuated lattices. We investigate relations between ranks and prove that row rank, column rank and Schein rank of idempotent fuzzy matrices are equal. In particular, ranks and corresponding decompositions of fuzzy matrices representing fuzzy quasi-orders are studied in detail. We show that fuzzy matrix decomposition by ranks can be used in the state reduction of fuzzy automata. Moreover, we prove that using rank decomposition of fuzzy matrices improves results of any state reduction method based on merging indistinguishable states of fuzzy automata.
The Discrete Basis Problem and Asso Algorithm for Fuzzy Attributes
2019, IEEE Transactions on Fuzzy Systems
