Two-Party Privacy-Preserving Set Intersection with FHE

Cai, Yunlu; Tang, Chunming; Xu, Qiuxia

doi:10.3390/e22121339

Open AccessArticle

Two-Party Privacy-Preserving Set Intersection with FHE

by

Yunlu Cai

¹

,

Chunming Tang

^1,2,* and

Qiuxia Xu

³

¹

School of Mathematics and Information Science, Guangzhou University, Guangzhou 510006, China

²

State Key Laboratory of Cryptology, P.O. Box 5159, Beijing 100878, China

³

School of Mathematics and Systems Science, Guangdong Polytechnic Normal University, Guangzhou 510665, China

^*

Author to whom correspondence should be addressed.

Entropy 2020, 22(12), 1339; https://doi.org/10.3390/e22121339

Submission received: 28 October 2020 / Revised: 18 November 2020 / Accepted: 22 November 2020 / Published: 25 November 2020

Download

Browse Figures

Versions Notes

Abstract

:

A two-party private set intersection allows two parties, the client and the server, to compute an intersection over their private sets, without revealing any information beyond the intersecting elements. We present a novel private set intersection protocol based on Shuhong Gao’s fully homomorphic encryption scheme and prove the security of the protocol in the semi-honest model. We also present a variant of the protocol which is a completely novel construction for computing the intersection based on Bloom filter and fully homomorphic encryption, and the protocol’s complexity is independent of the set size of the client. The security of the protocols relies on the learning with errors and ring learning with error problems. Furthermore, in the cloud with malicious adversaries, the computation of the private set intersection can be outsourced to the cloud service provider without revealing any private information.

Keywords:

private set intersection; privacy-preserving; fully homomorphic encryption; secure multiparty computation

1. Introduction

In 1978, Rivest first presented the idea of fully homomorphic encryption (FHE) [1]. Gentry constructed the first specific FHE scheme in 2009 [2]. Since then, dramatic progress in FHE is made by Gentry and many other researchers around the world. The first generation is based on an approximate GCD problem of integers and ideal lattices [2,3]; the second generation is based on ring learning with errors (RLWE) and learning with errors (LWE) problems, and developed several techniques, including re-linearization, key switch and modulus reduction, for decreasing noise growth [4,5]; the third generation involves the GSW scheme, which is based on approximate eigenvalues and RLWE [6]. Shuhong Gao’s scheme [7] is a compressed fully homomorphic encryption scheme, denoted by SGFHE below, and this scheme has three features: (1) The cipher with private key encryption is expanded six times and with public key encryption is

10 + l o g 2 (n)

, where n (a power of 2) is the block length of the message; the computation of all ciphertexts is modulo r, where

r = 16 n

; and the boundary of noise size is

n - 1

. (2) The bootstrapping algorithm needs only a bootstrapping key and the boundaries of the noise size of the output ciphers are still

n - 1

with no failure at all. (3) the security of Shuhong Gao’s scheme is based on the learning with errors problems and ring learning with errors problems, and for the block length of any message

n \geq 512

, it costs at least

2^{160}

bit operations for breaking the scheme with the current approaches. In addition, with TFHE bootstrapping [8], the LWE cipher produced could be invalid with a probability of about

2^{- 33}

(for

n = 500

). That probability is very small, and for computing many functions it is useful; however, it cannot be applied to functions that require more than

2^{33}

bit operations (unless increasing n). In SGFHE, the error size of the

L W E

ciphers after bootstrapping are always bound by

n - 1

; this feature is not available in other FHE schemes. The total time cost for the bootstrapping procedure of the SGFHE scheme is about 130 ms, that is, 10 times as much as TFHE.

Secure multi-party computing (SMPC) is mainly about how to compute a function safely without a trusted third party. Secure multi-party computing was first proposed by Yao Qizhi in 1982. After being developed by Goldreich, Micali, Wigderson et al. [9], secure multi-party computing became a very active research field in modern cryptography. The research on MPC [10] is divided into general schemes and specific schemes designed for certain computing scenarios; the general scheme is not as efficient as a specific optimized scheme that is specially designed for a certain application. In practical applications, specific schemes are more widely used [11]. Secret sharing [12], garbled circuit [13,14], oblivious transfer [15], commitment schemes [16] and homomorphic encryption [17] are the key pieces of technology to realize SMPC, and SMPC is of great significance in the study of secret sharing schemes and privacy protection, where it is widely used in correlation analysis, data security queries, trusted data exchanges, etc. [18,19,20,21,22].

Private preserving set intersection (PSI) computing is an important aspect in secure multi-party computing. It not only performs well in scientific computing, but in real life many data can be represented by sets, so it can be used in privacy protection computing to complete corresponding data computing in the sets. The private preserving set intersection computing is the basic operation in many applications, such as machine learning, data mining [23], secure distributed data connection [24] and in privacy protection law enforcement, where it is especially widely used.

1.1. Related Work

Several specialized PSI protocols have been proposed in the literature which are more efficient than using general secure computation [33]. The main methods are: based on oblivious polynomial evaluation [25], based on an oblivious pseudo-random function [26], based on a blind signature [27], based on homomorphic encryption [28], based on the Bloom filter [29], etc. Shen Liyan et al. [30] gave a detailed overview of the development prospects of private preserving set intersection computing, the protocol developed by Google scholar. Mihaela Ion et al. [11] applied private preserving set intersection computing to advertising cooperation.

1.2. Contributions

We present three private set intersection protocols. First, we propose a novel private set intersection protocol based on Shuhong Gao’s fully homomorphic encryption scheme and prove the security of the protocol in the honest-but-curious model. We then present a variant of promoted protocol. We also present a variant of the protocol which is a completely novel construction for computing the intersection based on the Bloom filter and a fully homomorphic encryption; this protocol’s complexity is independent of the set size of the client. The security of the protocol relies on the learning with errors and ring learning with errors problems. Furthermore, in a cloud with malicious adversaries, the computation of the private set intersection can be outsourced to the cloud service provider without revealing any private information. The ciphertext extension of the protocols is small so that the protocols have strong practicability.

The remainder of the paper is structured as follows: We next review the basic concepts and techniques used in Section 2. In Section 3, we introduce the homomorphic operation used. We describe the basic two-party computing protocol, the improvement protocol and the two-party computing protocol based on the Bloom filter in Section 4. We present our conclusions in Section 5.

2. Basic Concepts and Techniques

2.1. Notation

Let

χ

be an error distribution; according to the distribution

χ

,

x \leftarrow χ

is randomly chosen. For an integer

n \geq 1

, let

R_{n} = Z [x] / (x^{n} + 1)

,

R_{n, q} = Z [x] / (x^{n} + 1, q)

, where

(x^{n} + 1, q)

represents the ideal of

Z [x]

generated by

x^{n} + 1

and q. For any polynomial

f (x) = \sum_{i = 0}^{d} f_{i} (x^{i}) \in R (x)

, we define the ∞-norm as

{| | f (x) | |}_{\infty} = max_{0 \leq i \leq d} | f_{i} |

.

2.2. LWE Ciphers and Modulus Reduction

Regev proposed LWE problem [31,32] over

Z_{q}

. Let

χ

be a probabilistic distribution, and

s \in Z_{q}^{n}

be an arbitrary vector that is a secret key of any user.

(a, b)

is an LWE sample, where

a \in Z_{q}^{n}

is selected randomly and uniformly,

b \equiv 〈s, a〉 + e (mod q)

,

e \leftarrow χ

.

Let

D_{q} = ⌊q / 4⌋

,

1 \leq τ \leq D_{q} / 2

,

a \leftarrow Z_{q}^{n}

, and compute

b \equiv 〈s, a〉 + e + x D_{q} (mod q)

for encrypting one bit,

e \in [- τ, τ]

. Let

E_{s} (x) = (a, b) \in Z_{q}^{n} \times Z_{q}

,

(a, b)

is the LWE ciphertext for

x \in {0, 1}

. Note that

D_{q} = ⌊q / 2⌋

in Regev’s but

D_{q} = ⌊q / 4⌋

in SGFHE scheme for homomorphic bit operations.

Modulus reduction can reduce the LWE ciphers of

Z_{q}

to

Z_{r}

where r is far less than q.

Lemma 1

([7]). Let

s, a \in Z_{q}^{n}

,

e \in Z_{q}^{n}

with

| e | \leq τ

,

D_{r} = ⌊r / 4⌋

, and

b \equiv 〈s, a〉 + e + x D_{q} (mod q) .

(1) Suppose

τ \in q (n - 3) / (2 r), q \geq 4 r

and

s \in {0, 1}^{n}

.

b^{'} = ⌊r b / q⌉, a^{'} = ⌊r a / q⌉

; then

b^{'} \equiv 〈s, a^{'}〉 + e + x D_{r} (mod r) .

(2) Let

ℓ = ⌈l o g_{2}^{q}⌉

,

q \geq 16

. Suppose that

τ \leq q (n ℓ - 5) / (2 r)

with

s \leftarrow Z_{q}^{n}

. Then there exist

s^{'} \in {0, 1}^{n l}, a^{'} \in Z_{r}^{n ℓ}

and

b^{'} \in Z_{r}

, satisfying

b^{'} \equiv s^{'} {(a^{'})}^{t} + e^{'} + x D_{r} (mod r),

where

e^{'} \in Z

,

| e^{'} | \leq n ℓ

.

2.3. RLWE Ciphers

Lyubashevsky et al. introduced the RLWE problem to acquire more efficient encryption schemes [33]. An RLWE sample

v = (a (x), b (x)) \in R_{n}^{2}

, where

a (x) \leftarrow R_{n, q}, a (x) = \sum_{i = 0}^{n - 1} a_{i} x^{i}

, and

b (x) : = s (x) a (x) + e (x) (mod (x^{n} + 1, q))

for some

e (x) \leftarrow R_{n}

,

{| | e (x) | |}_{\infty} \leq τ

,

τ

is the bound of error.

v {(- s (x), 1)}^{t} \equiv e (x) (mod (x^{n} + 1, q)) .

Let

m (x) = \sum_{i = 0}^{n - 1} m_{i} x^{i}

, where

m_{i} \in {0, 1}

denotes an n-bit message. The RLWE cipher of

m (x)

with error size

τ

is

R E_{s} (m (x)) = v + m (x) D_{q} (0, 1) \in R_{n, q}^{2} .

Suppose

R E_{s} (m (x)) = (a (x), b (x))

. We have

b (x) - s (x) a (x) \equiv m (x) D_{q} + e (x) (mod (x^{n} + 1, q)),

when

τ \leq D_{q} / 2

, the message

m (x)

can be recovered from

m (x) \equiv b (x) - s (x) a (x) (mod (x^{n} + 1, q))

.

2.4. GSW Ciphers and External Product

2.4.1. Gadget Matrix

Suppose that B and l are positive integers so that

B^{ℓ} \geq q

. Suppose that when

g = (1, B, \dots, B^{ℓ - 1})

, an arbitrary

a \in Z_{q}

could be denoted by

a = a_{0} + a_{1} + \dots + a_{ℓ - 1} B^{ℓ - 1} = (a_{0} + a_{1}, \dots + a_{ℓ - 1}) g^{t},

where

a_{i} \in Z

has a small size. Let

- B / 2 \leq a_{i} \leq B / 2

; then

(a_{0} + a_{1}, \dots + a_{ℓ - 1})

is unique. Let

- 2 B \leq a_{i} \leq 2 B

; the lemma as following is straightforward to prove.

Lemma 2

([7]). Let

B^{ℓ} \geq q

,

a \in Z

. For

0 \leq i \leq ℓ - 1

, choose

x_{i} \leftarrow Z, | x_{i} | \leq 3 B / 2

, which is uniform, random and independent. Suppose that

\begin{matrix} a - (x_{0} + x_{1} B + \dots + x_{ℓ - 1} B^{ℓ - 1}) \\ \equiv y_{0} + y_{1} B + \dots + y_{ℓ - 1} B^{ℓ - 1} (mod q) \end{matrix}

where

| y_{i} | \leq B / 2

. Set

a_{i} = x_{i} + y_{i}

; then

(a_{0}, a_{1}, \dots, a_{ℓ - 1})

is uniform random solution to

a \equiv a_{0} + a_{1} B + \dots + a_{ℓ - 1} B^{ℓ - 1} (mod q)

with

| a_{i} | \leq B / 2

.

Hence, any list of elements in

Z_{q}

can be extended. That is, each polynomial

a (x) \in R_{n, q}

can be denoted by

\begin{matrix} a (x) & = a_{0} (x) + a_{1} (x) B + \dots + a_{ℓ - 1} (x) B^{ℓ - 1} \\ = (a_{0} (x), a_{1} (x), \dots, a_{ℓ - 1} (x)) g^{t}, \end{matrix}

where

| | a_{i} (x) {| |}_{\infty} \leq 2 B

. A gadget matrix of

(2 ℓ) \times 2

is defined as

G = (\begin{matrix} g^{t} & 0 \\ 0 & g^{t} \end{matrix}) .

Any

(a (x), b (x)) \in R_{n, q}^{2}

can be denoted by

(a (x), b (x)) = u (x) G

(1)

where

u (x) \in R_{n}^{2 ℓ}

is selected randomly and uniformly, and

{| | u (x) | |}_{\infty} \leq 2 B

. Here

G^{- 1}

, only as an operator, acts on the right of

(a (x), b (x))

(G is not a square matrix, so it has no inverse).

u (x) = (a (x), b (x)) ◃ G^{- 1}

A row vector

u (x)

has

2 ℓ

polynomials; the coefficients of the polynomials are small and at most

2 B

. This can increase the dimension to decease the coefficient. By the above definition, we have the following equation.

(v ◃ G^{- 1}) G = v, \forall v \in R_{n, q}^{2}

2.4.2. External Product

Suppose that a row vector

v = (a (x), b (x)) \in R_{n, q}^{2}

, and arbitrary matrices

A \in R_{n, q}^{2 ℓ \times 2}

of

2 ℓ \times 2

, define the external product of

v

and A as

v ⊙ A = (v ◃ G^{- 1}) A \in R_{n, q}^{2};

it is a random vector; for

v ◃ G^{- 1}

is a random vector of

1 \times 2 ℓ

. By definition, the external product satisfies the right distributive, namely, for arbitrary two matrices

A, B \in R_{n, q}^{(2 ℓ) \times 2}

of

2 ℓ \times 2

, we have

\begin{matrix} v ⊙ (A + B) & \equiv (v ◃ G^{- 1}) (A + B) \\ = (v ◃ G^{- 1}) A + (v ◃ G^{- 1}) B \\ = v ⊙ A + v ⊙ B (mod (x^{n} + 1, q)) . \end{matrix}

2.4.3. GSW Ciphers

Let an n-bit secret key

s (x) = \sum_{i = 0}^{n - 1} s_{i} x^{i}

, where

s_{i} \in {0, 1}

,

R L W E

sample

A \leftarrow R_{n, q}^{2 ℓ \times 2}

(the rows of A are

R L W E

samples) and a GSW cipher for

m (x) \in R_{n}

is

G S W_{s} (m (x)) = A + m (x) G \in R_{n, q}^{2 ℓ \times 2};

according to the definition of

R L W E

sample

A {(- s (x), - 1)}^{t} \equiv w (x) (mod (x^{n} + 1, q)),

where

w (x) \in R_{n}^{2 ℓ}

, and

| | w (x) | | \leq τ

;

τ

is the error size of GSW ciphers.

Lemma 3

([7]). Let

m_{0} (x), m_{1} (x) \in R_{n}

be any two polynomials. For any

R E_{s} (m_{0} (x))

with error size τ and any

G S W_{s} (m_{1} (x))

with error size

τ_{1}

, we have

R E_{s} (m_{0} (x)) ⊙ G S W_{s} (m_{1} (x)) = R E_{s} (m_{0} (x) m_{1} (x))

and

R E_{s} (m_{0} (x) m_{1} (x))

which has an error size of at most

τ | | m_{1} (x) {| |}_{\infty} + 4 B n ℓ τ_{1}

.

2.5. Bloom Filter

A Bloom filter [34] is a compact data structure for probabilistic set membership testing, and can insert and query data efficiently. The Bloom filter provides a time and space-efficient method to check whether there is an element in the set. A Bloom filter consists of a binary vector and a set of hash functions;

b_{j}

represents the j-th bit of the Bloom filter b and all elements of the empty Bloom filter are 0. Any Bloom filter b includes the three steps as follows:

C r e a t e (α) :

Create an empty Bloom filter with

α

bits; the hash function

{h_{i} | 0 \leq i < β}

is:

h_{i} : {0, 1}^{*} \to {0, \dots, α - 1} .

A d d (x) :

Compute

β

hash values

g_{i} = h_{i} (x)

of the element x using the hash function

h_{i} (0 \leq i < β)

. Set the Bloom filter cell with subscript

g_{i}

to 1.

g_{i} = h_{i} (x) ⟹ b_{g_{i}} = 1

T e s t (x) :

Test whether the element x is in the Bloom filter b. Compute

β

hash values

g_{i} = h_{i} (x)

of the element x; if the

β

cells with subscript

g_{i}

are 1 (

b_{g_{i}} = 1

), then return 1 (true).

T e s t (x) = \land_{i = 0}^{β - 1} b_{h_{i} (x)} = \land_{i = 0}^{β - 1} b_{g_{i}}

(2)

The Bloom filter has a negligible false positive probability;

T e s t (x)

will return 1, although x cannot be added to the Bloom filter. Given

ω

elements to be added and the expected maximum false positive probability

2^{- k}

, the Bloom filter size

α

needs to satisfy:

α \geq \frac{ω k}{l n^{2} 2} .

A Bloom filter is widely used in cryptography. Bellovin and Cheswick [35] and Goh [36] implemented a securely document search using a Bloom filter. Raykov and Bellovin [37] realized a secure database query. Qiu L and Li Y [38] realized privacy data mining and BIP-0037 put forward the application of a Bloom filter in Bitcoin. Reference [39,40,41] realized the set intersection computing based on Bloom filters.

3. Homomorphic Operations

In SGFHE scheme, let any two LWE ciphers be

E_{s} (x_{1})

and

E_{s} (x_{2})

with

x_{1}, x_{2} \in {0, 1}

; one bootstrapping can compute three bit operations

E_{s} (x_{1} \land x_{2}), E_{s} (x_{1} \lor x_{2})

and

E_{s} (x_{1} \oplus x_{2})

; the scheme follows the approach in Ducas et al. [42] and Chillotti [43], but does not need to perform a key switch.

3.1. Key Generations

Let n be a power of 2,

n \geq 64

. Suppose that r can be divided by 8;

m = r / 2, B = 35 r^{2} n

,

r \geq 16 n, q \geq n r, 1220 r^{4} n^{2} \leq Q < 1225 r^{4} n^{2} = B^{2} .

(3)

D_{r} = ⌊r / 4⌋, D_{q} = ⌊q / 4⌋, {\tilde{D}}_{Q} = ⌊Q / 8⌋, a n d G = (\begin{matrix} 1 & 0 \\ B & 0 \\ 0 & 1 \\ 0 & B \end{matrix}) .

Secret key Pick

s \leftarrow {0, 1}^{n}

uniformly and randomly; let

s (x) = \sum_{i = 0}^{n - 1} s_{i} x^{i}

.

Public key

p k = (k_{0} (x), k_{1} (x)), k_{0} (x) \leftarrow R_{n, q}

,

k_{1} (x) \equiv k_{0} (x) s (x) + e (x) (mod x^{n} + 1, q),

where

e (x) \leftarrow R_{n} {, | | e (x) | |}_{\infty} < D_{q} / (41 n)

.

Bootstrapping key A bootstrapping key

b k = (C_{0}, C_{1}, \dots, C_{n - 1})

can be generated as follows:

For

1 \leq i \leq n - 1

do:

(1): Pick $a_{j i} \leftarrow R_{m, Q}$ , $1 \leq j \leq 4$ ;
(2): Pick $e_{j i} (x) \in R_{m}$ , $| | e_{j i} {| |}_{\infty} \leq n$ , $1 \leq j \leq 4$ ;
(3): Compute $b_{j i} (x) : = a_{j i} (x) s (x) + e_{j i} (x) (mod x^{m} + 1, Q)$ , $1 \leq j \leq 4$ ;
(4): Set $C_{i} : = (\begin{matrix} a_{1 i} (x) & b_{1 i} (x) \\ a_{2 i} (x) & b_{3 i} (x) \\ a_{3 i} (x) & b_{2 i} (x) \\ a_{4 i} (x) & b_{4 i} (x) \end{matrix}) + s_{i} G (mod Q)$ .

3.2. Bootstrapping Algorithm

Lemma 4.

Suppose that a bootstrapping key

b k

has an error size at most

τ_{1}

; r is divisible by 8 and

r \geq 16 n, Q \geq \frac{n}{n - 3} 16 B r^{2} ℓ τ_{1}

. Then, for any two LWE ciphers

E_{s} (x_{i}) = v_{i} \in Z_{r}^{n} \times Z

, with error size

\leq D_{r} / 4

where

x_{i} \in {0, 1}

for

i = 1, 2

, the bootstrapping algorithm in Algorithm 1 outputs random LWE ciphers

E_{s} (x_{1} \land x_{2}), E_{s} (x_{1} \lor x_{2}), E_{s} (x_{1} \oplus x_{2}) \in Z_{r}^{n} \times Z_{r}

all with error size

< n \leq D_{r} / 4

[7].

Algorithm 1 Bootstrapping Algorithm:

B T_{b k} (v_{1}, v_{2}) \to c_{1}, c_{2}, c_{3}

.

Input:

b k = (C_{0}, C_{1}, \dots, C_{n - 1}) \in R_{m, Q}^{2 ℓ \times 2}

: bootstrapping key;

(v_{1}, v_{2}) \in Z_{r}^{n} \times Z_{r}

:

v_{i} = E_{s} (x_{i}), x_{1}, x_{2} \in {0, 1}

;

Output:

E_{s} (x_{1} \land x_{2}), E_{s} (x_{1} \lor x_{2}), E_{s} (x_{1} \oplus x_{2}) \in Z_{r}^{n} \times Z_{r}

;

1:: Compute $u : = v_{1} + v_{2} = (u_{0}, u_{1}, \dots, u_{n - 1}, u_{n}) \in Z_{r}^{n} \times Z_{r}$ ;
2:: $T : = {j \in Z : - D_{r} \leq j \leq D_{r}}, t (x) : = \sum_{j \in T} x^{j}$ ;
3:: $A : = (0, t (x) x^{- u_{n}} {\tilde{D}}_{Q}) \in R_{m, Q}^{2}$ ;
4:: for $k from 0 to n - 1$ do
5:: $A : = A ⊙ (G + (x^{u_{k}} - 1) C_{k})$ ;
6:: end for
7:: Let $A = (\sum_{i = 0}^{m - 1} a_{i} x^{i}, \sum_{i = 0}^{m - 1} b_{i} x^{i})$ . Set
$a_{1} : = (E x t r a c t (a (x), 3 m / 4), {\tilde{D}}_{Q} + b_{3 m / 4}) \in Z_{Q}^{n} \times Z_{Q}$ ;
$a_{2} : = (E x t r a c t (a (x), m / 4), {\tilde{D}}_{Q} - b_{m / 4}) \in Z_{Q}^{n} \times Z_{Q}$ ;
$a_{3} : = a_{2} - a_{1} \in Z_{Q}^{n} \times Z_{Q}$ ;
8:: for $i from 1 to 3$ do
9:: $c_{i} : = ⌊r a / Q⌉ \in Z_{r}^{n} \times Z_{r}$ ;
10:: end for
11:: Return $c_{1}, c_{2}, c_{3}$ ;

3.3. Encryption Scheme

Lemma 5.

Suppose that

r = 2^{t + 1}

;

(a (x), b (x)) \in R_{n, r}^{2}

can be computed from Algorithm 2. Then for some

ω_{3} (x), | | ω_{3} (x) {| |}_{\infty} \leq D_{r} / 4

, so that

2^{t - 4} b (x) - s (x) a (x) \equiv ω_{3} (x) + m (x) D_{r} (mod x^{n} + 1, r) .

Specifically, if

r = 16 n

, then

(u, v)

returned in Algorithm 2 has 6n bits and represents an

R L W E

cipher

R E_{s} (m (x))

, and the error size

< n

[7].

Algorithm 2 Encryption with private key:

R E_{s} (m (x)) \to (u, v

).

Input:n-bit secret key

s (x) = \sum_{i = 0}^{n - 1} s_{i} x^{i}, s_{i} \in {0, 1}

;

n-bit message

m (x) = \sum_{i = 0}^{n - 1} m_{i} x^{i}, m_{i} \in {0, 1}

;

t : = ⌈l o g_{2} (r)⌉

, hence

2^{t} \leq r \leq 2^{t - 1}

;

P : {0, 1} \to {0, 1}^{n (t + 1)}

;

Output:

(u, v) \in {0, 1}^{n} \times {{0, 1}^{5}}^{n}

1:: $u \leftarrow {0, 1}^{n}, a (x) : = P (u, x) \in R_{n, r}$ ;
2:: $ω (x) \leftarrow R_{n} {, | | ω (x) | |}_{\infty} \leq D_{r} / 8, b_{1} (x) : = a (x) s (x) + ω (x) + m (x) D_{r} (mod x^{n} + 1, r)$ ;
3:: $b (x) = \sum_{i = 0}^{n - 1} b_{i} x^{i} : = ⌊b_{1} (x) / 2^{t - 4}⌋$ ;
4:: $v = (b_{1}, b_{2}, \dots, b_{n - 1}) \in {({0, 1}^{5})}^{n}$ ;
5:: return $(u, v);$

Lemma 6.

Suppose that

r = 2^{t + 1}, r \geq 16 n, q \geq 4 r a n d n \geq 164

. Suppose that

R E_{p k} (m (x)) : = (a (x), b (x)) \in R_{n, r}^{2}

be any ciphertext output by Algorithm 3. Then

2^{t - 5} b (x) - s (x) a (x) \equiv ω_{3} (x) + m (x) D_{r} (mod x^{n} + 1, r)

for some

ω_{3} (x) \in R_{n}

with

| | ω_{3} (x) {| |}_{\infty} \leq D_{r} / 4

.

Specifically, if

r = 16 n

, then any ciphertext

(a (x), b (x))

has

n (10 + l o g_{2} (n))

bits and the error, that is, each coefficient of

ω_{3} (x)

, is in

(- n, n)

randomly [7].

We can divide the data

x

into d blocks of length n. Let

N = d n

,

x = (x_{1}, x_{2}, \dots, x_{d}) \in {0, 1}^{N}

,

x_{k} = (x_{k, 0}, x_{k, 1}, \dots, x_{k, n - 1}), x_{k} \in {0, 1}^{n}

. Each

x_{k}

can be expressed as a polynomial

\sum_{i = 0}^{n - 1} x_{k, i} x^{i} \in R_{n}

. Then—encrypted using the private-key scheme

c_{k} = R E_{s} (x_{k}), 1 \leq k \leq d

by Algorithm 2—note that the cipher text size

c_{k}

is about 6N bits and then encrypted using the public-key scheme

c_{k}^{'} = R E_{p k} (x_{k}), 1 \leq k \leq d

by Algorithm 3; note that the cipher text size

c_{k}

’ is about

N (10 + l o g_{2} (n))

. Homomorphic computing can be performed in three steps as follows:

Algorithm 3 Encryption under public key:

R E_{p k} (m (x)) \to (a (x), b (x)) \in R_{n, r}^{2}

.

Input:

p k = (k_{0} (x), k_{1} (x)), k_{0} (x) \leftarrow R_{n, q}

;

m (x) = \sum_{i = 0}^{n - 1} m_{i} x^{i}

:n-bit message where each

m_{i} \in {0, 1}

;

t : = ⌈l o g_{2} (r)⌉

;

Output:

(a (x), b (x)) \in R_{n, r}^{2}

u (x) \leftarrow R_{n}

,

1:: $u (x) \leftarrow R_{n}$ , each coefficient random from ${- 1, 0, 1}$ ;
2:: $ω_{1} (x) \leftarrow R_{n}, | | ω_{1} (x) {| |}_{\infty} \leq D_{q} / (41 n)$ ;
3:: $ω_{2} (x) \leftarrow R_{n}, | | ω_{2} (x) {| |}_{\infty} \leq D_{q} / 82$ ;
4:: $a_{1} (x) : = k_{0} (x) u (x) + ω_{1} (x) (mod x^{n} + 1, q)$ ;
5:: $b_{1} (x) : = k_{1} (x) u (x) + ω_{2} (x) + m (x) D_{q} (mod x^{n} + 1, q)$ ;
6:: $a (x) : = ⌊\frac{r}{q} a_{1} (x)⌉, b (x) : = ⌊\frac{r}{2^{t - 5} q} b_{1} (x)⌉$ ;
7:: Return $(a (x), b (x))$

(1): Unpacking the $R L W E$ ciphertexts $R E (x_{k})$ to get $L W E$ ciphers in $Z_{r}^{n} \times Z_{r}$ for the bits of $x$ .

$R E (x_{k}) \overset{u n p a c k}{⟶} E_{s} (c_{k, i})$
(2): Homomorphic computing of $f (x) = y = {y_{0}, y_{1}, \dots, y_{M}} \in {0, 1}^{M}$ on $L W E$ ciphers.

$f (x) \overset{B T_{b k}}{⟶} {E_{s} (y_{0}), E_{s} (y_{1}), \dots, E_{s} (y_{M})}$
(3): Packing the $L W E$ ciphers ${E_{s} (y_{0}), E_{s} (y_{1}), \dots, E_{s} (y_{M})}$ of function f into $R L W E$ ciphers in $R_{n, r}^{2}$ .

${E_{s} (y_{0}), E_{s} (y_{1}), \dots, E_{s} (y_{M})} \overset{p a c k}{⟶} R E_{s} (y)$

4. Privacy-Preserving Set Intersection

We abstract the privacy set intersection computation model as follows. The client C owns a set

{c_{1}, \dots, c_{v}}

of size v, and the server S holds a set

{s_{1}, \dots, s_{ω}}

of size

ω

. After the end of the protocol, the client C only obtains the intersection

{c_{1}, \dots, c_{v}} ⋂ {s_{1}, \dots, s_{ω}}

; however, the server cannot get any information for the input and the set intersection of the client (including the size of the intersection).

4.1. The Basic Two-Party Computing Protocol

The summary of basic private two-party intersection protocol is shown in Figure 1. The specific steps are as follows:

1.: The client C encrypts the set with private key and sends ciphertexts to the server S.
2.: The server S implements homomorphic computing with bootstrapping key and sends the result to the client C.
3.: The client C decrypts and computes the intersection of the two sets; the server S cannot acquire any information about the input and output.

Our basic two-party computing protocol is shown in Figure 2. At step

C \to S

, the client sends

p k, b k

and

R E_{s k} (c_{k})

to the server. At step S, the server unpacks

R E_{s k} (c_{k})

to get

E_{s k} (c_{k, j})

, unpacks

R E_{p k}

to get

E_{s k} (s_{i, j})

, samples

u \in {0, 1}^{n}

, calls bootstrapping operations to compute

E_{s k} (z_{k, i})

, computes

L W E

ciphers

E_{s k} (w_{i, j})

, packs the resulted LWE ciphers

E_{s k} (w_{i, j})

into RLWE ciphers

R E_{s k} (w_{i})

and sends them to the client. At step C, the client decrypts

R E_{s k} (w_{i})

to get

w_{i}

and computes the intersection.

4.2. Correctness of the Basic Two-Party Computing Protocol

First, the correctness of SGFHE scheme has been proven.

Let

c_{k}, s_{i}

be the set elements’ binary representation of the client and server respectively. The insufficient bits are filled with 0s and we extend the length to n.

c_{k} = {c_{k, 1}, \dots, c_{k, n}} = {0, 1}^{n}, 1 \leq k \leq v

s_{i} = {s_{i, 1}, \dots, s_{i, n}} = {0, 1}^{n}, 1 \leq i \leq ω

u \overset{s a m p l e}{⟵} {0, 1}^{n}

z_{k, i} = ⋁_{j = 1}^{n} (c_{k, j} \oplus s_{i, j})

(4)

If

z_{k, i} = 1

, then

c_{k} \neq s_{i}

; if

z_{k, i} = 0

, then

c_{k} = s_{i}

.

The server can acquire

E_{s k} (z_{k, i})

by

R E_{s k} (c_{k}) \overset{u n p a c k}{⟶} E_{s k} (c_{k, j}), R E_{p k} (s_{i}) \overset{u n p a c k}{⟶} E_{s k} (s_{i, j})

and call

(2 n - 1)

bootstrapping operations, denoted by

z_{k, i} = ⋁_{j = 1}^{n} (c_{k, j} \oplus s_{i, j}) \overset{B T_{b k}}{⟶} E_{s k} (z_{k, i})

.

Remark: RE represents RLWE cipher; E represents LWE cipher.

Let

z_{i} = ⋀_{k = 1}^{v} z_{k, i};

(5)

E_{s k} (z_{i}) \in Z_{r}^{n} \times Z_{r}

can be computed from

E_{s k} (z_{k, i})

by implementing

(v - 1)

bootstrapping operations. Hence, implementing

(2 n + v - 2)

bootstrapping operations by (6) can compute

E_{s k} (z_{i})

.

z_{i} = ⋀_{k = 1}^{v} z_{k, i} = ⋀_{k = 1}^{v} ⋁_{j = 1}^{n} (c_{k, j} \oplus s_{i, j}) \overset{B T_{b k}}{⟶} E_{s k} (z_{i})

(6)

If

z_{i} = 1

, then

w_{i} = u

is a random value with

\forall k, c_{k} \neq s_{i}

; if

z_{i} = 0

, then there

\exists k

so that

c_{k} = s_{i}, w_{i} = c_{k} = s_{i}

is in the intersection. For

s_{i}

and

u

, each bit

w_{i, j} = z_{k} \land u_{j} \oplus (1 - z_{k}) \land s_{i, j}

can be computed by

w_{i} = {w_{i, 1}, \dots, w_{k, n}} = z_{i} u \oplus (1 - z_{i}) s_{i}

(7)

For plaintexts

u_{j}

and

s_{i, j}

, an LWE cipher of any bit

z_{k} \land u_{j} \oplus (1 - z_{k}) \land s_{i, j}

can be computed as

u_{j} E_{s k} (z_{i}) + s_{i, j} (E_{s k} (1) - E_{s k} (z_{k})),

which still has error size

< D_{r} / 4

.

The

L W E

cipher is

E_{s k} (w_{i, j}) = u_{j} E_{s k} (z_{i}) + s_{i, j} (E_{s k} (1) - E_{s k} (z_{k})) .

(8)

The server can pack the resulted LWE ciphers

E_{s k} (w_{i, j})

into RLWE ciphers

R E_{s k} (w_{i})

and send them to the client.

In the end, the client decrypts

D e c (R E_{s k} (w_{i})) ⟹ w_{i}

and computes the intersection

{c_{1}, \dots, c_{v}} ⋂ {w_{1}, \dots, w_{ω}} ⟹

{c_{1}, \dots, c_{v}} ⋂ {s_{1}, \dots, s_{ω}} .

4.3. Security Analysis of the Basic Two-Party Computing Protocol

We analyze the security of the protocol by comparing the real model and the ideal model. The real model is the actual implementation of the basic private intersection protocol and it is a trusted server for computing the intersection. The trusted server receives the input

{c_{1}, \dots, c_{v}}

of the client and the input

{s_{1}, \dots, s_{ω}}

of the server, and will return the intersection with the client; however, the server cannot get any information about the output. The ideal model maintains all security evidence. In the semi-honest model, the participant’s view includes its own input and the information received from other participants during the progression of the protocol. The simulator can use the participant’s input and output to build a simulation that is computationally indistinguishable from the views. That proves that the participants cannot obtain any other information besides the inputs and outputs.

Theorem 1.

If SGFHE is held, then the basic two-party computing protocol can realize the private set intersection computing under the semi-honest model.

Proof.

In the protocol, the server cannot obtain any other information besides receiving the

R L W E

ciphers. Its view can only be simulated with ciphertexts and its security is based on IND-CPA security of

R L W E

scheme.

The client only receives the

R L W E

ciphers of the intersections and the random

R L W E

ciphers. Therefore, it just includes the output information of the set intersection and the view of simulator is only the output information of the set intersection. □

4.4. The Improvement of the Basic Two-Party Computing Protocol

In the basic two-party computing protocol, the server will return the ciphertexts of the intersection elements or the random ciphertexts, and computes the intersection by decrypting the ciphertexts. In our improvement protocol shown in Figure 3, we just need to determine whether

c_{k}

is in

{s_{1}, \dots, s_{ω}}

without computing the ciphertexts of the intersection elements by the server. On the one hand, it can reduce the computational complexity; on the other hand, it will not reveal the size of the server set.

Let

c_{k}, s_{i}

be the set elements’ binary representations of the client and the server respectively. The insufficient bits are filled with 0s and we extend the length to n.

c_{k} = {c_{k, 1}, \dots, c_{k, n}} = {0, 1}^{n}, 1 \leq k \leq v

s_{i} = {s_{i, 1}, \dots, s_{i, n}} = {0, 1}^{n}, 1 \leq i \leq ω

z_{k, i} = ⋁_{j = 1}^{n} (c_{k, j} \oplus s_{i, j})

(9)

If

z_{k, i} = 1

, then

c_{k} \neq s_{i}

; if

z_{k, i} = 0

, then

c_{k} = s_{i}

.

The server can acquire

E_{s k} (z_{k, i})

by

R E_{s k} (c_{k}) \overset{u n p a c k}{⟶} E_{s k} (c_{k, j})

,

R E_{p k} (s_{i}) \overset{u n p a c k}{⟶} E_{s k} (s_{i, j})

and call

(2 n - 1)

bootstrapping operations, denoted by

z_{k, i} = ⋁_{j = 1}^{n} (c_{k, j} \oplus s_{i, j}) \overset{B T_{b k}}{⟶} E_{s k} (z_{k, i})

. The server packs

L W E

ciphers

E_{s k} (z_{k, i})

to

R L W E

ciphers

R E_{s k} (z_{k})

and sends them to the client.

{E_{s k} (z_{k, 1}), E_{s k} (z_{k, 2}), \dots, E_{s k} (z_{k, ω})} \overset{p a c k}{⟶} R E_{s k} (z_{k}),

the client decrypts

R E_{s k} (z_{k})

, if all

z_{k}

is 1, then

c_{k} \notin {c_{1}, \dots, c_{v}} ⋂ {s_{1}, \dots, s_{ω}};

else

c_{k} \in {c_{1}, \dots, c_{v}} ⋂ {s_{1}, \dots, s_{ω}} .

In the protocol, the server cannot obtain any other information besides

R L W E

ciphers and the view can only be simulated by the ciphertexts. Its security is based on IND-CPA security of

R L W E

scheme.

The client acquires

z_{k, i}

by (9), however, the probability of obtaining

s_{i, j}

from

z_{k, i}

and

c_{k, j}

is

2^{- n}

, and it is negligible. The client only receives the output of the intersection; therefore, the view of simulator is just the output of the set intersection.

4.5. Two-Party Computing Protocol Based on a Bloom Filter

In this section, we construct a two-party protocol based on Bloom filter shown in Figure 4, in which the client C encrypts each bit of the Bloom filter with private key and sends it to the server S. The server S homomorphic computes

T e s t (s_{j})

with the bootstrapping key of client C and sends it to the client. C will obtain the intersection of the two sets by decrypting, but the server cannot get any information about the input and output (including the size of the intersection).

Let

c_{k}, s_{i}

be the set elements’ binary representations of the client and the server respectively. The insufficient bits are filled with 0s and we extend the length to n.

c_{k} = {c_{k, 1}, \dots, c_{k, n}} = {0, 1}^{n}, 1 \leq k \leq v .

s_{j} = {s_{j, 1}, \dots, s_{j, n}} = {0, 1}^{n}, 1 \leq j \leq ω .

The client C constructs a Bloom filter

b = c r e a t e (α)

and sends

p k, b k, R E_{s k} (b)

to the server S.

z_{j} = T e s t (s_{j}) = \land_{i = 0}^{β - 1} b_{h_{i} (s_{j})}

(10)

According to (10), input

E_{s k} (b_{1}), \dots, E_{s k} (b_{α})

,

E_{s k} (z_{j}) = \land_{i = 0}^{β - 1} E_{s k} (b_{h_{i} (s_{j})}) .

Call

(β - 1)

bootstrapping operations to obtain

E_{s k} (z_{j})

, denoted by

z_{j} = T e s t (s_{j}) = \land_{i = 0}^{β - 1} b_{h_{i} (s_{j})} \overset{B T_{b k}}{⟶} E_{s k} (z_{j}) .

w_{j} = {w_{j, 1}, \dots, w_{j, n}} = z_{j} s_{j} \oplus (1 - z_{j}) u

(11)

If

z_{j} = 1

, then there

\exists k

such that

c_{k} = s_{j}

, and computing

w_{j} = s_{j}

by (11); similarly, if

z_{j} = 0

, then

\forall k

such that

c_{k} \neq s_{j}

, and computing

w_{j} = u

by (11). For plaintexts

s_{j}

and

u

, each bit can be computed by (11),

w_{j, t} = z_{j} \land s_{j, t} \oplus (1 - z_{j}) \land u_{t}, 1 \leq t \leq n .

The corresponding

L W E

cipher is

E_{s k} (w_{j, t}) = s_{j, t} E_{s k} (z_{j}) + u_{t} (E_{s k} (1) - E_{s k} (z_{j})) .

(12)

The correctness and security of the two-party computing protocol based on Bloom filter is similar to the basic two-party computing protocol. Please refer to Section 4.2 and Section 4.3.

5. Conclusions

We constructed the set intersection two-party computing protocols based on a fully homomorphic encryption scheme. The protocols are simple and only need two rounds of communication, and the security is based on

R L W E

and

L W E

problems in the semi-honest model. The ciphertext extension of the protocols is small so that the protocols have strong practicability. Furthermore, we can extended the set intersection protocol by outsourcing computing under the malicious model. The limitation of our schemes is they are two-party protocols. In future work, we shall extend them to multi-party protocols. The disadvantage of the private set intersection protocols is they are not efficient enough due to bottleneck the bootstrapping operation. On the theoretical side, with the development of fully homomorphic encryption technology, its performance has been greatly improved, but the efficiency of it is still worthy of in-depth study. The bottleneck of the SGFHE scheme is its bootstrapping operation; therefore, its parallelization and hardware implementation will be further studied to improve the overall efficiency of the protocol.

Author Contributions

Conceptualization, Y.C., C.T. and Q.X.; methodology, Y.C. and C.T.; validation Y.C., C.T. and Q.X.; writing-original draft preparation, Y.C., Q.X. and C.T.; writing-review and editing, Y.C. and Q.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Foundation of National Natural Science of China under grant 61772147, in part by Guangdong Province Natural Science Foundation of major basic research and Cultivation project under grant 2015A030308016, in part by Project of Ordinary University Innovation Team Construction of Guangdong Province under grant 2015KCXTD014, in part by Basic Research Major Projects of Department of education of Guangdong Province under grant 2014KZDXM044, in part by Collaborative Innovation Major Projects of Bureau of Education of Guangzhou City under grant 1201610005 and in part by the Key-Area Research and Development Plan of Guangdong province under grant 2019B020215004.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rvest, R.L.; Adieman, L.; DERtouzos, M.L. On data banks and privacy homomorphisms. Found. Secur. Comput. 1978, 4, 169–180. [Google Scholar]
Craig, G. Fully Homomorphic Encryption Using Ideal Lattices. In Proceedings of the Annual ACM Symposium on Theory of Computing, Bethesda, MD, USA, 31 May–2 June 2009; pp. 169–178. [Google Scholar] [CrossRef] [Green Version]
Van Dijk, M.; Gentry, C.; Halevi, S.; Vaikuntanathan, V. Fully homomorphic encryp-tion over the integers. IACR Cryptol. Eprint Arch. 2009, 616. [Google Scholar] [CrossRef] [Green Version]
Brakerski, Z.; Vaikuntanathan, V. Efficient fully homomorphic encryption from (standard) LWE. SIAM J. Comput. 2014, 43, 831–871. [Google Scholar] [CrossRef]
Brakerski, Z.; Vaikuntanathan, V. Fully homomorphic encryption from ring-LWE and security for key dependent messages. In Proceedings of the Advances in Cryptology-CRYPTO 2011, Santa Barbara, CA, USA, 14–18 August 2011; Springer: Berlin/Heidelberg, Germany, 2011; pp. 505–524. [Google Scholar] [CrossRef] [Green Version]
Gentry, C.; Sahai, A.; Waters, B. Homomorphic encryption from learning with errors: Conceptually-simpler, asymptotically-faster, attribute-based. In Advances in Cryptology-CRYPTO 2013; Springer: Berlin/Heidelberg, Germany, 2013; pp. 75–92. [Google Scholar] [CrossRef] [Green Version]
Gao, S. Efficient Fully Homomorphic Encryption Scheme. Cryptology ePrint Archive: Report 2018/637. 2018. Available online: https://eprint.iacr.org/2018/637 (accessed on 28 October 2020).
Chillotti, I.; Gama, N.; Georgieva, M. Improving TFHE: Faster Packed Homomorphic Operations and Effcient Circuit Bootstrapping. Cryptology ePrint Archive: Report 2017/ 430. 2017. Available online: https://eprint.iacr.org/2017/430 (accessed on 28 October 2020).
Oded, G.; Micali, S.; Avi, W. How to play ANY mental game. In Proceedings of the 19th Annual ACM Symposium on Theory of Computing, New York, NY, USA, 25–27 May 1987; pp. 218–229. [Google Scholar] [CrossRef]
Ion, M.; Kreuter, B.; Nergiz, E. Private Intersection-Sum Protocol with Applications to Attributing Aggregate Ad Conversions. Cryptology ePrint Archive: Report 2017/738. 2017. Available online: https://eprint.iacr.org/2017/738 (accessed on 28 October 2020).
Ion, M.; Kreuter, B.; Erhan, A. On Deploying Secure Computing Commercially: Private Intersection-Sum Protocols and their Business Applications. Cryptology ePrint Archive: Report 2019/723. 2019. Available online: https://eprint.iacr.org/2019/723. (accessed on 28 October 2020).
Prasetyo, H.; Guo, J.M. A Note on Multiple Secret Sharing Using Chinese Remainder Theorem and Exclusive-OR. IEEE Access 2019, 7, 37473–37497. [Google Scholar] [CrossRef]
Yang, Q.; Peng, G.; Gasti, P. MEG: Memory and Energy Efficient Garbled Circuit Evaluation on Smartphones. IEEE Trans. Inf. Forensics Secur. 2019, 14, 913–922. [Google Scholar] [CrossRef]
Zhang, Z.; Zhang, F.G. Garbled Circuits and Indistinguishability Obfuscation. J. Cryptologic Res. 2019, 6, 541–560. [Google Scholar] [CrossRef]
Qin, H.; Wang, H.; Wei, X. Privacy-Preserving Wildcards Pattern Matching Protocol for IoT Applications. IEEE Access 2019, 1. [Google Scholar] [CrossRef]
Gama, M.; Mateus, P.; Souto, A. A Private Quantum Bit String Commitment. Entropy 2020, 22, 272. [Google Scholar] [CrossRef] [Green Version]
Zhao, C.; Zhao, S.N.; Jia, Z.T. Advances in Practical Secure Two-party Computation and Its Application in Genomic Sequence Comparison. J. Cryptol. Res. 2019, 6, 194–204. [Google Scholar]
Hazay, C.; Lindell, Y. Efficient Protocols for Set Intersection and Pattern Matching with Security against Malicious and Covert Adversaries. J. Cryptol. 2010, 23, 422–456. [Google Scholar] [CrossRef]
Cristofaro, E.D.; Lu, E.; Tsudik, Y.A. Gene Efficient Techniques for Privacy-Preserving Sharing of Sensitive Information. Cryptology ePrint Archive: Report 2011/113. 2011. Available online: https://eprint.iacr.org/2011/113 (accessed on 28 October 2020).
Saracevic, M.; Adamovic, S.; Miskovic, V.; Macek, N.; Sarac, M. A novel approach to steganography based on the properties of Catalan numbers and Dyck words. Future Gener. Comput. Syst. 2019, 100, 186–197. [Google Scholar] [CrossRef]
Saracevic, M.; Adamovic, S.; Bisevac, E. Applications of Catalan numbers and Lattice Path combinatorial problem in cryptography. Acta Polytech. Hung. 2018, 15, 91–110. [Google Scholar]
Coppolino, L.; D’Antonio, S.; Formicola, V.; Mazzeo, G.; Romano, L. VISE: Combining Intel SGX and Homomorphic Encryption for Cloud Industrial Control Systems. IEEE Trans. Comput. 2020, 99. [Google Scholar] [CrossRef]
Lindell, Y.; Pinkas, B. Secure Multiparty Computation for Privacy-Preserving Data Mining. J. Priv. Confidentiality 2009. [Google Scholar] [CrossRef]
Michael, L.; Nejdl, W.; Papapetrou, O.; Siberski, W. Improving Distributed Join Efficiency with Extended Bloom Filter Operations. In Proceedings of the 21st International Conference on Advanced Networking and Applications, Niagara Falls, ON, Canada, 21–23 May 2007. [Google Scholar]
Naor, M.; Pinkas, B. Oblivious Transfer and Polynomial Evaluation. In Proceedings of the Thirty-First Annual ACM Symposium on Theory of Computing, Atlanta, GA, USA, 1–4 May 1999. [Google Scholar]
Freedman, M.J.; Ishai, Y.; Pinkas, B. Keyword Search and Oblivious Pseudorandom Functions. In Second International Conference on Theory of Cryptography; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar] [CrossRef] [Green Version]
Chaum, D.; Rivest, R.L.; Sherman, A.T. Blind Signatures for Untraceable Payments. Adv. Cryptol. 1983. [Google Scholar] [CrossRef]
Florian, K. Outsourced private set intersection using homomorphic encryption. ACM Symp. Inf. 2012, 85–86. [Google Scholar] [CrossRef]
Nojima, R.; Kadobayashi, Y. Cryptographically Secure Bloom-Filters. Trans. Data Priv. 2009, 2, 131–139. [Google Scholar]
Shen, L.; Chen, X.; Shi, J.; Hu, L. Survey on Private Preserving Set Intersection Technology. J. Comput. Res. Dev. 2017, 54, 2153–2169. [Google Scholar] [CrossRef]
Regev, O. On lattices, learning with errors, random linear codes, and cryptography. In Proceedings of the 37th Annual ACM Symposium on Theory of Computing, Baltimore, MD, USA, 22–24 May 2005. [Google Scholar] [CrossRef]
Regev, O. On lattices, learning with errors, random linear codes, and cryptography. J. ACM 2009, 34. [Google Scholar] [CrossRef]
Lyubashevsky, V.; Peikert, C.; Regev, O. On ideal lattices and learning with errors over rings. In Cryptology-EUROCRYPT 2010; Springer: Berlin/Heidelberg, Germany, 2010; p. 6110. [Google Scholar]
Bloom, B.H. Space/time trade-offs in hash coding with allowable errors. Commun. ACM 1970, 13, 422–426. [Google Scholar] [CrossRef]
Bellovin, S.; Cheswick, W. Privacy-Enhanced Searches Using Encrypted Bloom Filters. Cryptology ePrint Archive: Report 2004/022. 2004. Available online: https://eprint.iacr.org/2004/022 (accessed on 28 October 2020).
Goh, E. Secure Indexes. Cryptology ePrint Archive: Report 2003/216. 2003. Available online: https://eprint.iacr.org/2003/216 (accessed on 28 October 2020).
Raykova, M.; Vo, B.; Bellovin, S.; Malkin, T. Secure Anonymous Database Search. In Proceedings of the ACM Cloud Computing Security Workshop, Chicago, IL, USA, 13 November 2009; pp. 115–126. [Google Scholar] [CrossRef] [Green Version]
Qiu, L.; Li, Y.; Wu, X. Preserving privacy in association rule mining with bloom filters. J. Intell. Inf. Syst. 2007, 29, 253–278. [Google Scholar] [CrossRef]
Debnath, S.K.; Dutta, R. Secure and Efficient Private Set Intersection Cardinality Using Bloom Filter. Int. Inf. Secur. Conf. 2015, 9290, 209–226. [Google Scholar] [CrossRef]
Changyu, D.; Liqun, C.; Zikai, W. When private set intersection meets big data: An efficient and scalable protocol. In Proceedings of the ACM Conference on Computer and Communications Security, Berlin, Germany, 4–8 November 2013; ACM: New York, NY, USA, 2013; pp. 789–800. [Google Scholar] [CrossRef] [Green Version]
Egert, R.; Fischlin, M.; Gens, D. Privately Computing Set-Union and Set-Intersection Cardinality via Bloom Filters. Eur. J. Oper. Res. 2015, 139, 371–389. [Google Scholar] [CrossRef]
Ducas, L.; Micciancio, D. FHEW: Bootstrapping homomorphic encryption in less than a second. In Proceedings of the Advances in Cryptology-EUROCRYPT 2015, Sofia, Bulgaria, 26–30 April 2015; Springer: Berlin/Heidelberg, Germany, 2015. Part I. Volume 9056, pp. 617–640. [Google Scholar] [CrossRef] [Green Version]
Chillotti, I.; Gama, N.; Georgieva, M.; Izabachéne, M. Faster fully homomorphic encryption: Bootstrapping in less than 0.1 seconds. In Proceedings of the Advances in Cryptology-ASIACRYPT 2016: 22nd International Conference on the Theory and Application of Cryptology and Information Security, Hanoi, Vietnam, 4–8 December 2016; Springer: Berlin/Heidelberg, Germany, 2016. Part I. Volume 10031, pp. 3–33. [Google Scholar] [CrossRef]

Figure 1. Summary of the intersection protocol.

Figure 2. The basic two-party computing protocol.

Figure 3. Improvement.

Figure 4. Protocol based on a Bloom filter.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Cai, Y.; Tang, C.; Xu, Q. Two-Party Privacy-Preserving Set Intersection with FHE. Entropy 2020, 22, 1339. https://doi.org/10.3390/e22121339

AMA Style

Cai Y, Tang C, Xu Q. Two-Party Privacy-Preserving Set Intersection with FHE. Entropy. 2020; 22(12):1339. https://doi.org/10.3390/e22121339

Chicago/Turabian Style

Cai, Yunlu, Chunming Tang, and Qiuxia Xu. 2020. "Two-Party Privacy-Preserving Set Intersection with FHE" Entropy 22, no. 12: 1339. https://doi.org/10.3390/e22121339

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Two-Party Privacy-Preserving Set Intersection with FHE

Abstract

1. Introduction

1.1. Related Work

1.2. Contributions

2. Basic Concepts and Techniques

2.1. Notation

2.2. LWE Ciphers and Modulus Reduction

2.3. RLWE Ciphers

2.4. GSW Ciphers and External Product

2.4.1. Gadget Matrix

2.4.2. External Product

2.4.3. GSW Ciphers

2.5. Bloom Filter

3. Homomorphic Operations

3.1. Key Generations

3.2. Bootstrapping Algorithm

3.3. Encryption Scheme

4. Privacy-Preserving Set Intersection

4.1. The Basic Two-Party Computing Protocol

4.2. Correctness of the Basic Two-Party Computing Protocol

4.3. Security Analysis of the Basic Two-Party Computing Protocol

4.4. The Improvement of the Basic Two-Party Computing Protocol

4.5. Two-Party Computing Protocol Based on a Bloom Filter

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI