Risk evaluation in information systems using continuous and discrete distribution laws

: The paper construct continuous and discrete distribution laws, used to assess risks in information systems. Generalized expressions for continuous distribution laws with maximum entropy are obtained. It is shown that, in the general case, the entropy also depends on the type of moments used to determine the numerical characteristics of the distribution law. Also, probabilistic model have been developed to analyze the sequence of independent trials with three outcomes. Expressions for their basic numerical characteristics are obtained, as well as for calculating the probabilities of occurrence of the corresponding events


Introduction
A t the present stage of the development of society, which is characterized by the intensive introduction of information systems in virtually areas of activity, issues related to the assessment of the risks that occur during their operation are of particular importance. When analyzing and assessing risks, issues related to the definition of distribution laws are of the greatest importance. The given work is devoted to the construction of distribution laws.
In the modeling of information systems, risk is a random variable and is described by a probability distribution on a given set [1][2][3]. In contrast to experiments conducted in physics, where the possibility of their multiple conduct, the conditions of the functioning of information systems are characterized by a constant impact of negative external influences and are constantly changing [4], and consequently the repetition of the experiment under the same conditions is practically impracticable. The laws of probability distribution of risk events, as a rule, do not correspond to the law of the normal Gaussian distribution [5,6].

Construction of continuous distribution laws with the maximum entropy
Entropy coefficient is often used [7,8] with the classification of distribution laws of random continuous value (RV) with number characteristics.
In the formula (1), σ = √ µ 2 is standard deviation, and µ 2 is the second central power moment for this distribution law; value H is the entropy, which is defined as: where p(x) is the density of probability distribution (PDD) SV. Entropy coefficient has the maximum value for Gaussian law is δ e = 2.066; for uniform law is δ e = 1.73 and for Koshi distribution is δ e = 0 etc. The entropy value does not depend on shift parameter, to simple computation let's consider, that it is equal to zero. Firstly we need to find distribution law from unilateral laws of distribution of unlimited RV, for which entropy value (2) reaches the maximum with the following limitations imposed on probability density p(x): where β is scale parameter and ν is value of maximum existing primary direct moment. Here and next we'll consider positive power moment as a direct moment in accordance with (3) and negative power moment as a reverse moment. To find the extremum we'll use the method of indefinite Lagrange multipliers [9]. We need to maximize by inserting Lagrange multipliers λ 1 and λ 2 and considering the limitations (3). Equating the result of variation integrand expression in (4) when p(x) = 0, we'll take the equation relatively to p(x) : So, the density p(x) which satisfy (3) and maximizes H can be found from the equation (5): By substituting (6) in (3) and integrating, we have From (7), we find that λ 2 = −1 . Consequently whereĀ(z) is gamma function. From (8), it follows that if only the first beginning direct moment ν = 1 exists than exponential law has the maximum entropy; if there are two moments (ν = 2) then unilateral Gaussian law and if all direct moments exist (ν → ∞) than unilateral uniform law. Indeed, the limiting moment (8) So, if all direct moments exist then uniform law has the maximum entropy from unilateral distribution laws of RV.
Analogically, for bilateral symmetry laws of distribution of RV, it can be shown that if the first ν of absolute central direct moments then the probability density has the maximum entropy: From (9), it follows that if only first absolute central moment exists (ν = 1) then Laplace distribution has the biggest entropy; if there are two moments (ν = 2) then Gaussian law and if all direct moments exist (ν → ∞) then uniform law. Indeed, the limiting case for (9) is a uniform law p(x) = 0.5β − 1, − β < x < β. So, if all direct moments exist then uniform law has the biggest entropy from bilateral symmetry distribution laws of RV. Considered private cases of bilateral laws with the maximum entropy coincide with already known laws (Laplace and Gaussian) which have maximum entropy that confirms the correctness of received results.
From analysis of the received expressions (8) and (9), it follows that for increasing the amount of information about evaluating parameters of distribution laws with big length (with long "tails") with the help of a method of moments is necessary to use direct moments of lesser order, including fractional order. If the parameters of distribution laws with lesser length are used then it is necessary to use direct moment of higher order.
Let's find from unilateral distribution laws of unlimited RV distribution law with which entropy value H reaches maximum with the following limitations imposed on probability density p(x): where ν is value of maximum existing beginning reverse moment. Considering the entropy is defined by an expression: where y −2 p( 1 y ) is a probability density RV η which is reverse to ξ and has the probability density p(x). As a result of using the method of indefinite Lagrange numerators, we'll receive following expression for distribution law with the maximum entropy: The limiting case for (12) with ν → ∞ (all reverse moments exist) is a unilateral distribution law of limitations down from RV p( Let's define the bilateral distribution laws of RV for which entropy value H reaches the maximum with the following limitations imposed on probability density p(x) where ν is the value of maximum existing primary direct exponential moment. Considering the entropy H is defined by the expression; By using the method of indefinite Lagrange numerators we'll receive the following expression for distribution law with the maximum entropy; The limiting case for (15) when ν → ∞ (all direct exponential moments exist) is a distribution law of bordered above RV p( . Now let's find such distribution law from bilateral distribution laws of unlimited RV for which the value of entropy H reaches maximum with the following limitations imposed on probability density p(x): where ν is the value of maximum existing primary reverse exponential moment. Considering an entropy H is defined by the expression: As a result of using the method of indefinite Lagrange numerators, we'll receive the following expression for distribution law with the maximum entropy: The limiting case for (18) when ν → ∞ (all direct exponential moments exist) is a distribution law of bordered above RV p( From the analysis of expressions (15) and (18), it follows that exponential transformation of RV leads to transformation of form parameter ν in scale parameter and β parameter in shift parameter.
Finally let's define such distribution law from unilateral distribution laws of unlimited RV for which the value of entropy H reaches maximum with the following limitations imposed on probability density p(x): where ν is the value of maximum existing primary direct logarithmic moment. Considering an entropy H is defined by the expression: As a result of using the method of indefinite Lagrange numerators, we'll receive the following expression for distribution law with the maximum entropy: From (21), it follows that if only two absolute logarithmic moments exist (ν = 2 ) then logarithmic normal law has the biggest entropy. If ν → ∞ (all absolute primary moments exist) then (21) is transforming in Shannon law for limitations from above and down of RV p(x) = 0.5/βx, exp(−β) < x < exp(β). It is necessary to notice that with logarithmic transformation of RV scale parameter transforms in form parameter and shift parameter transforms in scale parameter.
In general case, if RV η connected with RV η by a ratio y = f (x) and known PDD p(y) of continuous RV ξ , then PDD p(x) can be found by a method of functional transformation with the help of expression: Considering (22) where q(x) = dy dx −1 and Ω is the areas of existence RV η and ξ respectively.

Distributions arising in the analysis of the sequence of independent tests with three outputs
Next, consider the development of a probabilistic model of a sequence of independent trials with three outcomes which becomes particularly important in the formation of estimates of the information security of information processing systems [10].
During the test, it is taken into account that its result is either event A or the opposite event C. The probability of event A in any test is independent of the outcomes of all other tests (the tests are independent) and equal to the probability (this is ensured by the same set of conditions for each test). This scheme of tests was first considered by J. Bernoulli and bears his name [11][12][13][14].
The probability P A (k) of the fact that event A in N tests will come precisely k times (k = 1, 2, . . . , N) is defined by Bernoulli's formula [13][14][15]: which represents binomial distribution. For N = 1, it transforms to Bernoulli's distribution.
The limiting case of binomial distribution when p → 0 and N → ∞ and product N p aims to some positive constant value λ (i.e., N p → ∞) is Poisson's distribution [13][14][15].
If sequence of tests with Bernoulli's scheme continues to appear m "failures" then the number of successes k obeys to negative binomial distribution where Γ(m) is the gamma function. Main purpose of this work is to invent sequence probability model of independent tests with three outputs and with it's help receive formulas analogue to (24), (26) and (27) for defining the probabilities of coming coinciding events.
Let it be produced N of independent tests. Every test can end with three outputs: either event A with the probability p 1 will come, or event B with the probability p 2 will come, or event C with the probability (1 − p 1 − p 2 ) will come. Let's match random discrete value to random output of every test which takes three values: -1 if event A happened; 0 if event C happened and 1 if event B happened. Positive or negative output of every test we'll consider as a "success" and zero output -"failure". In this the probability of coming events A, C and B in every test can be found by an expression where 0 < p 1 < 1, 0 < p 2 < 1, p 1 + p 2 < 1. This distribution of probabilities, analogically to Bernoulli's distribution (25), can be called bilateral Bernoulli's distribution.
Using (28), we'll get Since ongoing tests are independent so characteristic function θ N (j, ϑ) of distribution laws P(k) in N tests will be equal to expression: In this probability distribution P(k) in N tests can be found by the formula: Let's find obvious expression for probability distribution P(k) in N tests by substituting (31) in (32) and integrating (10) can be simplified for five private cases: 1. If p 1 = p 2 = p < 0.5, then Probability distribution (35), just like distribution (24), is a binomial distribution with not-zero shift parameter. 3. Let's view limiting case for distribution (33), when probability of coming value C is aims to zero, i.e., (p 1 + p 2 ) → 1. In this case every test will end in two outputs: either coming of event A with the probability (1p), or event B with the probability p. Those outputs can be matched discrete random value, which takes two values: -1, if event A happened and 1, if event B happened. In this probability distribution (33), the result can be transformed to distribution: 4. Let's view the second limiting case for distribution (33), when probability of coming event A aims to zero, i.e. p 1 → 0. In this case every test will end in two outputs: either coming of event C with a probability (i − p), or event B with a probability p. Those outputs can be matched random discrete value, which takes two values: 0, if event C happened and 1, if event B happened. This probability distribution (33) as a result of limiting transition transforms is the binomial distribution (24) and that's why received probability distribution (33) can be called generalized Bernoulli's formula, or bilateral binomial distribution. 5. Let's view the third limiting case for distribution (33), when p 1 → 0, p 2 → 0, N → ∞, and products N p 1 , N p 2 aim to some positive constant values λ 1 , λ 2 (i.e. N p 1 → λ 1 , N p 2 → λ 2 ). This probability distribution (33) in result of limiting transition transforms is the probability distribution either or where I ν (z) is the modified Bessel's function.
If parameter λ 1 → 0, and parameter λ 2 → λ, then distribution (37) or (38) transforms in Poisson's distribution (26). That's why probability distribution (37) or (38) can be called bilateral Poisson's distribution. Characteristic function for it is; The first, second, third and fourth order for distribution (33) can be found from the expressions To compute these moments, we need asymmetry coefficient K a and excess coefficient K e , which are given as; Expressions (41), (41) and (42) for moments are significantly simplified for private distribution cases (33). So, for distribution (33), we have: In this case For distribution (45), we have: where For distribution (36), we have where For expression (37) or (38), we have where Probability P B (k) of fact, that event B in N tests will come k times can be found from formula (33), or from it's private cases (34), (35), (36), (37) or (38). In this we suppose that P B (k) = P(k), k = 1, 2, . . . , N.
Probability P C of coming event C in N tests can be found using formula (33), or it's private cases (35), (36), (37) or (38). In that we suppose, that P C = P(0). Probability P C matches to probability of fact, that in N cases events A and B won't come.
Let's view the example. Two symmetric coins are being thrown for ten rimes. In every throw three outputs are possible: two "eagles" with probability 0.25; two "tails of coin" with probability 0.25 and "eagle and tail of coin" with probability 0.5. It's necessary to find: 1) probability of fact, that precisely five times two "eagles" drop; 2) probability P tt of fact, that precisely three times two "tails of coin" drop; 3) probability P et of fact, that precisely five times two "eagles" and three "tales of coin" drop. In the match with example's condition we have i.e., p 1 = p 2 , then we use expression (9) as a counting formula. With it's help we find, that either P ee ≈ 0.015, P tt ≈ 0.075, or where F(k) = F 1 (0.5(m + |k|)), 0.5(m + |k| + 1), 1 + |k|, 4p 1 p 2 is the Hypergeometric Gaussian function. Characteristic function of distribution (54) or (55) has the form Primary moment of the first order and central moments of the second, the third and the fourth orders for expressions (54) or (55) are defined by expressions Let's view limiting case for distribution (31) or (32), when probability p 1 → 0, and probability p 2 = p. In the probability distribution (31) or (32) as a result of limiting transaction transforms in negative binomial distribution (4). That's why received probability distribution (31) or (32) can be called bilateral negative binomial distribution.
Choosing from bilateral binomial, Poisson's and negative binomial distributions, we can use following properties of those distributions: Binomial -K a M 2 < 1, Poisson's -K e M 2 = 1, Negative binomial -K e M 2 > 1.
So, there was developed probability model for sequence of independent tests with three outputs, were received expressions for it's general number characteristics, and also for calculating the probabilities of coming matched events precisely k times. It was shown, that limiting cases of received bilateral distributions are binomial, negative binomial and Poisson's distributions.

Conclusion
The following results are obtained in this paper • Generalized expressions for one-way and two-way continuous distribution laws with maximum entropy depending on the number of existing power, exponential or logarithmic moments. With their help, one can more reasonably choose the a priori distribution under the conditions of a priori uncertainty in the analysis of the risks of information systems. From the analysis of expression (23) and its particular cases (2), (11), (14), (17), (20) at the appropriate values q(x) it follows that in the general case the entropy depends also on the type of moments used to determine the numerical characteristics of the distribution law. • Probabilistic model for a sequence of independent trials with three outcomes, which acquire special significance in the formation of information security assessments of information systems. Expressions for its basic numerical characteristics are obtained. It is shown that the limiting cases of the obtained two-way distributions are the binomial, negative binomial and Poisson distributions.