`Journal of ProbabilityVolume 2014 (2014), Article ID 979312, 10 pageshttp://dx.doi.org/10.1155/2014/979312`
Research Article

Marshall-Olkin Discrete Uniform Distribution

1Department of Statistics, Prajyothi Niketan College, Pudukkad, Thrissur-680 301, India
2Department of Statistics, St. Thomas College, Pala, 686574, India

Received 24 November 2013; Revised 11 February 2014; Accepted 20 February 2014; Published 7 April 2014

Copyright © 2014 E. Sandhya and C. B. Prasanth. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

We introduce and characterize a new family of distributions, Marshall-Olkin discrete uniform distribution. The natures of hazard rate, entropy, and distribution of minimum of sequence of i.i.d. random variables are derived. First order autoregressive (AR (1)) model with this distribution for marginals is considered. The maximum likelihood estimates for the parameters are found out. Also, the goodness of the distribution is tested with real data.

1. Introduction

Marshall and Olkin [1] introduced a new method for adding a parameter to a family of distributions with application to the exponential and Weibull families. Jose and Alice [2] discussed Marshall-Olkin family of distributions and their applications in time series modeling and reliability. Jose and Krishna [3] have developed Marshall-Olkin extended uniform distribution. These works and most of the references there in, deal with continuous distribution. Not much work is seen in the discrete case. The reason behind this may be that it is difficult to obtain compact mathematical expressions for moments, reliability, and estimation in the discrete set up.

If is the survival function of a distribution, then, by Marshall-Olkin method, we get another survival function , by adding a new parameter to it. That is, Then the corresponding distribution function is .

Now its probability mass function (p.m.f.) is where is the p.m.f. corresponding to .

The hazard rate of is

We consider a new family of distributions by adding an additional parameter by using the Marshall-Olkin scheme (Marshall and Olkin [1] in Section 2). The nature of hazard rate, entropy and expectation are derived. In the third section, distributions of minimum sequence of i.i.d. random variables are found out. An AR (1) model with new Marshall-Olkin discrete uniform distribution is discussed in the next section. In the fifth section, the maximum likelihood estimate (m.l.e.) for the parameters is found out and, in the last section, the goodness of the distribution is tested with a real life data.

2. Marshall-Olkin Discrete Uniform Distribution

Let be a discrete random variable following uniform distribution with p.m.f. , . We have the distribution function and the survival function . By Marshall-Olkin method, we can form another survival function of Marshall-Olkin discrete uniform (MODU) distribution by substituting in , in (1), and we get

We write for a random variable (r.v.) with given in (4).

2.1. Stability Property of the New Family

If we apply the same method again into the new family, by adding a new parameter “ to the reliability function, we get , which is the reliability function of the new Marshall-Olkin family with parameters .

2.2. Probability Mass Function

From (2), consider the probability mass function p.m.f. where is the p.m.f. corresponding to . That is,

Now we plot the p.m.f. of for different values of and (Figure 1).

Figure 1: P.m.f. of .

Remark 1. If follows , then , , .

Now, our next interest is to check whether (6) characterizes MODU. If satisfies the condition , , , this should be true for also. For , it gives the parent distribution itself before reparameterization. Thus, Remark 1 implies that , , where is the p.m.f. of discrete uniform distribution with parameter “”. That is, . This relation need not characterize a discrete uniform distribution, when . For example, let ; then we get . Since , we get . That is . Let imply , which is not discrete uniform. Thus we have the following.

Theorem 2. A random variable follows , if and only if its p.m.f. satisfies

Proof. Suppose that is true for Marshall-Olkin (MO) scheme, then its parent distribution is discrete uniform on . That is, that is, We know the p.m.f. . That is, by (8); that is, That is, , by (8).
That is, or . We get, By substituting this in (11), That is, or . So , since .
That is, . But, when , it corresponds to discrete uniform distribution on . The converse is straight forward.

Remark 3. It is clear that MODU distribution does not possess additive property.

Remark 4. MODU is not infinitely divisible (i.d.), since its support is on , a finite range. It is not log convex, since the class of log convex distribution forms a subclass of the class of i.d. distributions.

Corollary 5. If , then , if , and , if .

Proof. Let , and . So, implies that . That is , , or . This happens only when . Now assume that . That is, . So ; that is, , which is true only when , since . Hence, the proof is completed.

2.3. Expectation, Standard Deviation, and Entropy of MODU Distribution

We numerically compute the expectation and standard deviation (Tables 1 and 2) and entropy (Table 3) of the MODU distribution with different and , since compact expressions are not available for calculating the same. We have

Table 1: Expectation and standard deviation (sd) with different “” and “ for .
Table 2: Expectation and standard deviation with different “” and “ for .
Table 3: Entropy with different “” and “” for .

Remark 6. If , then the is decreasing with increasing value of when and the increasing with increasing value of when .

From Remark 1 and Tables 1 and 2, we have the following.

Remark 7. The for is equal to for .

From Remark 1, Table 3, and Figure 2, the following is observed.

Figure 2: Entropy with different “” and “” for .

Remark 8. The entropy of is equal to the entropy of .

Let ; then the mean, median, and mode of the distribution (in Figure 1 and Tables 1 and 2) for different and are computed in Table 4.

Table 4: Mean, median, and mode of the MODU distribution with different and .

From Figure 1 and from Table 4, the following is clear.

Remark 9. The MODU distribution is positively skewed, when , since mode < median < mean and the distribution is negatively skewed, when , since mode > median > mean. Also, it is to be noted that the distribution is unimodal; that is, when , the mode = 1 and, when , the mode .
Therefore, the MODU distribution can be applied for the data showing positive skewness when and the data showing negative skewness when .

2.4. Hazard Function

The hazard function of is That is,

Theorem 10. Let ; then the distribution is with Increasing Failure Rate (IFR) when , Decreasing Failure Rate (DFR) when , and constant FR when .

Proof. From (17), the hazard function of is .
Then . If , the distribution is with IFR.
That is, if or if , the distribution is with IFR.
Similarly, if , the distribution is with DFR, and if , the distribution is with a constant FR.
We illustrate these results graphically (Figure 3) and numerically (Table 5).
From Figure 3, also, it is clear that the failure rate is IFR, when , and DFR, when .

Table 5: Hazard function of with .
Figure 3: Hazard function of .

3. MODU Distribution as the Distribution of Minimum of a Sequence of i.i.d. Random Variables

The following theorem gives a characterization of minimum of a sequence of i.i.d. random variables following MODU distribution.

Theorem 11. Let be a sequence of i.i.d. random variables with common survival function . Let be a geometric random variable independent of   such that , ; , and . Let . Then is distributed as MODU distribution if and only if   follows discrete uniform distribution with .

Proof. Proof follows as in the same lines as given in Satheesh et al. [46] for a similar characterization of minimum of sequence of i.i.d. random variables.
The survival function of is   =     =    =    =    =  . Substituting , the survival function of the discrete uniform distribution; that is, which is the survival function of . By retracing the steps we can easily show the converse.

4. AR (1) Model with MODU Distribution as Innovating Distribution

Arnold and Robertson [7], Chernick [8], and Satheesh et al. [9] studied some properties of the AR (1) models. As Satheesh et al. [9] discussed, construct a first order autoregressive minification process with the following structure: where is a sequences of i.i.d. random variables following a discrete uniform distribution with parameter independent of . Then the process is stationary and is marginally distributed with marginal distribution.

Theorem 12. In an AR (1) process with structure (20) is stationary Markovian with marginal if and only if is distributed as uniform with parameter .

Proof. From the structure (20), it follows that Under stationary equilibrium, this implies If we take , then,
, which is the survival function of . We can also show that follows uniform distribution with parameter . By retracing the steps we can easily show the converse.

5. Maximum Likelihood Estimates (m.l.e.) for the Parameters of MODU

Let be independent random samples from . Then, from the likelihood function of the distribution, we can write We calculated the m.l.e. of and numerically (using Mathematica) as the solution of these two nonlinear equations. But, here, the maximum of the range of observation is . So m.l.e. of the parameter is the largest value of the observation. Then, from (23), we can find m.l.e. of .

Algorithm 13. Consider the following.(1)Through simulation, 2500 random samples were generated from inverse function of distribution function for some given value of the parameters and .(2)Then we iterate the m.l.e. for the parameters from the equations (23) and (24) (Table 6).(3)For accuracy, we repeat the same calculation ten times with same values of and .(4)The mean and the standard error (SE) of these estimates are calculated.(5)Then it is calculated with different values of the parameter and the same is found with an increase in sample size 5000 also.(6)The mean estimated values for the parameters are tabulated (Table 7).

Table 6: Initial observations.
Table 7: Mean of m.l.e. of from 10 repetitions. (, sample size ).

It is clear that the SE of m.l.e. of the parameters “” and is decreasing with increase in sample size.

6. An Application of MODU

Example 14. The data was collected from the daily attendance register of the science and commerce (nonlanguage) supplementary class (20 working days in a month) of a higher secondary school in Palakkad district, Kerala. We took a sample of 50 students (out of 360) and let be the number of days these students attended in the class for the whole year 2012-2013 (an academic year is 10 months) (see Tables 8, 9, and 10).

Table 8: Observed frequency of 50 students in 10 months from science and commerce classes.
Table 9: The m.l.e. of (sample size = 50, m.l.e. of , ).
Table 10: Mean m.l.e. of and mean Standard Error (SE) of for the samples of 10 months.

We arbitrarily fix 4 months, and the m.l.e.s are computed. Initially we fit discrete uniform to the data (see Tables 11 and 12).

Table 11: Goodness of fit of uniform distribution with .
Table 12: Mean, median, and mode of the data in May, August, October, and February.

Since mode > median > mean, the data exhibits negative skewness with m.l.e. of and it is observed that the distribution is unimodal; hence, MODU distribution is supposed to give a better fit than uniform distribution (see Table 13).

Table 13: Goodness of fit of MODU distribution with .

Result. From the values, it is seen that MODU distribution is better fit than uniform distribution.

Example 15. The data was collected from the daily attendance register of the supplementary language class (20 working days in a month) from the same higher secondary school in Palakkad district, Kerala. We took a sample of 50 students and let be the number of days these students attended in the same for the whole year 2012-2013 (an academic year is 10 months) (see Tables 14, 15, and 16).

Table 14: Observed frequency of 50 students in 10 months from the language classes.
Table 15: The m.l.e. of and SE of (sample size = 50, m.l.e. of , ).
Table 16: Mean m.l.e. of and mean SE of for the samples of 10 months.

We arbitrarily fix 3 months, and the m.l.e.s are computed. Fit discrete uniform to the above data. We have the results as shown in Tables 17 and 18.

Table 17: Goodness of fit of uniform distribution with .
Table 18: Mean median and mode of the data in May, September, and February.

Since mode < median < mean, the data exhibits positive skewness with m.l.e. of and it is observed that the distribution is unimodal; hence, MODU distribution is supposed to give a better fit than uniform distribution (see Table 19).

Table 19: Goodness of fit of MODU distribution with .

Result. From the values it is seen that, here, MODU distribution is a better fit than uniform distribution.

7. Conclusions

The p.m.f.s of MODU distribution satisfies the relation , , . The failure rate of is IFR, when , DFR, when , and constant, when . Like discrete uniform distribution, the probability function degenerates when a is large. In the case of , the standard deviation is decreasing with the increasing value of , when , and is increasing with increasing value of , when . For and , the is equal in both cases. It is also observed that entropy. The MODU distribution is positively skewed, when , since, here, mode < median < mean and the distribution is negatively skewed, when , since then mode > median > mean. It is to be noted that the distribution is unimodal; that is, when , the mode and, when , mode . An application of MODU distribution is also discussed.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

1. A. W. Marshall and I. Olkin, “A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families,” Biometrika, vol. 84, no. 3, pp. 641–652, 1997.
2. K. K. Jose and T. Alice, Marshall-Olkin Family of Distributions-Applications in Time Series Modeling and Reliability, JC Publications, 2005.
3. K. K. Jose and E. Krishna, Marshall-Olkin Extended Uniform Distribution, ProbStat Forum, 2011.
4. S. Satheesh and N. U. Nair, “A note on maximum and minimum stability of certain distributions,” Calcutta Statistical Association Bulletin, vol. 53, pp. 249–252, 2002.
5. S. Satheesh and N. U. Nair, “On the stability of geometric extremes,” International Indian Statistical Association, vol. 42, pp. 99–109, 2004.
6. S. Satheesh, E. Sandhya, and N. U. Nair, “Stability of random sums,” Stochastic Modelling and Applied Probability, vol. 5, pp. 17–26, 2002.
7. B. C. Arnold and C. A. Robertson, “Autoregressive logistic processes,” Journal of Applied Probability, vol. 26, no. 3, pp. 524–531, 1989.
8. M. R. Chernick, “A limit theorem for the maximum of autoregressive processes with uniform marginal distributions,” Annals of Probability, vol. 9, no. 1, pp. 145–149, 1981.
9. S. Satheesh, E. Sandhya, and S. Sherly, “A generalization of stationary AR(1) schemes,” Statistical Methods, vol. 8, no. 2, pp. 213–225, 2006.