Abstract

In this paper, by the branching process and the martingale method, we prove that the size of the largest component in the critical random intersection graph is asymptotically of order and the width of scaling window is .

1. Introduction and Main Result

Random intersection graph , which was introduced by Singer-Cohen [1] and Karoński et al. [2], is defined as follows. Let V be a set of n vertices and M a set of m elements. For each vertex , it is assigned a random subset of , where each element of M is included in randomly and independently with probability . There is an edge between vertices u and if and only if . Denote the largest component in by and its size by . In , Behrisch proved that [3] when and , there is a phase transition of in the vicinity of the point (here, we point out that when is not an integer, means with being the largest integer less than ). When with high probability (for a given graph property , we say that graph possesses with high probability if the probability that possesses tends to 1 as ) (w.h.p. for brevity), jumps from logarithmic order to linear order in ; Lagerås and Lindholm [4] also got that in , with and , exhibits a jump from logarithmic order to linear order in n around the point that . Recently, Wang et al. [5] studied the component evolution in critical and got that when , w.h.p. The order of the largest component in some critical random intersection graphs is and the width of scaling window around the critical probability is ; while when w.h.p. The order of the largest component and the width of the scaling window are and , respectively. It is natural to ask what is the order of and what is the width of scaling window in critical , which is proposed in [5]. The aim of the present paper is to study the critical . Now our main result, which is not surprising but still interesting, can be stated as follows.

Theorem 1. Let be a positive function on n such that and as . In with , the following statements hold.(1)Weakly subcritical case: if , then there are two positive constants and such that w.h.p.(2)Critical case: if for some constant , then there are a positive function , which tends to infinity as , and a constant such that w.h.p.(3)Weakly supercritical case: if , then there are two positive constants and such that w.h.p.

The following notation will be used. Write , , and for the probability, expected value, and variance of a random event and a random variable, respectively. For any two positive functions and of a natural-valued parameter , denote if there is a positive constant C such that when n is large enough, if and and if . Furthermore, all logarithms in this article have the natural base, all inequalities or asymptotic statements have to be understood in terms of n being large enough yet fixed. And for clarity of presentation, floor and ceiling signs are omitted whenever they are not essential.

2. Auxiliary Lemmas

To get the bound, we need some properties of random variable with compound binomial distribution. Let X be a compound binomial distribution with generating function:

Then, we have the following lemma.

Lemma 1. For any natural numbers , let be random variables with generating function , . Suppose , then

Here, means is stochastically smaller than . It holds if and only if there exists a coupling of and such that .

Proof. Note can be expressed as , where s has the binomial distribution and N has the binomial distribution which is independent of s, . Then, it is obvious by the coupling method.

Besides the above lemma, we still need the following existing theorems.

Theorem 2 ([6], Theorem 1). Let , , , and be an increasing graph property. If , thenwhere denotes that G has property , and is the Erdös–Rényi random graph which is obtained by retaining each edge of the complete graph on n vertices independently with probability .

By Theorem 2, to give a lower bound for , the following theorem on the size of the largest component in the critical Erdös–Rényi random graph is useful.

Theorem 3 ([7], Theorem 5.23, [8, 9]). Let denote the size of the largest component in . For any which tends to infinity as , the following statements hold.(a)In , if , then w.h.p. (b)In , if , then w.h.p. (c)In , if , then w.h.p.

3. Proof of Theorem 1

Note that given a function on the size of largest component at the least is an increasing property in with respect to . And when , and , as .when and for some constant ,

Therefore, keeping Theorems 2-3 in mind, we can determine the lower bounds directly for the weakly subcritical, critical, and weakly supercritical cases. So we only need to prove the upper bounds in the rest of this section. We mainly combine the martingale argument [8] and branching process method [10] to obtain the upper bound. We will make use of the following component exploration process on random bipartite graph (see the footnote (note that usually is constructed through the bipartite graph which is a graph with bipartition , the vertices set , and the elements set ; any vertex in V and any element in U is connected by an edge in independently with probability ; an edge between and in is present if both and are adjacent to some element in ) for its definition), not on , to get the size of components.

Exploration process . Let be the component containing vertex in . In this procedure, we explore the vertices of V on the . During the exploration, vertices will be active, inactive, or neutral, and the elements of U are neutral all the time. At the beginning, assuming all the vertices are neutral, we choose a vertex uniformly and make it active. At each time , we choose a vertex uniformly from the active vertices and check the vertices such that the distance between and is two in , that is to check all pairs where u runs over all the neutral elements in . If is an edge in , then check where runs over all the neutral vertices. If there is an element u such that and are present in then make active; otherwise, keep it neutral. After checking all the neutral vertices in , let be inactive. When there is no active vertex, the component , which is the set of inactive vertices, is explored. Then, we choose a neutral vertex uniformly from the rest of neutral ones and proceed on.

Let be the number of vertices which become active due to the exploration of active vertex , which is the number of vertices at distance 2 from in , and denote the total number of active vertices at step , where . It is easy to see that for any ,

Define T to be the least t for which , i.e.,

Then, at the time , the set of explored vertices is precisely , which means .

3.1. Weakly Subcritical Case

We only need to give an upper bound on T as . For this, we first define a random walk and a stopping time which is stochastically larger than , then bound the stopping time to determine the desired result. To this end, we need the following lemma.

Lemma 2 ([11], Lemma 8). Let η be an integer random variable with such that, for any integer , we have . Let be a sequence of i.i.d. Random variables distributed as η andwhere is an integer constant. Define t to be its hitting time of , i.e.,

If satisfiesthen for any integer ,where and constants in depend only on n and but not on

Proof of the upper bound. To make use of Lemma 2, let be a sequence of i.i.d. Random variables distributed as . SetTake . Note and set for . For the simplicity of notation, write in the rest of the paper. By these notations, when and , where and as ,DefineNext, we will prove that there is satisfyingIn fact, let and . Note that for any natural number i and , . Then, when n is large enough we can obtainHere, the last inequality holds as, noting ,Also, we can get thatThird, it is easy to check that is continuous in θ when . Hence, the intermediate value theorem implies the claim holds. So far, by (16) and (17) and the inequalities , for any ; when n is large enough, we can deduce thatNow, we are in the position of exploring components of . Suppose that k vertices have been explored at step i in the exploration process. We are in the position of exploration through a vertex, say . Since the number of neighbors of the initially picked vertex has distribution and the fact that vertices only can be newly identified one time, the number of newly explored vertices through has the distribution , by Lemma 1, , where distributed as . That is to say, is dominated above by for all , where T is defined to be in the exploration process. Let denote total inactive (explored) vertices which are explored from time 0 to t by the exploration process on starting from vertex which is chosen uniformly from vertex set Define . Notice thatHence, there is a coupling of the processes and such thatwhich means τ is stochastically larger than So by Lemma 2, we getFor a positive constant , set . Then, by the inequality , we obtain thatDenote . Then,

3.2. Inside the Critical Window

As the size of largest component at the least for a given function on n is an increasing property in with respect to , to give an upper bound, we assume that . Recall which is a sequence of the i.i.d. Random variables distributed as , and

For any real number , define

Let . Then, it is easy to check that is a martingale with . By the optimal stopping theorem, we have . Let . Then, by the equality for any positive , we can obtain

Therefore,which means . So, for any constant , we have

As in the proof of (27), we can determine that

Recall . Hence,

3.3. Weakly Supercritical Case

In this subsection, we appeal to the branching process method [10] to get the upper bounds in the weakly supercritical regime of . This method is also used by Kang et al. [12] to study other random graphs.

Theorem 4([13], Theorem 3.7). Consider the Galton–Watson branching process with supercritical offspring distribution . Let X be the random variable with the distribution of the number of offsprings. Define

Assume ; let be the unique positive solution to which means the supercritical branching process dies out with probability , and let

Then and

That is, is an offspring distribution of a subcritical branching process which is often called the dual process of Furthermore, if denotes the branching tree for and is the corresponding object for , then for any finite tree ,

Now we can obtain the following Lemma 3 about the supercritical branching process with offspring distribution and its dual process.

Lemma 3. Let denote the branching process with offspring distribution , be its total size, and be its surviving probability, where , , and . Denote its dual process by and the total size of by . Then, the following holds.(1)The surviving probability is(2)There is a positive constant such that(3)For any integer ,

Proof. (1) It is well known that for the Galton–Watson branching process with offspring distribution , it survives with the probability which is the unique solution in to , where is the corresponding probability generating function; (see [14], Theorem 1.6.1). Therefore, we only need to check that for the branching process with distribution ,whereIn fact, is continuous for , and by the inequality that for small enough ,we can obtain thatHence, the intermediate value theorem implies the claim holds.(2)To estimate , we notice the following equalities or inequalities. First, (4) impliesSecondly, by Theorem 4,And, we haveNow, we are in position of estimating . Let be the number of children for each vertex in the dual processes . Then,Hence, there is a positive constant such that the expectation of the total number of the vertices in the dual branching process is(3)Denote for the event that survives and for the event that the process dies out. Then,

To prove the desired result, we still need the following auxiliary lemma. Note that any branching process can be understood as a potentially infinite random rooted tree. And the width of a rooted tree is the maximum of the sizes of the generations.

Lemma 4. Let denote the branching process with offspring distribution , where m, n, and p, satisfy the conditions in (3) of Theorem 1. Denote for the event that BP survives and D=SC for the event that the process dies out. Then for any function M such that , we have

Proof. Define and let us construct generation by generation and stop as soon as we see the first generation of size at the least M if there is one. Then, the conditional probability which the process dies out given that holds is at mostHence, bywe have thatwhich means

Given a vertex of , let denote the component of , containing and be the random spanning tree rooted at which is obtained as following. First, explore new neighbors starting at via the breadth-first search. In graph , the breadth-first search for exploring vertices is as follows: starting with , first we find all the neighbors, ; next, we find all the neighbors of , then the new neighbors of different from the vertices explored so far, and so on, ending the second stage with the new neighbor . The third stage consists of finding all the neighbors of the vertices found in the second stage, and so on on ; then, contract the edges between the elements whose distance are from and the vertices whose distances are from , , i.e., we delete all the edges and is connected to all the vertices whose distances are two from in the graph resulting from the exploring process, say , and is connected to all the vertices whose distances are two from ; see Figure 1 for this operation. Let be the branching process with the distribution , which can be interpreted as a potentially infinite random rooted tree. So, it is obvious that the following lemma holds.

Lemma 5. For any the random rooted trees and can be coupled so that .

Let be the set of all vertices of , in components that contain at least L vertices, and be the cardinality of this set. We can get Lemma 6.

Lemma 6. Let be a real function satisfying and . Then,

Proof. In fact, by Lemma 3,

In the next lemma, we shall prove that the number of vertices in large components is concentrated around its expectation.

Lemma 7. Suppose satisfies and . Then, w.h.p.

Proof. By Chebyshev’s inequality, to get the result, we should showwhich means it suffices to proveFix a vertex of . We explore the component by breadth-first search as before which generates a tree But, we stop the exploration process immediately if one of the following events occurs:(i)We have reached L vertices in total(ii)There are vertices which have been reached (found as a new neighbor of an earlier vertex) but not yet explored (tested for new neighbors)Here, for any , [a] is the smallest integer no less than a. We call a vertex reached but not fully explored a boundary vertex. Note that there are boundary vertices. DefineThen, we can get that . Also, as we proceed in breadth-first manner, all the boundary vertices are contained in two consecutive generations. Therefore, if holds, then either or the total number of offspring of branching process is finite and Hence, by Lemmas 4 and 5,As implies holds, we getIn the reminder of the proof, we shall prove thatNow given a , we explore its component as before in breadth-first search but within . Also, we couple the resulting tree with so that . Let be the component of u in , so is spanned by . Set be the event that is infinite, and . Notice that , and they are equal unless there is an edge connecting a boundary vertex to a vertex of . As there are boundary vertices,Here, the first inequality holds due to that for any finitely many events and is the probability that there is no common element for any two vertices and in . One should notice that the event depends only on the (edge) indicator random variables for edges with at least one endpoint in ; on the other hand, the tree is constructed in This means that any event determined solely by is by construction independent of the event . Furthermore, is independent of . Therefore,Then, it follows that noting that is independent of ,Thus, as there are at most vertices in ,which gives that

Proof of (3) in Theorem 1. Define which tends to as . And let , which means and . Then (3) in Theorem 1 follows by Lemmas 5–7.

Data Availability

Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

The authors would like to thank Dr. Christoph Koch for the helpful discussion on Lemma 7. This work was supported partially by the NSFC (nos. 11961015, 11671216 and 11401127), the GXNSF (no. 2018GXNSFAA050031), and the startup grant from the Guilin University of Technology.