BioMed Research International

Volume 2014, Article ID 873436, 9 pages

http://dx.doi.org/10.1155/2014/873436

## A Diverse Stochastic Search Algorithm for Combination Therapeutics

^{1}Department of Physics, Texas Tech University, P.O. Box 41051, Lubbock, TX 79409, USA^{2}Department of Electrical and Computer Engineering, Texas Tech University, P.O. Box 43102, Lubbock, TX 79409, USA

Received 17 November 2013; Revised 20 January 2014; Accepted 6 February 2014; Published 12 March 2014

Academic Editor: Hua Xu

Copyright © 2014 Mehmet Umut Caglar and Ranadip Pal. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

*Background*. Design of drug combination cocktails to maximize sensitivity for individual patients presents a challenge in terms of minimizing the number of experiments to attain the desired objective. The enormous number of possible drug combinations constrains exhaustive experimentation approaches, and personal variations in genetic diseases restrict the use of prior knowledge in optimization. *Results*. We present a stochastic search algorithm that consisted of a parallel experimentation phase followed by a combination of focused and diversified sequential search. We evaluated our approach on seven synthetic examples; four of them were evaluated twice with different parameters, and two biological examples of bacterial and lung cancer cell inhibition response to combination drugs. The performance of our approach as compared to recently proposed adaptive reference update approach was superior for all the examples considered, achieving an average of 45% reduction in the number of experimental iterations. *Conclusions.* As the results illustrate, the proposed diverse stochastic search algorithm can produce optimized combinations in relatively smaller number of iterative steps. This approach can be combined with available knowledge on the genetic makeup of the patient to design optimal selection of drug cocktails.

#### 1. Introduction

Biological networks are complex and stochastic by nature. They are also robust and incorporate redundancy in their functionality. Thus, from the perspective of intervention, targeting individual proteins or pathways may not be sufficient for achieving a desirable outcome. For instance, solid tumors often fail to respond to monotherapy due to redundant pathways being able to carry on proliferation [1, 2]. Thus, combination therapy is often considered where multiple proteins and pathways are targeted to reduce tumor growth and avoid resistance to therapy [3, 4]. The primary concern with this approach is the enormous increase in the possible candidate concentrations that needs to be experimentally tested. One possible solution can be detailed modeling of the cellular system and design of the combination therapy based on analytical optimization and simulation. However, the kind of detailed model that captures the synergy or antagonism of drugs at different levels can require enormous amount of experimentation to infer the model parameters. Furthermore, this kind of approach may only work for molecularly targeted drugs where the specific drug targets are known, but modeling chemotherapeutic drug synergies can often be difficult. Existing approaches to drug sensitivity prediction based on genomic signatures often suffer from low accuracy [5–7]. Thus for generation of optimal drug cocktails, systematic empirical approaches tested on in-vitro patient tumor cultures are often considered. Some existing approaches include (i) systematic screening of combinations [8–10] which requires numerous test combinations, (ii) medicinal algorithmic combinatorial screen (MACS) based on laboratory drug screen for multiple drug combinations guided by sequential search using a fitness function [11], and (iii) deterministic and stochastic optimized search algorithms [12–15]. The systematic search approach should be focused on locating the global maximum instead of getting stuck in a local maximum. Furthermore, the optimization algorithm needs to be effective in search spaces, without existing prior knowledge, and easily adaptable to higher dimensional systems. Since the knowledge of the drug sensitivity distribution is unknown, the algorithm should be effective over a number of unrelated search spaces. A common problem related to the stochastic search algorithms in the literature is the normalization issue mentioned in [15]. Some of the search algorithms like Gur Game require proper normalization of the search space without having any prior information about it [13]. In order to overcome this problem, proposed algorithms should be adaptive to nonnormalized search spaces.

In this paper, we propose a diversified stochastic search algorithm (termed DSS) that does not require prior normalization of the search space and can find optimum drug concentrations efficiently. At this point, it is important to emphasize that the primary objective of the algorithm is to minimize the number of experimental steps and not the CPU time. The critical problem is the cost of the experiments that are necessary to find the most efficient combination, since the cost of biological experiments is significantly higher than the cost of computation in terms of time and money. As the number of biological experiments necessary to find the global maximum is equal to the number of steps, the objective is to minimize the experimental steps in order to reduce the cost of the overall process. With our proposed algorithm, we significantly decrease the number of steps necessary to find the maximum. The proposed iterative algorithm is based on estimating the sensitivity surface and incorporating the response of previous experimental iterations. By generating an estimate of the sensitivity distribution based on currently available response data, we are able to make larger moves in the search space as compared to smaller steps for gradient based approaches. The proposed algorithm is composed of two parts: (a) the first part consists of generating a rudimentary knowledge of the search space, (b) and in the second part, we utilize the crude knowledge of the sensitivity distribution to run a focused search to locate the exact maximums or run a diverse search to gather further information on the sensitivity distribution.

For comparing the accuracy of our proposed approach, we compare the performance of our algorithm to the recently proposed* adaptive reference update* (ARU) [15] algorithm which has been shown to outperform earlier stochastic search approaches for drug cocktail generation [13, 14]. We consider seven diverse example functions that represent possible drug interaction surfaces and also test our algorithm on two experimentally generated synergistic drug combinations. Even though our algorithm is not constrained to discretized drug levels, we have considered discretized drug concentration levels for our examples to be able to compare our results with previous studies. The results illustrate that the proposed algorithm is more efficient than the ARU algorithm for all the considered drug response surfaces. We also present a theoretical analysis of the proposed search algorithm to explain the algorithm performance.

The paper is organized as follows: the* Results* section contains the detailed performance analysis for the 9 examples, and 4 of them were analyzed for 2 different parameter sets; the* Discussion* section includes the theoretical analysis of the algorithm along with conclusions; and the search algorithm along with the surface estimation algorithm are presented in the* Methods* section.

#### 2. Results

In this section, we present the performance of our proposed algorithm for nine different examples. The numbers of drugs considered in the examples are 2, 3, 4, and 5 with 21, 11, 11, and 11 discretized concentration levels, respectively, resulting in search space sizes of , , , and for the synthetic examples and search grid sizes of and for the experimentally generated examples with two drugs and number of discretization levels of 9 and 10. As mentioned earlier, our results are compared with the latest stochastic search algorithm for drug cocktail generation (termed ARU algorithm) [15] which was shown to outperform earlier approaches [13, 14]. Similar to comparisons in [15], two parameters are primarily considered (a)* Cost*: average number of steps required to reach within 95% of the maximum sensitivity and (b)* Success Rate*: percentage of times that the search algorithm reaches 95% of the maximum sensitivity within a fixed number of steps. The details of the example functions, search parameters, and performance of our approach and ARU approach are shown in supplementary Tables 1–9 (see Supplementary Material available online at http://dx.doi.org/10.1155/2014/873436). Each table contains the problem dimensions, intervals, grid points, algorithmic parameters, and the performance comparison in terms of success percentage and average number of iterations (termed score) for our proposed approach and ARU. Two of the presented examples are based on experimentally generated data. (i) Supplementary Table 8 reports the results for bacterial (*S. aureus*) inhibition response for the drugs Trimethoprim and Sulfamethoxazole that has a synergistic effect as shown in [10]. The data surface is shown in supplementary Table 10. (ii) Supplementary Table 9 considers lung cancer inhibition response using the drugs Pentamidine and Chlorpromazine [16]. Both of these compounds have moderate antiproliferative activities on their own in-vitro in A549 lung carcinoma cells. But neither pentamidine (an antiprotozoal agent) nor chlorpromazine (an antipsychotic agent) is used clinically as a cancer drug. On the other hand, because of the synergy between them, they can prevent the growth of A549 lung carcinoma cells; in addition to that, in proper concentrations, combination is more effective than some of the commonly used cancer drugs. The data surface is shown in supplementary Table 11. The performance of our approach as compared to ARU algorithm for the nine examples is summarized in Table 1. The results indicate that we achieve 100% success rate for all nine examples (thirteen different evaluations), whereas ARU has slightly lower success rate in 2 of these examples. The primary benefit of our approach is the lower average number of iterations to reach within 95% of the maximum sensitivity. For all the examples considered, we require significantly lower average number of iterations to reach within 95% of the maximum. Note that the standard deviation of the number of iterations required to reach within 95% of the maximum is relatively small as compared to the difference in average iterations between ARU and DSS. For instance, the first example in 2 dimensions requires an average of iterations for our proposed approach as compared to iterations for ARU approach. The standard deviation () in 100 runs of DSS algorithm is and thus the difference in the mean runs between DSS and ARU is more than . The ARU algorithm has earlier been shown to outperform other existing algorithms such as Gurgame and its variants. Please refer to Tables 1 and 2 of [15] for the detailed comparison results of ARU with Gurgame. This strongly illustrates that the proposed algorithm is able to generate high sensitivity drug combinations in lower number of average iterations than existing approaches.

To further illustrate the significance of the proposed approach, let us consider one of the experimental example results. The experimental data on lung cancer contains the sensitivity for drug concentration combinations where each drug is assumed to have one of 10 discrete concentrations. This data has been utilized to study the efficacy of the proposed algorithm. For instance, an exhaustive search approach will experimentally test the sensitivity of each of these 100 concentrations and select the one with the highest sensitivity and thus it will require 100 experimental steps. On the other hand, the stochastic search algorithms such as ARU and proposed DSS will start with random drug concentration combinations and try to sequentially select drug concentrations that will provide an improved knowledge of the sensitivity surface over these two drugs. As Table 1 shows, ARU will require an average of 12.4 sequential steps to reach a drug combination that has sensitivity within 95% of the maximum sensitivity, whereas the proposed DSS will require an average of 5.97 sequential steps to reach within 95% of the maximum sensitivity. Thus, DSS will reach within 95% of the maximum sensitivity on an average of 5.97 experimental steps, whereas ARU will require 12.4 experimental steps and exhaustive search will require 100 experimental steps. Note that since ARU and DSS are stochastic approaches, the number of sequential steps required can vary with each experimental run and the numbers 12.4 and 5.97 represent the mean of multiple experimental runs. The experimental data has been used here to provide the sensitivities for specific drug concentrations requested by the algorithms.

For analyzing the behavior of our algorithm during the iteration process, we analyzed the minimal distance of the optimal point(s) from the DSS selected points. Let us consider drugs and to discretization levels for each drug. Figure 1 represents the minimum distance of the points selected for experimentation to any of the optimal point(s) for the synthetic Example 6 (the simulation details are included in supplementary Table 6) with two different parameter sets (the number of initial points is 40 and 10, resp.) for 100 repeated experiments. Note that and for the example and thus the maximum possible distance is . The number of optimal points for this example is . The red vertical line represents the value of which is and , respectively, for two different solutions of this example. The black vertical line represents the average number of iterations required to reach an optimal point for the specific response function. The cyan vertical dotted line represents the worst situation out of 100 different runs. The average minimum distance of the experimental points to the optimal point is shown in green in Figure 2. The solid blue line represents the analytically calculated expected minimum distance. The theoretical analysis of the minimum distance is included in Section 3. Note that there is a change in the shape of the blue curve after the end of Step 1 (iteration 40 for Figure 2(a) and iteration 10 for Figure 2(b)). The dotted blue curve denotes the analytically calculated where and denote the mean and standard deviation for the minimum distance. Figures 1 and 2 illustrate that the minimum distance of the selected points to the optimal points decreases with successive iteration and closely matches the analytical predictions.

#### 3. Discussion

In this section, we provide a generalized analysis of the proposed search algorithm followed by conclusions and future research directions.

##### 3.1. Theoretical Analysis

In this subsection, we will attempt to theoretically analyze the distance of the point with the global sensitivity maximum from the points that are tested by the proposed algorithm. We will consider that each drug is discretized from to levels and that we are considering drugs. Thus, any drug cocktail can be represented by a length vector , where for . Thus, the search space of drug cocktails (denoted by ) is of size and represents points in an -dimensional hypercube of length . Let denote the drug cocktail with the maximum sensitivity among the points. The mapping from the drug cocktail to sensitivity will be denoted by the function ; that is, the maximum sensitivity will be given by . We will assume that if the distance of the test point () from the point with the global maximum () is small, the sensitivity will be close to the global maximum; that is, a small will imply a small . We will primarily analyze the norm of .

Note that . The first points in our algorithm are chosen randomly in the search space and thus we will consider that has a uniform distribution between and . can also be situated in any portion of the search space and thus we will consider to have a uniform distribution between and . Thus, the probability mass function of the random variable will be given by

Subsequently, the PMF of the random variable will be given by

The random variable denoting the norm will be a sum of random variables with PMF given by (2). The distribution for the sum of any two random variables consists of the convolution of the individual distributions of the random variables. Thus, the probability distribution of can be calculated by convolving times. The distribution of for and is shown in Figure 3.

At the beginning of our algorithm, we are selecting points in random. Thus, the nearest neighbor distance from the optimal point will be given by the random variable that denotes the minimum of random variables selected independently based on the distribution of . Thus, the cumulative distribution function (CDF) of is given by [17]

The PMF of given by for and is shown in Figure 4.

For example, the expected minimum distance from the optimal point for , is for . The mean and variance of the minimum distance from the optimal point for different values of , , and are shown in Table 2. Note that if there are optimal points in diverse locations, the mean and variance of the minimum distance of the selected points from any of the optimal points are given by and . This is because when there are optimal points, the minimum distance will consist of the minimum of distances ( distances from each optimal point). If there are multiple optimal points in one hill with small distances between each other, they will be considered as one single optimal point for the minimum distance analysis.

As Table 2 shows, the distance will increase with increasing and and following Step 1 of the algorithm, our point with highest experimental sensitivity may not be close to the optimal point but rather may belong to another hill with a local optima. However, based on the nearest neighbor distances, we would expect to have at least one point close to the optimal point in the top optimal points. Thus, if we keep selecting points for further experimentation from around the top experimental points sequentially, we expect that on an average points will be selected around the optimal point.

Consider that the distance from the optimal point was given by the random variable and if a point is selected randomly between the experimental point and the randomly selected point, the subsequent nearest neighbor distance from the optimal point will be given by the random variable , where is a uniform random variable between and . The distance in each dimension will be reduced by a number selected based on a uniform random variable, and consequently, we will approximate the distance (sum of such distances) to be reduced by a number selected based on a uniform random variable. Thus, after points have been selected sequentially around the optimal point, the distance to the optimal point will be given by the random variable . The probability distribution function of the multiplication of random variables with uniform distribution between is given by [18] Thus, if the expected distance from the optimal point after the initial selection of points is and we select points sequentially between the optimal point and its current nearest neighbor, the expected nearest neighbor distance from the optimal point will be .

As an example, if , , and , we have from Table 2. The expected distance from the optimal point at the end of iterative steps will be assuming a single hill and a probability for the focused search (path of Step 2 of the algorithm). Based on the focused search probability, at the end of iterations, we expect to have points selected around the optimal point.

##### 3.2. Conclusions

In this paper, we proposed a diverse stochastic search (DSS) algorithm that consisted of a parallel and sequential phase that outperformed existing efficient algorithms for drug cocktail generations on nine different examples (thirteen different evaluations). Our results show that the DSS algorithm is more efficient than the previous algorithms in terms of decreasing the number of experiments required to generate the optimum drug combination (i.e., cost of the algorithm) which in turn reduces the total cost of the drug combination search process in terms of time and money. Note that the primary costs in each sequential biological experimental step are in personnel effort to prepare the drug combination and the time involved to generate the combination drug response and thus the goal is to reduce the number of sequential steps. One of the limitations of the current approach is the computational complexity when the number of drugs () is large. The proposed method is suitable for selection of optimal drug concentrations when the number of candidate drugs has been reduced from hundreds to around ten. A number of approaches can be applied to achieve the selection of candidate drugs. For instance, an application of a drug screen to measure cell viability of the tumor culture can be utilized to narrow down the drugs to be included in a combination drug cocktail [19, 20]. Note that algorithms [19, 20] are convenient for selecting the small set of drugs for combination therapy but not for deciding on the optimal drug concentrations of the selected drugs. For possible clinical application, available genetic information can be utilized to narrow down the possible drugs to be tested and the proposed DSS algorithm can be applied to tumor cell cultures to generate the optimal concentrations of the drugs. In this paper, we also presented a theoretical analysis of the search based on minimum distance between the optimal point and the DSS selected points. Future research directions will consider incorporating the cost of drug application in the optimization process and the effect of data extraction noise on the search algorithm. The cost will be a measure of toxicity or side effects of the drug combination. One approach to incorporate the cost can consist of changing the sensitivity surface by negating the cost from the sensitivity. The cost can be simplified to be proportional to the linear addition of the individual drug concentrations. Another approach for incorporating the cost entails restricting the search space by limiting the search to areas that have cost lower than a toxicity threshold.

#### 4. Methods

In this section, we present the search algorithm along with the surface mapping algorithm based on currently available information. Finally, we discuss the reasoning behind the selection of the algorithm related parameters.

##### 4.1. The Search Algorithm

The primary objective of the search algorithm is to locate the global maximum in minimum number of iteration steps. Numerous approaches can be considered for this purpose and our proposed method is based on a combination of stochastic and deterministic approaches. We expect that the efficiency in terms of average number of iteration steps can be increased by large jumps over the search space rather than using traditional step-by-step gradient descent approach. Our algorithm consists of two parts: an initial parallel part and a subsequent iterative segment. The objective of the initial part is to generate a rudimentary idea of the search space. The objective of the iteration part is twofold: (a) it tries to find the exact maximum using the currently available knowledge and (b) it searches the space further to add new knowledge, that is, it attempts to find new hills that the previous iterations could not locate.

Step-by-step schema of the search algorithm is described as follows.

*Step 1. *Generation of Latin Hypercube Numbers (LHNs)(1)In this step, points in the given grid are selected for drug response experiments. We first generate points in the continuous search space based on LHN generation approach with the criterion of maximizing the minimum distance between these points. This approach assists in distributing the points homogeneously in the search space such that the maximum possible distance between a given target point and the nearest point whose coordinates are represented by LHN will be minimum. Consequently, we map these points to the nearest grid points and term these mapped points as approximate Latin hypercube points. We considered this continuous-discrete grid mapping to compare our results with the previous studies that utilized a grid structure for the search space.(2)In this step, experiments are conducted to determine the efficiency of the drug combinations determined by the approximate Latin hypercube points.

*Step 2. *Iterative Segment(1)Normalize the experimental drug efficacy results to numbers between 0 and 1. Then the th power of the normalized drug efficacies are considered where denotes the number of drugs. The power step emphasizes the hills of the distribution and the value is termed as* Power used for the inputs*.(2)Estimate the drug efficacies of the unknown grid points using the sensitivity surface estimation algorithm. The details of the surface estimation algorithm are explained in subsequent sections. At the end of this procedure, we have estimates for the efficacies of each and every point on the search grid. The grid points are classified into two groups: known points from experimental data and estimated points based on interpolation and extrapolation.(3)Decide the objective of the iteration step based on a probabilistic approach. For our case, the algorithm follows* path a* to find the exact maximum based on previous knowledge with a probability and follows* path b* with a probability to explore the search space with a diversified approach.(4)*Path a* (Focused Search)(a)The main idea of the focused search is to experimentally search the estimated maximums generated following the surface estimation mapping. The algorithm also tries to avoid focusing on an individual local maximum by exploring geographically apart multiple estimated local maximums. To achieve this purpose, we employ a tracking algorithm to label the local maximums and avoid prolonged emphasis on individual maximum points. The individual steps of the Focused Search part are described as follows.(b)Sort the grid sensitivities (both experimental and estimated sensitivities) from higher to lower sensitivity values.(c)Check if the location corresponding to the highest sensitivity is an experimental point or an estimated point. If it is an estimated point, generate the experimental sensitivity for this grid location.(d)If the highest sensitivity point is an experimental point, check the second highest point. If the second highest point is an estimated point, generate the experimental value for this grid location.(e)If the second highest point is also an experimental point, generate the gradient from the second highest point based on the mapped surface. If the upward path from the second highest point leads up to the highest point, label both the points as 1, which implies that they belong to the same hill. Otherwise, label the highest point as 1 and label the second highest point as 2, which indicates that they belong to different hills.(f)Repeat this procedure till an estimated point is located. Meanwhile, keep labeling the experimental points with respect to the hill they belong to and the order of the point on the hill (ex: 3rd highest point in hill 2, etc.).(g)If a hill's highest points are experimental points, then label the hill as discovered, which indicates that we have enough information on this hill and collecting information on other hills might be more beneficial.(h)If the search continues till of grid points without finding a suitable candidate for experimentation, halt the search. Locate all the considered points that are inside a sphere of volume of the whole search space with center being the highest sensitivity point. Assign a value of “0” for the sensitivities of all points inside this sphere. (Maintain the record of their actual values in another place). Then go to the beginning of Step 2.(5)*Path b* (Diverse Search)(a)The aim of the diverse search is to explore the space to locate new possible candidate hills that were not discovered in the previous searches.(b)Assume that the surface generated by the experimental and estimated points is a probability distribution function (PDF).(c)Generate points by sampling this distribution. For the generation process, we use the Gibbs sampling algorithm. The number of points generated by the Gibbs algorithm is termed as* Number of points to generate the Gibbs sampling*. Since the points are generated from sampling the PDF, the points are denser around the hills and less dense at locations where the efficacy estimate is close to 0.(d)Randomly select one of the generated points as the candidate point and generate its sensitivity experimentally.

##### 4.2. Sensitivity Surface Estimation Algorithm

The sensitivity surface estimation algorithm used for our approach is established on the dimensional application of the penalized least square regression analysis based on discrete cosine transform (LS-DTC) proposed by Garcia [21, 22]. The code is generated to compute missing values in data sets. The estimation algorithm contains a parameter , termed as* smoothing parameter* that determines the smoothness of the output. For our case, the smoothness parameter is adjusted to a small value so that the result of surface estimation goes through the actual experimental points. The core of the algorithm is based on minimizing the equation , where wRSS corresponds to weighted residual sum of squares and is the penalty function related to the smoothness of the output. wRSS can be written explicitly as , where represents the actual data with missing values and provides the estimate of the data without any missing values. is a diagonal matrix, whose entries represent the reliability of the points and can take values between 0 and 1. For our case, the missing values represent unknown points and are assigned a value of 0 and the experimental points are reliable points which are assigned a value of 1. The solution to that minimizes can be generated based on an iterative process starting from an arbitrary initial point .

##### 4.3. Choice of Parameters

The implementation of the proposed algorithm includes several parameters that can affect the performance of the search process. In this subsection, we present the guiding principles behind the selection of the parameters based on the dimensionality and the total number of grid points in the search space.

###### 4.3.1. Latin Hypercube Numbers (LHNs)

Denote the number of points that will be tested in Step 1 of the algorithm. These points are supposed to provide an initial estimate of the search space. Based on simulations and theoretical analysis, we observe that increasing the number of LHNs provides limited benefit in terms of reaching the maximum sensitivity combination after a certain point. On the other hand, keeping this number too low will cause the program to start the second step with limited knowledge and to search low sensitivity locations. Thus, there is an optimum number of Latin hypercube numbers to maximize the benefit of the algorithm. Although this optimum number depends on the search space; our simulations for 4 surfaces with two different LHNs (10 and 40) illustrate that the proposed algorithm provides better results than ARU algorithm for a fairly large interval of LHNs.

###### 4.3.2. Latin Hypercube Iterations

The Latin hypercube numbers are distributed homogeneously through an iterative algorithm. The iterations maximize the minimum distance between the points. It is desirable to have a higher number of iterations, but after a point, the benefits of increasing the iterations become negligible. For our simulations, we selected a threshold point following which the increase in the maximum minimum distance is negligible.

###### 4.3.3. Number of Iterations to Generate the Sensitivity Surface Estimate

This parameter is related to sensitivity surface estimation algorithm and describes the number of iterations used to find a smooth surface passing through the given points in high dimensional space. A higher value for this parameter will provide a smoothed surface (that still passes through the experimental points) but will carry a high computational time cost. Furthermore, the benefits of increasing the iterations become negligible after a threshold and the output surface becomes more stable. For our examples, we have fixed this number to 100.

###### 4.3.4. Probability of Focused Search

Denote the probability that the search algorithm follows . attempts to discover the exact local maximum of a hill, and attempts to learn new hills. For all our simulations, this parameter has been assigned a value of 0.3.

###### 4.3.5. Power Used by Inputs

This parameter attempts to emphasize the hills. After normalizing the experimental points, we take the th power of the values so that the high peaks are emphasized as compared to dips or grid points with average values in the estimated surface. Thus, the probability of point selection around hills is increased during the Gibbs sampling process.

###### 4.3.6. Number of Points Generated by the Gibbs Sampling

This parameter describes the number of points generated by the Gibbs sampling in of Step 2 of the algorithm. More points provide a better representation of the estimated surface. After a level, the number of points is sufficient to represent the probability distribution and the benefits of increasing the iterations become negligible. We achieve better sampling by increasing this parameter. This parameter is required to be large for problems in higher dimensions or problems containing a huge number of grid points. For our examples, if the number of grid points is below 7500, this parameter has been assigned a value equal to twice the number of grid points. Otherwise, we have fixed the number of the Gibbs sampling points to 15,000.

###### 4.3.7. Clustering Related Parameters

The clustering concept is introduced to avoid the search being stuck in one dominant hill.

###### 4.3.8. Cluster Threshold

This denotes the maximum number of experimental points in an individual hill. Further exploration of the hill is paused once this value is reached. For our examples, if the number of drugs (dimensions) is less than 5, is assigned a value of . Otherwise, the parameter is fixed at 7.

###### 4.3.9. Cluster Break

This parameter denotes the maximum number of high efficacy point estimates in a single hill. If this condition is reached, we assign a value of 0 sensitivity for points around the known top of the hill. This parameter is considered to be around of the total grid points.

###### 4.3.10. Cluster Distance

This parameter represents the radius of the sphere around the hill top for which any grid point within the sphere is assigned a value of 0. The Cluster Distance is selected such that the volume of the sphere is of the total volume. The parameter considers that the algorithm has no knowledge of the hills that are narrower than the 0.2% of the total search space.

#### Conflict of Interests

The authors declare that they have no competing interests.

#### Authors’ Contribution

Mehmet Umut Caglar and Ranadip Pal conceived and designed the algorithms, Mehmet Umut Caglar performed the simulation experiments, Mehmet Umut Caglar and Ranadip Pal analyzed the results, and Mehmet Umut Caglar and Ranadip Pal wrote the paper. All authors read and approved the final paper.

#### Acknowledgment

This work was supported in part by NSF Grant CCF 0953366.

#### References

- C. L. Corless, A. Schroeder, D. Griffith et al., “PDGFRA mutations in gastrointestinal stromal tumors: frequency, spectrum and in vitro sensitivity to imatinib,”
*Journal of Clinical Oncology*, vol. 23, no. 23, pp. 5357–5364, 2005. View at Publisher · View at Google Scholar · View at Scopus - J. A. Engelman, K. Zejnullahu, T. Mitsudomi et al., “MET amplification leads to gefitinib resistance in lung cancer by activating ERBB3 signaling,”
*Science*, vol. 316, no. 5827, pp. 1039–1043, 2007. View at Publisher · View at Google Scholar · View at Scopus - A. Dubrovska, J. Elliott, R. J. Salamone et al., “Combination therapy targeting both tumor-initiating and differentiated cell populations in prostate carcinoma,”
*Clinical Cancer Research*, vol. 16, no. 23, pp. 5692–5702, 2010. View at Publisher · View at Google Scholar · View at Scopus - B. Al-Lazikani, U. Banerji, and P. Workman, “Combinatorial drug therapy for cancer in the post-genomic era,”
*Nature Biotechnology*, vol. 30, no. 7, pp. 679–692, 2012. View at Publisher · View at Google Scholar - J. Barretina, G. Caponigro, N. Stransky et al., “The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity,”
*Nature*, vol. 483, pp. 603–607, 2012. View at Publisher · View at Google Scholar - M. J. Garnett, E. J. Edelman, S. J. Heidorn et al., “Systematic identification of genomic markers of drug sensitivity in cancer cells,”
*Nature*, vol. 483, pp. 570–575, 2012. View at Publisher · View at Google Scholar - M. L. Sos, K. Michel, T. Zander et al., “Predicting drug susceptibility of non-small cell lung cancers based on genetic lesions,”
*Journal of Clinical Investigation*, vol. 119, no. 6, pp. 1727–1740, 2009. View at Publisher · View at Google Scholar · View at Scopus - A. A. Borisy, P. J. Elliott, N. W. Hurst et al., “Systematic discovery of multicomponent therapeutics,”
*Proceedings of the National Academy of Sciences of the United States of America*, vol. 100, no. 13, pp. 7977–7982, 2003. View at Publisher · View at Google Scholar · View at Scopus - M. Wadman, “The right combination,”
*Nature*, vol. 439, pp. 390–391, 2006. View at Publisher · View at Google Scholar · View at Scopus - G. R. Zimmermann, J. Lehár, and C. T. Keith, “Multi-target therapeutics: when the whole is greater than the sum of the parts,”
*Drug Discovery Today*, vol. 12, no. 1-2, pp. 34–42, 2007. View at Publisher · View at Google Scholar · View at Scopus - R. G. Zinner, B. L. Barrett, E. Popova et al., “Algorithmic guided screening of drug combinations of arbitrary size for activity against cancer cells,”
*Molecular Cancer Therapeutics*, vol. 8, no. 3, pp. 521–532, 2009. View at Publisher · View at Google Scholar · View at Scopus - D. Calzolari, S. Bruschi, L. Coquin et al., “Search algorithms as a framework for the optimization of drug combinations,”
*PLoS Computational Biology*, vol. 4, no. 12, Article ID e1000249, 2008. View at Publisher · View at Google Scholar · View at Scopus - K. W. Pak, F. Yu, A. Shahangian, G. Cheng, R. Sun, and C.-M. Ho, “Closed-loop control of cellular functions using combinatory drugs guided by a stochastic search algorithm,”
*Proceedings of the National Academy of Sciences of the United States of America*, vol. 105, no. 13, pp. 5105–5110, 2008. View at Publisher · View at Google Scholar · View at Scopus - B.-J. Yoon, “Enhanced stochastic optimization algorithm for finding effective multi-target therapeutics,”
*BMC Bioinformatics*, vol. 12, supplement 1, article S18, 2011. View at Publisher · View at Google Scholar · View at Scopus - M. Kim and B. J. Yoon, “Adaptive reference update (ARU) algorithm. A stochastic search algorithm for efficient optimization of multi-drug cocktails,”
*BMC Genomics*, vol. 13, supplement 6, article S12, 2012. View at Google Scholar - A. A. Borisy, P. J. Elliott, N. W. Hurst et al., “Systematic discovery of multicomponent therapeutics,”
*Proceedings of the National Academy of Sciences of the United States of America*, vol. 100, no. 13, pp. 7977–7982, 2003. View at Publisher · View at Google Scholar · View at Scopus - D. P. Bertsekas and J. N. Tsitsikilis,
*Introduction to Probability*, Athena Scientific, 2nd edition, 2008. - C. P. Dettmann and O. Georgiou, “Product of n independent uniform random variables,”
*Statistics and Probability Letters*, vol. 79, no. 24, pp. 2501–2503, 2009. View at Publisher · View at Google Scholar · View at Scopus - N. Berlow, L. E. Davis, E. L. Cantor, B. Seguin, C. Keller, and R. Pal, “A new approach for prediction of tumor sensitivity to targeted drugs based on functional data,”
*BMC Bioinformatics*, vol. 14, article 239, 2013. View at Publisher · View at Google Scholar - R. Pal and N. Berlow, “A kinase inhibition map approach for tumor sensitivity prediction and combination therapy design for targeted drugs,” in
*Proceedings of the Pacific Symposium on Biocomputing*, vol. 17, pp. 351–362, World Scientific, 2012. - D. Garcia, “Robust smoothing of gridded data in one and higher dimensions with missing values,”
*Computational Statistics and Data Analysis*, vol. 54, no. 4, pp. 1167–1178, 2010. View at Publisher · View at Google Scholar · View at Scopus - G. Wang, D. Garcia, Y. Liu, R. de Jeu, and A. Johannes Dolman, “A three-dimensional gap filling method for large geophysical datasets: application to global satellite soil moisture observations,”
*Environmental Modelling & Software*, vol. 30, pp. 139–142, 2012. View at Publisher · View at Google Scholar · View at Scopus