Abstract

Various aspects of the interaction of HIV with the human immune system can be modeled by a system of ordinary differential equations. This model is utilized, and a multiobjective optimal control problem (MOOCP) is proposed to maximize the CD4+ T cells population and minimize both the viral load and drug costs. The weighted sum method is used, and continuous Pareto optimal solutions are derived by solving the corresponding optimality system. Moreover, a model predictive control (MPC) strategy is applied, with the final goal of implementing Pareto optimal structured treatment interruptions (STI) protocol. In particular, by using a fuzzy approach, the MOOCP is converted to a single-objective optimization problem to derive a Pareto optimal solution which among other Pareto optimal solutions has the best satisfaction performance. Then, by using an embedding method, the problem is transferred into a modified problem in an appropriate space in which the existence of solution is guaranteed by compactness of the space. The metamorphosed problem is approximated by a linear programming (LP) model, and a piecewise constant solution which shows the desired combinations of reverse transcriptase inhibitor (RTI) and protease inhibitor (PI) drug efficacies is achieved.

1. Introduction

Human Immunodeficiency Virus (HIV) infects CD4+ T-cells, which are an important part of the human immune system, and other target cells. The infected cells produce a large number of viruses. Medical treatments for HIV have greatly improved during the last two decades. Highly active antiretroviral therapy (HAART) allows for the effective suppression of HIV-infected individuals and prolongs the time before the onset of Acquired Immune Deficiency Syndrome (AIDS) for years or even decades and increases life expectancy and quality for the patient but eradication of HIV infection does not seem possible with currently available antiretroviral drugs. This is due primarily to the establishment of a pool of latently infected T-cells and sites within the body where drugs may not achieve effective levels [13]. HAART contains two major types of anti-HIV drugs, reverse transcriptase inhibitors (RTI) and protease inhibitors (PI). Reverse transcriptase inhibitors prevent HIV from infecting cells by blocking the integration of the HIV viral code into the host cell genome. Protease inhibitors prevent infected cells from replication of infectious virus particles and can reduce and maintain viral load below the limit of detection in many patients. Moreover, treatment with either type of drug can also increase the CD4+ T-cell count that are target cells for HIV.

Many of the host-pathogen interaction mechanisms during HIV infection and progression to AIDS are still unknown. Mathematical modeling of HIV infection is of interest to the medical community as no adequate animal models exist to test efficacy of drug regimes. These models can test different assumptions and provide new insights into questions that are difficult to answer by clinical or experimental studies. A number of mathematical models have been formulated to describe various aspects of the interaction of HIV with healthy cells [4]. The basic model of HIV infection is presented by Perelson et al. [5] that contains three state variables: healthy CD4+ T-cells, infected CD4+ T-cells, and concentration of free virus. Another model is presented in [6] that although maintaining a simple structure, it offers important theoretical insights into immune control of the virus based on treatment strategies. Furthermore, this modified model is developed to describe the natural evolution of HIV infection, as qualitatively described in several clinical studies [7].

The problem of designing drug administration in HIV infected patients using mathematical models can be considered as multi-objective optimal control problems. For example, these objectives may include maximizing the level of healthy CD4+ T-cells and minimizing the cost of treatment [815], maximizing immune response and minimizing both the cost of treatment and viral load [1621], maximizing both the level of healthy CD4+ T-cells and immune response and minimizing the cost of treatment [22], maximizing the level of healthy CD4+ T-cells while minimizing both the side effects and drug resistance [23], and maximizing survival time of patient subject to drug cost [24].

When a multi-objective problem is treated, each objective conflicts with one another and, unlike a single-objective optimization, the solution to this problem is not a single point, but a family of solutions known as the Pareto-optimal set. Among these solutions, the designer should find the best compromise taking into proper account the attributes and characteristics of the handled problem.

There exists a wide variety of methods that can be used to compute Pareto optimal points. A widely used technique consists of reducing the multi-objective problem to a single-objective one by means of “scalarization” procedure. The weighted sum (WS) method is a commonly used scalarization technique which consists of assigning each objective function a weight coefficient and then optimizing the function obtained by summing up all the objective functions scaled by their weight coefficients. Nevertheless, it has intrinsic drawbacks. The WS method is highly scale dependent, an equal distribution of weights does not necessarily lead to an even spread along the Pareto front, and that points in a nonconvex part of the Pareto front cannot be obtained [25].

Recent scalar multiple objective optimization techniques such as Normal Boundary Intersection (NBI) [26] and Normalized Normal Constraint (NNC) [27] have been found to mitigate the disadvantages of the WS method. Recently, these methods have been successfully combined with direct optimal control approaches for the efficient solution of multi-objective optimal control problems. For example, in [28], a successful application of NBI and NNC for the multiple objective optimal control of (bio) chemical processes has been reported, and in [29] several scalarization techniques for multi-objective optimization, for example, WS, NNC, and NBI have been integrated with fast deterministic direct optimal control approaches.

The papers [8, 13, 22, 24, 30, 31] consider only RTI medication while the papers [14, 32] consider only PIs. In [6, 1921], all effects of a HAART medication are combined to one control variable in the model. In [7, 1012, 15, 17, 18, 3335], dynamical multidrug therapies based on RTIs and PIs are designed.

In the considered control approaches, the amount of medications can be either continuous or on-off type. This treatment is also known as structured treatment interruption (STI). STI has received considerable attentions as it might reduce the risk of HIV mutating to strains which are resistant to current medication regimens. STI approach might also reduce possible long-term toxicity of the drugs. A concise summary of clinical STI studies, including protocols and results, is presented in [35].

In this paper, we consider a mathematical model of HIV dynamics that includes the effect of antiretroviral therapy, and we perform analysis of optimal control regarding maximizing the CD4+ T-cell counts and minimizing both the viral load and cost of drugs.

The paper is organized as follows: in Section 2, the underlying HIV mathematical model is presented. Our formulation of the MOOCP is described in Section 3. In Section 4, the weighted sum method is used, and the MOOCP is converted to a single-objective optimal control problem. First, we assume that the cost of the drug regime varies quadratically with the amount of drug administered, and then we characterize the continuous Pareto optimal solutions using Pontryagin’s Maximum Principle. Secondly, it is shown that the presence of linear cost functionals leads to STI-type Pareto optimal solutions, and an MPC-based technique is proposed to find this kind of solutions. In Section 5, a fuzzy approach is utilized and the MOOCP is converted to a single-objective optimization problem, with the final goal of finding a Pareto optimal solution which has the best satisfaction performance among other Pareto optimal solutions. Moreover, the converted problem is approximated by an LP model applying a measure-theoretical approach, and a piecewise constant solution is achieved. Some numerical experiments are provided in Section 6. The last section is the conclusion.

2. Presentation of a Working Model

In this paper, the pathogenesis of HIV is modeled with a system of ordinary differential equations (ODEs) described in [7]. This model can be viewed as an extension of basic HIV Model of Perelson et al. [5] Most of the terms in the model have straightforward interpretations as following.

The first equation represents the dynamics of the concentration of healthy CD4+ T-cells (). The healthy CD4+ T-cells are produced from a source, such as the thymus, at a constant rate , and die at a rate . The cells are infected by the virus at a rate . The second equation describes the dynamics of the concentration of infected CD4+ T-cells (). The infected CD4+ T-cells result from the infection of healthy CD4+ T-cells and die at a rate and are killed by cytotoxic T-lymphocyte effectors CTLe () at a rate . The population of cytotoxic T-cells is divided into precursors or CTLp (), and effectors or CTLe (). Equations (2.3) and (2.4) describe the dynamics of these compartments. In accordance with experimental, findings establishment of a lasting CTL response depends on CD4+ T-cell help, and that HIV impairs T-helper cell function. Thus, proliferation of the CTLp population is given by and is proportional to both virus load () and the number of uninfected T-helper cells (). CTLp differentiation into effectors occurs at a rate . Finally, CTLe dies at a rate . Equation (2.5) describes the dynamics of the free virus particles (). These free virus particles are produced from infected CD4+ T-cells at a rate and are cleared at a rate . The model also contains an index of the intrinsic virulence or aggressiveness of the virus (). This index increases linearly in the case of an untreated HIV-infected individual, with a growth rate that depends on the constant . Finally, (2.6) describes the dynamic of this index. In model variables and denote protease inhibitors (PI) and reverse transcriptase inhibitors (RTI), respectively. The effect of PI drugs is modeled by reducing the proliferation rate of viruses from infected cells, while the effect of RTI drugs is modeled by reducing the infection rate, and in this way, blocking the infection of CD4+ T-cells by free virus. Hence, in this model the RTI drugs have an effect on virulence because their main role is halting cellular infection and preventing virus production by reducing the production rate from infected CD4+ T cells.

The model has several parameters that must be assigned for numerical simulations. The descriptions, numerical values, and units of the parameters are summarized in Table 1. These descriptions and values are taken from [7]. We note that (2.1)–(2.6) with these parameters model dynamics of fast progressive patients (FPP).

3. Multiobjective Optimal Control Formulation

A problem arising from the use of most chemotherapy is the multiple and sometimes harmful side effects, as well as the ineffectiveness of treatment after a certain time due to the capability of the virus to mutate and become resistant to the treatment. Although we do not intend to model effects of resistance or side effects, we impose a condition called a limited treatment window, that monitors the global effects of these phenomena. The treatment lasts for a given period from time to , say. In clinical practice, antiretroviral therapy is initiated at , the time at which CD4+ T-cell counts reach 350 cells/μL. We would like to maximize levels of healthy CD4+ T-cells, minimize level of viral load, and also we want to minimize the systemic costs of treatments. Thus, the following objective functionals are to be maximized simultaneously where and are the solution of ODEs (2.1)–(2.6) corresponding to control pair and is a positive integer. Let be the set of all measurable control pairs that Therefore, the optimal drug regimen problem can be represented as In general, there does not exist a pair of control functions that renders the maximum value to each functional , simultaneously, and one uses the concept of Pareto optimality in the sense of following definition.

Definition 3.1. A pair is said to be an Pareto optimal solution of the problem (3.3) if, and only if, there exist no such that for all , and for some .

As seen from Definition 3.1, in general there exist an infinite number of Pareto optimal solutions. Actually, we should select one control function among the set of Pareto optimal solutions. We are going to find a Pareto optimal solution of problem (3.3), combining the techniques and methods from the classic multi-objective optimization and control theory.

4. The Weighted Sum Method

The weighted sum method scalarizes set of objectives into a single-objective by multiplying each objective with user supplied weights. The weight of an objective is usually chosen in proportion to the objective’s relative importance in the problem. After, a composite objective functional can be formed by summing the weighted objectives and the MOOCP given in (3.3) is converted to a single-objective optimization problem as follows:

Theorem 4.1. The solution of the weighted sum problem (4.1) is Pareto optimal if the weighting coefficients are positive, that is, for all .

Proof. Assume is the optimal solution of problem (4.1). If is not a Pareto optimal solution, then there is such that for all , and for some . Since for all , thus, , and this contradicts the optimality of .

4.1. Continuous Solutions

Here, we followed [8, 32] in assuming that systemic costs of the PI and RTI drugs treatment are proportional to and at time , respectively, (). We note that the existence of the solution can be obtained using a result by Fleming and Rishel [36]. That is rather straightforward to show that the right-hand sides of (2.1)–(2.6) are bounded by a linear function of the state and control variables and that the integrand of the composed objective functional is convex on and is bounded. We now proceed to compute candidates for a Pareto optimal solution. To this end, we apply the Pontryagin’s Maximum Principle [37] and begin by defining the Lagrangian (which is the Hamiltonian augmented with penalty terms for the constraints) to be where are the penalty multipliers satisfying Here, is the optimal control pair yet to be found. Thus, the Maximum Principle gives the existence of adjoint variables satisfying where , are the transversality conditions. The Lagrangian is maximized with respect to and at the optimal pair , so the partial derivatives of the Lagrangian with respect to and at and are zero. That is, and .

Since

then we have

To determine an explicit expression for the optimal control without , we consider the following three cases.(i)On the set , we set . Hence, the optimal control is (ii)On the set , we set . Hence, the optimal control is which implies that (iii)On the set , we set . Hence, the optimal control is which implies that

Combining all the three cases in compact form gives Using similar arguments, we also obtain the following expression for the second optimal control function We point out that the optimality system consists of the state system (2.1)–(2.6) with the initial conditions, the adjoint (or co-state) system (4.4) with the terminal conditions, together with the expressions (4.12) and (4.13) for the control functions. The results of implementing the optimal control policy obtained by this procedure are shown in Section 6.

4.2. STI Solution

From a therapeutic point of view, it may be unsafe to administrate drugs at a dose less than maximum because virus mutations may occur (see, e.g., [21] and references therein for a more exhaustive discussion on this point). Therefore, standard HAART protocols require persistent drugs uptake at maximum value. However, a number of clinical and theoretical studies attempted STI protocols in which periods of therapy at maximum dosages are alternated with periods of treatment suspension [21, 38]. The reasons for these attempts can be found in several side effects of HAART, such as serious hepatic damages and the high cost of the therapy, but also in the evidence that appropriate suspension periods may enhance the immune response of the patient.

In this section, we follow [24] in assuming that the cost of the drug regime varies linearly with the amount of drug administered. Thus, and are considered as follow:

We now proceed to compute candidates for a Pareto STI solution. To this end, we apply the Pontryagin’s Maximum Principle again and begin by defining the Hamiltonian to be:

and the maximum principle requires that in which the optimal controls and corresponding states and costates satisfying (2.1)–(2.6) and (4.4) are indicated by a star superscript *. From (4.16) and by performing straightforward calculations it is concluded that the optimal pair should maximize the following expression: Therefore, where and are the switching functions which determine the type of optimal control pair (). Now we consider an opportunity for the control pair to contain singular arcs. Let us assume that the switching function is zero on the singular interval . To find a singular control , we use the fact that , and , on ; hence, . Therefore, from the last two equations (4.4), it is concluded that where . Since , equating right-hand sides of first two equations (4.4) and then substituting (4.19) into the resulting expression yield where, Differentiating both sides of (2.1) and then substituting (4.20) into the resulting expression, lead to

where Obviously, can be calculated using the Chain rule. Now, we find the value of singular control . With respect to this fact that , and from second last equation (4.4) and (2.2) it is easy to see that during the corresponding singular interval, say , Moreover, differentiating both sides of (2.2) and then substituting (4.24) into the resulting expression yield where

Note that the sets and are considered such that . Moreover, for the control pair to be optimal, the Kelley’s condition [39] must be satisfied where and are known as the order of singularity.

The above discussion shows that using linear cost functionals, while ignoring singular arcs, leads to an STI solution. Unfortunately, solving the corresponding optimality system is not analytically possible and the gradient method [40] fails to converge; therefore, we resort to model predictive control- (MPC-) based approach. MPC is a control technology widely used in many areas, especially in the process industries [41], for systems with a large number of controlled and manipulated variables, which interact significantly. In general, feedback control technologies, and MPC in particular, have started to gain significant attention in the biomedical area [12, 1921]. A thorough overview of the history of MPC and its various incarnations can be found in [42]. In order to give a formal description of the proposed MPC algorithm, some preliminary definitions are necessary. Let be the sampling time, we denote with the state of the system at time , that is, Next, let denote the vector of PI and RTI drugs taken in the time interval , and rewrite the system model (2.1)–(2.6) in the integrated form , where is a vector-value function obtained by numerical integration of (2.1)–(2.6) over the sampling , assuming a constant drug uptake during the sampling time. With these definitions, it is now possible to state the following finite-horizon optimal control problem (FHOCP) to be solved at each discrete time : in which is an integer(control horizon), and is the decision variable. Let be a maximizing control input sequence. The first input in the optimal sequence, that is, , is injected into the system, and the entire optimization is repeated at subsequent control intervals.

5. Piecewise Constant Solution Using Fuzzy Aggregation and Embedding Method

In this section, as in continuous solutions, we set . With respect to the nature of controls and , that indicate the PI and RTI drug efficacies as a function of time, piecewise constant control is more practical from the clinical point of view. In this section, we are going to propose such kind of solutions for problem (3.3).

Setting, and , the system of differential equations (2.1)–(2.6) can be represented in a generalized form asIn the absence of importance on the objectives and without knowledge of the possible level of attainment for those objectives, finding a Pareto optimal solution that the best satisfies the decision maker can be viewed as a fuzzy problem. In this situation, for each of the objective functionals, , of problem (3.3) we associate a linear membership function as where or denotes the value of the objective functional such that the degree of membership function is 0 or 1, respectively, and it is depicted in Figure 1. Following the principle of the fuzzy decision of Bellman and Zadeh [43], the MOOCP (3.3) can be interpreted as the following problem: Let be a given small positive number. Introducing an auxiliary variable and by adding the augmented term to it, a solution to problem (3.3), can be obtained by solving the following problem:

Theorem 5.1. If is the optimal solution of the problem (5.4)–(5.6), then is a Pareto optimal solution of (3.3).

Proof. If is not a Pareto optimal solution for the problem (3.3), then there exists such that for all , and for some . Obviously, and . Therefore, . This contradicts the assumption that () is the optimal solution for problem (5.4)–(5.6).

Using the measure theory for solving optimal control problems based on the idea of Young [44], which was applied for the first time by Wilson and Rubio [45], has been theoretically established by Rubio in [46]. In the rest of this section, we follow their approach and we transfer the problem (5.4)-(5.6) into a modified problem in an appropriate space that the existence of the optimal solution () is guaranteed.

5.1. Transformation to Functional Space

We assume that state variable and control input get their values in the compact sets and , respectively. Set .

Definition 5.2. A pair is said to be admissible if the following conditions hold.(i)The vector function is absolutely continuous and belongs to for all .(ii)The function takes its values in the set and is Lebesgue measurable on .(iii) satisfies in (5.5) and (5.6) on , that is, the interior set of .We assume that the set of all admissible pairs is nonempty and denote it by . Let be an admissible pair and an open ball in containing , and the space of all real-valued continuous differentiable functions on it. Let and define as follows: for each , where . The function is in the space , the set of all continuous functions on the compact set . Since is an admissible pair, we have for all . Let be the space of infinitely differentiable all real-valued functions with compact support in . Define Assume be an admissible pair. Since the function has compact support in , so, . Thus, for , and for all , by integrating both sides of (5.9) and using integration by parts, we have Also by choosing the functions which are dependent only on time, we have where is the space of all functions in that depend only on time and is the integral of on . Equations (5.8), (5.10), and (5.11) are weak forms of (5.6). Now, we consider the following positive linear functional on

Proposition 5.3. Transformation of admissible pairs in into the linear mappings defined in (5.12) is an injection.

Proof. See [46].

Thus, the problem (5.4)–(5.6) is converted to the following optimization problem in functional space: where , , , , and .

5.2. Transformation to Measure Space

Let denote the space of all positive Radon measures on . By the Riesz representation theorem [46], there exists a unique positive Radon measure on such that So, we may change the space of optimization problem to measure space. In other words, the optimization problem in functional space (5.13)–(5.17) can be replaced by the following new problem in measure space: Obviously, takes its values in the compact set . We shall consider the maximization of (5.21) over the set of all pairs satisfying (5.20)–(5.23). The main advantages of considering this measure theoretic form of the problem is the existence of optimal pair , where this point can be studied in a straightforward manner without having to impose conditions such as convexity which may be artificial. Define function , as . The following theorem guarantees the existence of an optimal solution.

Theorem 5.4. Function attains its maximum on the set.

Proof. The set can be written as where We will show in the next section that (5.22) and (5.23) are special cases of (5.21). Therefore, the set can be written as , where It is well known that the set is compact in weak*-topology [46]. Furthermore, for , the set as intersection of inverse image of the closed singleton sets under the continuous functions is also closed. Thus, is a closed subset of a compact set. This proves the compactness of the set . Hence, the compactness of implies the compactness of with the product topology. Besides, for , the set as intersection of the inverse image of the closed sets under the continuous functions is also closed. Therefore, the set as a closed subset of the compact set is compact also. Since the function mapping the compact set on the real line is continuous so it has a maximum on the compact set .

5.3. Approximation

Since is an infinite-dimensional space, the problem (5.19)–(5.23) is an infinite-dimensional linear programming problem and we are mainly interested in approximating it. First, the maximization of is considered not over the set , but over a subset of it denoted by requiring that only a finite number of constraints (5.21)–(5.23) be satisfied. Let , , and be the sets of total functions, respectively, in , and .

Remark 5.5. From (5.7) and (5.9), it can be seen that (5.22) and (5.23) are also achieved from (5.21) by setting and , respectively.

Now, the first approximation will be completed by choosing finite number of total functions and using following propositions.

Proposition 5.6. Consider the linear program consisting of maximizing the function over the set of pairs in satisfying Then, tends to as .

Proof. We have and hence, . Therefore, is nonincreasing and bounded sequence then converges to a number such that . We show that, . Set . Then, and . It is sufficient to show that . Assume and . Since the Linear combinations of the functions are uniformly dense in , there is the sequence such that tends to uniformly as . Hence, , , and tend to zero as , where, We have and functional is linear. Therefore, and Since the right-hand side of the above inequality tends to zero as , while left-hand side is independent of , therefore, . Thus and which implies .

Proposition 5.7. An optimal pair in the set at which the function attains its maximum can be found where the measure has the form where , and is unitary atomic measure with the support being the singleton set , characterized by .

Proof. Assume that is an optimal solution of maximizing the function over the set . Denote the set of all measures satisfying in (5.27) for a fixed , by . The set is a weak*-compact subset of [46]. Therefore, from Theorem .5 of [46], has at least one extreme point in the form of (5.30).

Therefore, our attention is restricted to finding a measure in the form of (5.30) which maximizes the function and satisfies in (5.20) and K number of the constraints (5.21)–(5.23). Thus, by choosing the functions , , , and , the infinite-dimensional problem (5.19)–(5.23) is approximated by the following finite-dimensional nonlinear programming (NLP) problem: where . Clearly, (5.31) is an NLP problem with unknowns , ,. One is interested in LP problem. The following proposition enables us to approximate the NLP problem (5.31) by a finite-dimensional LP problem.

Proposition 5.8. Let be a countable dense subset of , where is a sufficiently large number. Given , a measure can be found such that where the measure has the form and the coefficients , are the same as optimal measure (5.30), and ,.

Proof. We rename the functions s, s, s, and s sequentially as . Then, for , The functions , are continues. Therefore, can be made less than by choosing , sufficiently near .

For constructing a suitable set , is divided to subintervals as follows: where . Moreover, the intervals and are divided, respectively, into and subintervals. So, the set is divided into cells. One point is chosen from each cell. In this way, we will have a grid of points, which are numbered sequentially as , .

The desired numbers and in definition of the membership functions can be set

Remark 5.9. With respect to this fact that the maximum level of healthy cells is attained with full HAART, we have . Similarly, the other numbers and can be calculated without explicitly solving auxiliary optimal control problems. The values of these variables are shown in Table 3.

Therefore, according to (5.33) the NLP problem (5.31) is converted to the following LP problem:

Here, we discuss suitable total functions , , and . The functions s can be taken to be monomials of and the components of the vector as follows:In addition, we choose some functions with compact support in the following form [46]:

Finally, the following functions are considered that are dependent on only:

where , are given by (5.35). These functions are used to construct the approximate piecewise constant controls [46]. Of course, we need only to construct the control function , since can be obtained by solving the ODEs (2.1)–(2.6). By using simplex method, a nonzero optimal solution , , of the LP problem (5.37) can be found where cannot exceed the number of constraints, that is, . Setting , a piecewise constant control function approximating the optimal solution is constructed based on these nonzero coefficients as follows [46]:

where and are, respectively, the 8th and the 9th components of grid point .

6. Numerical Results

In this section we show the response of the HIV model to the continuous, STI and piecewise constant controls presented in Sections 4 and 5.

6.1. Continuous Solutions

As we discussed in Section 4.1, in the case of continuous solutions the optimality systems, is a two-point boundary value problem because of the forward-in-time nature of the state system (2.1)–(2.6) with the initial conditions and the backward-in-time nature of the adjoint system (4.4) with the terminal conditions. We consider a gradient method for the solution of the optimal control problem. The algorithm proceeds as follows.(i)Choose an initial guess of control.(ii)Solve the state system forward in time with initial conditions using an initial guess of control.(iii)Solve the adjoint system backward in time with terminal conditions.(iv)Update the controls in each iteration using the optimality conditions (4.12) and (4.13).(v)Continue the iterations until convergence is achieved.

For further discussion of the gradient method, we refer the interested reader to [40]. We use the parameters for solving the optimality system from Table 1. Antiretroviral therapy is initiated at , the time at which the CD4+ T-cell count falls below 350 cells/μL, and treatment was simulated for 750 days. Furthermore, control variables and are bounded by , , , and , and the following initial condition is used (see [7]):We ran simulations with five different values of weight factor . The corresponding control functions are presented in Figure 2. As we increase and , thereby increasing the cost of therapy, the control functions are decreasing. In Figure 8, we plot only the response of the HIV model to the obtained control function corresponding to weights , , , and for brevity. Since the magnitude of CD4+ T cell, and virus population is much larger than the magnitude of the cost of the drugs treatment in the objective functional , these differences in magnitudes are balanced by this choice of . Moreover, the performance of the gradient method with this choice of is shown in Figure 3. The number of required iterations to achieve the convergence is 307. Note that in Figure 3, denotes the infinity norm, and denotes the state in the th iteration.

6.2. MPC-Based STI Solutions

We implement therapy protocols based on the MPC algorithm (4.29) described in Section 4. All computations are carried out by MATLAB. We choose a sampling time of five days, . Consequently, we do not worry about creating an explicit discretization of our differential equation; we simply use a numerical simulator to approximate our discretization. Moreover, we chose horizon length . In particular, a finite horizon and a finite control space mean that we have, for each horizon, a finite number of possible control sequences. We implement MPC by using the Simulated Annealing (SA) metaheuristic combined with the Steepest Descent Explorer [47] for searching this space.

We ran simulations with three different values of weight factor . The corresponding control functions are plotted in Figures 4, 5, and 6. In Figure 8, we present only the response of the HIV model to the STI solution corresponding to weights , , , and for brevity.

6.3. Piecewise Constant Solutions

In our implementation, we choose number of functions from the set as

Furthermore, we set and . By using controllability on the dynamical control system, the considered ranges for states and controls and the corresponding number of partitions in the construction of the set are summarized in Table 2. Note that the selected values from the set in the construction of are 0, 0.4, and 0.7. These values indicate off, moderate, and strong PI-therapy, respectively. Similarly, the corresponding values for RTI control are 0, , and [7]. The desired values for and are summarized in Table 3. Implementing the corresponding LP model (5.37), we achieve , which is close to the obtained number from Table 3, that is, . Figure 7 shows the resulting piecewise constant control pair. We solved the HIV model (2.1)–(2.6) with no treatment () and with fully efficacious treatment (). The numerical results for these two cases as well as the response of the systems to piecewise constant control, STI control, and continuous control are shown in Figure 8 for comparison. Except for STI solution, which is derived by considering linear cost functionals, the values of the objective functionals as well as corresponding values of the membership functions for other cases are given in Table 3.

From Figures 8(a) and 8(b), we see drop in the number of CD4+ T cells, and a rise in viral load following the initial infection until about the third month. After this time, CD4+ T cells start recovering and virus starts decreasing due to the immune response, but can never eradicate virus completely. Then CD4+ T cells level decreases and viral load increases due to destruction of immune system in absence of treatment. Figures 8(b) and 8(c) show a clear correlation between the CTLe and virus population. As the virus increases upon initial infection, CTLe increases in order to decrease the virus. Once this is accomplished, virus decreases. Then virus grows back slowly, and this triggers an increase in the CTLe population. CTLe further increases in an attempt to keep the virus at constant levels but lose the battle because of virus-induced impairment of CD4+ T cell function, in absence of treatment. Memory CTL responses depend on the presence of CD4+ T cell help. Figures 8(a) and 8(b) show that, in presence of treatment, the virus is controlled to very low levels and CD4+ T cells are maintained above the critical levels. Therefore, immune response expands for relatively long time successfully. Furthermore, these Figures indicate inverse coloration between viral load and CD4+ T cells level. From Figures 8(c) and 8(d) interestingly, a decrease in CTLs occur in response to therapy can be observed. The extent of the decrease is directly correlated with the increase in treatment effectiveness which is consistent with experimental findings [48]. From Figure 8(a), we conclude that in the presence of treatment, the level of CD4+ T cells decreases very slowly as compared with untreated patient. Moreover, this figure shows that the average CD4+ T cells concentration with piecewise constant control is higher than continuous control. Interestingly, from Figure 8(d) we see that the intensity of immune response in the presence of continuous control is more than full treated patient until about the 42th month, and the intensity of immune response with piecewise constant control is higher than other controls after about the 31th month.

7. Conclusion

In this paper, a six-dimensional HIV model is considered, and a multi-objective optimal control problem is proposed to design therapy protocols to treat the HIV infection. The model permits drug therapy (RTIs and PIs) as controllers. We derived continuous treatment strategies by solving the corresponding optimality systems with the gradient method. In addition, structured treatment interruption (STI) protocols are found by using ideas from model predictive control. The results of simulations show that the proposed MPC-based method can effectively be applied to find STI-type solutions. In particular, a novel idea based on fuzzy aggregation and measure theory is proposed to find a compromise Pareto optimal solution, and numerical results confirmed the effectiveness of this approach. The method is not iterative and it does not need any initial guess of the solution and it can be utilized to solve multi-objective optimal control problems.