- About this Journal ·
- Abstracting and Indexing ·
- Advance Access ·
- Aims and Scope ·
- Annual Issues ·
- Article Processing Charges ·
- Articles in Press ·
- Author Guidelines ·
- Bibliographic Information ·
- Citations to this Journal ·
- Contact Information ·
- Editorial Board ·
- Editorial Workflow ·
- Free eTOC Alerts ·
- Publication Ethics ·
- Reviewers Acknowledgment ·
- Submit a Manuscript ·
- Subscription Information ·
- Table of Contents
Advances in High Energy Physics
Volume 2011 (2011), Article ID 217035, 12 pages
A Simple Introduction to Gröbner Basis Methods in String Phenomenology
Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford OX1 3NP, UK
Received 3 February 2011; Accepted 13 February 2011
Academic Editor: Yang-Hui He
Copyright © 2011 James Gray. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
I give an elementary introduction to the key algorithm used in recent applications of computational algebraic geometry to the subject of string phenomenology. I begin with a simple description of the algorithm itself and then give 3 examples of its use in physics. I describe how it can be used to obtain constraints on flux parameters, how it can simplify the equations describing vacua in 4D string models, and lastly how it can be used to compute the vacuum space of the electroweak sector of the MSSM.
There is currently a great deal of interest in applying the methods of computational algebraic geometry to string phenomenology and closely related subfields of theoretical physics. For some examples of recent work see [1, 2, 6–8, 11–18, 21, 22] and references therein. These papers utilise advances in algorithmic techniques in commutative algebra to study a wide range of subjects including various aspects of globally supersymmetric gauge theory [1, 2, 6–8], finding flux vacua in string phenomenology [10–16], studying heterotic model building on smooth Calabi-Yau in non-standard embeddings [17, 18], and more besides [19–23].
Despite the wide range of physical problems which have been addressed within this context, the computational tools which are being used are all based, finally, on the same algorithm. The Buchberger algorithm [24, 25] is at once what lends these methods their power and also the rate limiting step-placing bounds on the size of problem that can be dealt with. The recent burst of activity in this field has been fueled, in part, by the advent of freely available, efficient implementations of this algorithm [26, 27]. There are also interfaces available between the commutative algebra program  and Mathematica [11–14, 28], with [11–14] being particularly geared towards physicist's needs. The aim of this paper is to give an elementary introduction to the Buchberger algorithm and some of its recent applications.
In order to give an idea of how one simple algorithm can make so much possible, I will present the Buchberger algorithm and then show how it may be applied to physics in 3 elementary examples. Firstly, I will describe how it can be used to obtain constraints on the flux parameters in four-dimensional descriptions of string phenomenological models which are necessary and sufficient for the existence of certain types of vacuum [11–14]. Secondly, I will describe how the Buchberger algorithm can be used to simplify the equations describing the vacua of such systems making problems of finding minima much more tractable [11–14]. Finally, I will describe how the same simple algorithm can be used to calculate the supersymmetric vacuum space geometry of the electroweak sector of the MSSM [1, 2].
The remainder of this paper is structured as follows. In Section 2, I take a few pages to explain the algorithm and the few mathematical concepts that we will require. In the three sections following that, I then describe the three examples mentioned above. I will conclude by making a few final comments about the versatility and scaling of the Buchberger algorithm.
2. A Tiny Bit of Commutative Algebra
Two pages of simple mathematics will suffice to achieve all of the physical goals mentioned in the introduction. First of all we define the notion of a polynomial ring. In this paper we will call the fields of the physical systems we study and any parameters present, such as flux parameters, . The polynomial rings and are then simply the infinite set of all polynomials in the fields and parameters and the infinite set of all polynomials in the parameters, respectively.
Another mathematical concept we will require is that of a monomial ordering. This is simply an unambiguous way of stating whether any given monomial is formally bigger than any other given monomial. We may denote this in a particular case by saying , where , are monomials in the fields and parameters. It is important to say what is not meant by this. We are not saying that we are taking values of the variables such that the monomial is numerically larger than the monomial . We are rather saying that, in our formal ordering, is considered to come before .
For our purposes we will require a special type of monomial ordering called an elimination ordering. This means that our formal ordering of monomials has the following property: In words this just says that if the largest monomial in according to our ordering, , does not depend on , then does not depend on the fields at all. The monomial ordering classes all monomials with fields in them as being bigger than all of those without such constituents.
Given this notion of monomial orderings, we can now present the one algorithm we will need to use—the Buchberger algorithm [24, 25]. The Buchberger algorithm takes as its input a set of polynomials. These may be thought of as a system of polynomial equations by the simple expedient of setting all of the polynomials to zero. The algorithm returns a new set of polynomials which, when thought of as a system of equations in the same way, has the same solution set as the input. The output system, however, has several additional useful properties as we will see.
The Buchberger Algorithm
(1)Start with a set of polynomials call this set . (2)Choose a monomial ordering with the elimination property described above. (3)For any pair of polynomials , , multiply by monomials, and form a difference so as to cancel the leading monomials with respect to the monomial ordering: (4) Perform polynomial long division of with respect to ; that is, form , where is a monomial and such that cancels a monomial in . Repeat until no further reduction is possible. Call the result . (5) If , then consider the next pair. If , then add to and return to step (3).
The algorithm terminates when all S-polynomials which may be formed reduce to 0. The final set of polynomials is called a Gröbner basis.
As mentioned above, the resulting set of polynomials has several nice properties. The feature which is often taken as defining is that polynomial long division with respect to this new set of polynomials always gives the same answer—it does not matter in which order we divide the polynomials out by.
For us, however, the important point about our Gröbner basis is that it has what is called the elimination property. The set of all polynomials in which depend only upon the parameters, , gives a complete set of equations on the which are necessary and sufficient for the existence of a solution to the set of equations we started with. The reason why this is so is actually very straightforward. Our elimination ordering says that any monomial with a field in it is greater than any monomial only made up of parameters. Looking back at step (3) of the Buchberger algorithm we see that we are repeatedly canceling off the leading terms of our polynomials—those containing the fields—as much as we can. Thus, if it is possible to rearrange our initial equations to get expressions which do not depend upon the fields , then the Buchberger algorithm will do this for us. Clearly, while we have interpreted the as parameters and the as fields in the above, as this is what we will require for Section 3, this was not necessary. The Buchberger algorithm can be used to eliminate any unwanted set of variables from a problem, in the manner we have described.
This completes all of the mathematics that we will need for our entire discussion, and we may now move on to apply what we have learnt.
The first physical question we wish to answer is the following. Given a four-dimensional supergravity describing a flux compactification, what are the constraints on the flux parameters which are necessary and sufficient for the existence of a particular kind of vacuum? This question can be asked, and answered [11–14], for any kind of vacuum, but in the interests of concreteness and brevity let us restrict ourselves to the simple case of supersymmetric Minkowski vacua.
Here is the superpotential of a typical system, taken from . It describes a nongeometric compactification of type IIB string theory This system has some known constraints on its parameters which are necessary for the existence of a permissible vacuum. These come from, for example, tadpole cancellation conditions:
We also have the same constraints with the hats and checks switched around. In this example the fields, which we have been calling , are , , and , and everything else is a “flux” parameter, or an in our notation.
In total, the equations which must be satisfied if a supersymmetric Minkowski vacuum is to exist are , , , , and the constraints on the flux parameters given above. To extract a set of constraints solely involving the parameters which are necessary and sufficient for the existence of a solution to these equations, we simply follow the procedure outlined in the previous section.
The reader will note that the result is a somewhat lengthy set of equations. In principle one has to find quantized solutions to these expressions, an obviously intractable Diophantine problem, and therefore it might be asked why this result is of any use. In fact, knowledge of such constraints on the flux parameters is hugely useful for a number of reasons.(i)Firstly, we note that, while the full result of this process is often complex, some of the constraints can give us simple information about the system. In the current case, for example, it can be seen that is required for the existence of a supersymmetric Minkowski vacuum. (ii)Secondly, if one is scanning over a range of flux parameters and trying to numerically solve the equations to find vacua, one can speed up one’s analysis by first substituting any given set of flux parameters into the constraints we have obtained. If the constraints are not satisfied, then vacua do not exist and there is no point in searching numerically for a solution. This turns what would be a time-consuming numerical process giving inconclusive results (no solution was found) into a quick analytic conclusion (no solution exists). (iii)Lastly, knowledge of such constraints can greatly speed up algebraic approaches to finding vacua such as those outlined in [11–14].
4. Simplifying Equations for Vacua
Another use for the mathematics we learnt in Section 2 is the so-called “splitting tools” used in work such as [11–14]. The physical idea here is simple. Consider trying to solve the equations to find the vacua, including those which spontaneously break supersymmetry, of some supergravity theory. These equations are often extremely complicated. One way of viewing why this is so is that the equations for the turning points of the potential contain a lot of information. They describe not only the isolated minima of the potential which are of interest but also lines of maxima, saddle points of various sorts, and so forth. A useful tool to have, therefore, would be an algorithm that takes such a system as an input and returns a whole series of separate sets of equations, each individually describing fewer turning points. Since each separate equation system would then contain less information, one might expect them to be easier to solve. It would be beneficial to choose a division of these equations which has physical interest. The choice we will make here, and which programs like Stringvacua implement [11–14], is to split up the equations for the turning points according to how they break supersymmetry—that is, according to which -terms vanish when evaluated on those loci.
The ability that packages such as Stringvacua have to split up equations in this manner is based upon the following splitting tool (see  for a nice set of more detailed notes on this kind of mathematical technique). Say that one of the -terms of our theory is called . Then we can split the equations describing turning points of the potential into two pieces:
The first of these expressions is a set of equations which is easier to solve, in general, than alone. We can use the -term to simplify the equations for the turning points of the potential. On the other hand, expression (4.2) is not even a set of equations—it contains an inequality. We can convert (4.2) into a system purely involving equalities by making use of the mathematics we learned in Section 2.
Consider the following set of equations, including a dummy variable :
The second equation in (4.3) has a solution if and only if , simply . If , then the equation reduces to which clearly has no solutions. Equations (4.3), then, have a solution whenever the set of equalities and inequalities (4.2) do. Unfortunately they also depend upon one extra, and unwanted, variable—. This is not a problem as we already know how to remove unwanted variables from our equations. We can simply eliminate them, as we did the fields in Section 2. This will leave us with a necessary and sufficient set of equations in and for a solution to (4.3) and thus to (4.2).
So we can split the equations for the turning points of our potential into two simpler systems. One describes the turning points of for which and the other, those for which . We can of course perform such a splitting many times—once for each -term! In fact, using additional techniques from algorithmic algebraic geometry [11–14, 31–33], which are essentially based upon the same trick, one can go much further. One can split the equations for the turning points up into component parts gaining one set of equations for every separate locus. Because we know which -terms are nonzero on each of them, these are classified according to how they break supersymmetry. The researcher interested in a certain type of breaking can therefore select the equations describing the vacua of interest and throw everything else away.
The above process of splitting up the equations for the vacua of a system can be very simply carried out in Stringvacua. Numerous examples can be found in the Mathematica help files which come with the package [11–14]. Here, let us consider the example of M-theory compactified on the coset . The Kähler and superpotential for this coset, which has structure, has been presented in  Even this, relatively simple, model results in a potential of considerable size. Defining and , we find
To find the turning points of this potential we naively need to take eight different derivatives of (4.5) and solve the resulting set of intercoupled equations in eight variables. This is clearly prohibitively difficult. Using the techniques described in this section, however, Stringvacua can separate off parts of the vacuum space for us with ease. Consider, for example, the vacua which are isolated in field space and for which the real parts of all of the -terms are nonzero, with the imaginary parts vanishing. To find these, the package tells us, we need only to solve the equations
Because they only describe a small subset of all of the turning points of the full potential, these equations are extremely simple in form and may be trivially solved. For this particular example the physically acceptable turning point that results is a saddle—something which can be readily ascertained once its location has been discovered.
5. Geometry of Vacuum Spaces
As a final example of what we can do with the simple techniques introduced in Section 2, we will show how to calculate the vacuum space of a globally supersymmetric gauge theory. It is a well-known result (see  and references therein) that the supersymmetric vacuum space of such a theory, with gauge group , can be described as the space of holomorphic gauge invariant operators (GIOs) built out of -flat field configurations. What does this space look like? Consider a space, the coordinates of which are identified with the GIOs of the theory. If there were no relations amongst the gauge invariant operators, then this space would be the vacuum space. However, there frequently are relations because of the way in which the GIOs are built out of the fields. For example, if we have three gauge invariant operators , , and which are built out of the fields as , , , then we have the relation . If we take these GIOs to be built out of the -flat field configurations, then there will be still further relations among them. The vacuum space of the theory is the subspace defined by the solutions of these equations describing relations amongst the gauge invariant operators, once -flatness has been taken into account.
How can we calculate such a thing? The holomorphic gauge invariant operators of a globally supersymmetric gauge theory are given in terms of the fields
Here are our GIOs, and the are the functions of the fields that define them. Let us write the -terms of the theory as . Consider the following set of equations: These equations have solutions whenever the are given by functions of the fields in the correct way and when those field configurations which are being used are -flat. However, according to the proceeding discussion, we wish to simply have equations in terms of the GIOs to describe our vacuum space. As in previous sections, we can eliminate the unwanted variables in our problem, in this case, the fields , using the algorithm of Section 2 to obtain the equations describing the vacuum space.
As a simple example, let us take the electroweak sector of the MSSM [1, 2] (with right-handed neutrinos). Given the field content of the left-handed leptons, , the right-handed leptons, and , and the two Higgs, and , one can build the elementary GIOs given in Table 1. The indices , run over the 3 flavours, and the indices label the fundamental of .
To compute the -terms we require the superpotential. Let us take the most general renormalizable form which is compatible with the symmetries of the theory and R-parity Here is the invariant tensor of and , , , and are constant coefficients.
We now just follow the procedure outlined at the begining of this section. We calculate the -terms by taking derivatives of the superpotential, we label the gauge invariant operators to , we form (5.2), and then we simply run the elimination algorithm given in Section 2.
The result is, upon simplification, given by six quadratic equations in 6 variables. It is a simple description of an affine version of a famous algebraic variety—the Veronese surface [1, 2]. What can be done with such a result? The first observation we can make is that this vacuum space is not a Calabi-Yau. This means, for example, that one can say definitively that it is not possible to engineer this theory by placing a single D3 brane on a singularity in a Calabi-Yau manifold, without having to get into any details of model building.
Secondly one can study such vacuum spaces in the hope of finding hints at the structure of the theory's higher energy origins. In the case we have studied in this section, for example, we can “projectivize” (pretend the GIOs are homogeneous coordinates on projective space rather than flat space coordinates) and study the Hodge diamond of the result. The structure of supersymmetric field theory tells us that this Hodge diamond should depend on 4 arbitrary integers, but there is nothing at low energies which prevents us from building theories with any such integers we like. Interestingly, in the case of electroweak theory, these integers are all zero or one:
Whether this structure is indeed a hint of some high energy antecedent or just a reflection of the simplicity of the theory is debatable. This example does, however, demonstrate the idea of searching for such evidence of new physics in vacuum space structure. We should also add here that similar techniques can be used to show that the vacuum space of SQCD is a Calabi-Yau [6–8].
6. Final Comments
To conclude we will make several points—one of which is a note of caution, with the rest being more optimistic. The first point which we will make is that we should be careful lest the above discussion makes the algorithm we have been describing sound like an all-powerful tool. There is, as ever, a catch. In this case it is the way the algorithm scales with the complexity of the problem. A “worst case” upper bound for the degree of the polynomials in a reduced Gröbner basis can be found in . If is the largest degree found in your original set of equations, then this bound is
where is the number of variables. This worst case bound is therefore scaling doubly exponentially in the number of degrees of freedom. These very high-degree polynomials are an indication that the problem is becoming very complex and thus computationally intensive. Despite this, physically useful cases can be analysed using this algorithm quickly, as demonstrated in this paper and in the references. This scaling does mean that one is not likely to gain much by putting one's problem on a much faster computer. One good point about (6.1) is that if you can find a way, using physical insight, to simplify the problem under study, then what you can achieve may improve doubly exponentially. Such a piece of physical insight was one of the keystones of the application of these methods to finding flux vacua [11–14].
We finish by commenting that the methods of computational commutative algebra which we have discussed here are extremely versatile. We have been able to perform three very different tasks simply utilizing one algorithm in a very simple manner. These methods are of great utility in problems taken from the literature, and their implementation in a user friendly way in Stringvacua means that they may be tried out on any given problem with very little expenditure of time and effort by the researcher. Many more techniques from the field of algorithmic commutative algebra could be applied to physical systems than those described here or indeed in the physics literature. We can therefore expect that this subject will only increase in importance in the future.
The author is funded by STFC and would like to thank the University of Pennsylvania for generous hospitality while some of this document was being written. In addition he would like to thank the organisers of the 2008 Vienna ESI workshop “Mathematical Challenges in String Phenomenology,” where the talk upon which these notes are based was first given. The author would like to offer heartfelt thanks to his collaborators on the various projects upon which this paper is based. These include Lara Anderson, Daniel Grayson, Amihay Hanany, Yang-Hui He, Anton Ilderton, Vishnu Jejjala, André Lukas, Noppadol Mekareeya, and Brent Nelson.
- J. Gray, Y. H. He, V. Jejjala, and B. D. Nelson, “Exploring the vacuum geometry of N = 1 gauge theories,” Nuclear Physics B, vol. 750, no. 1-2, pp. 1–27, 2006.
- J. Gray, Y. H. He, V. Jejjala, and B. D. Nelson, “Vacuum geometry and the search for new physics,” Physics Letters. Section B, vol. 638, no. 2-3, pp. 253–257, 2006.
- S. Benvenuti, BO. Feng, A. Hanany, and Y. H. He, “Counting BPS operators in gauge theories: quivers, syzygies and plethystics,” Journal of High Energy Physics, vol. 2007, no. 11, article 050, 2007.
- B. Feng, A. Hanany, and Y. H. He, “Counting gauge invariants: the plethystic program,” Journal of High Energy Physics, vol. 2007, no. 3, article no. 090, 2007.
- D. Forcella, A. Hanany, Y. -H. He, and A. Zaffaroni, “The master space of N = 1 gauge theories,” Journal of High Energy Physics, vol. 2008, no. 8, article 012, 2008.
- J. Gray, Y. -H. He, A. Hanany, N. Mekareeya, and V. Jejjala, “SQCD: A geometric aperçu,” Journal of High Energy Physics, vol. 2008, no. 5, article 099, 2008.
- A. Hanany and N. Mekareeya, “Counting gauge invariant operators in SQCD with classical gauge groups,” Journal of High Energy Physics, vol. 2008, no. 10, article 012, 2008.
- A. Hanany, N. Mekareeya, and G. Torri, “The Hilbert series of adjoint SQCD,” Nuclear Physics B, vol. 825, no. 1-2, pp. 52–97, 2010.
- F. Ferrari, “On the geometry of super Yang-Mills theories: phases and irreducible polynomials,” Journal of High Energy Physics, vol. 2009, no. 1, article no. 026, 2009.
- J. Distler and U. Varadarajan, “Random polynomials and the friendly landscape,” http://arxiv.org/abs/hep-th/0507090.
- J. Gray, Y. H. He, A. Ilderton, and A. Lukas, “STRINGVACUA. A Mathematica package for studying vacuum configurations in string phenomenology,” Computer Physics Communications, vol. 180, no. 1, pp. 107–119, 2009.
- J. Gray, Y. H. He, A. Ilderton, and A. Lukas, “A new method for finding vacua in string phenomenology,” Journal of High Energy Physics, vol. 2007, no. 7, article no. 023, 2007.
- J. Gray, Y. H. He, and A. Lukas, “Algorithmic algebraic geometry and flux vacua,” Journal of High Energy Physics, vol. 2006, no. 9, article no. 031, 2006.
- “The Stringvacua Mathematica package,” http://www-thphys.physics.ox.ac.uk/projects/Stringvacua/.
- A. Font, A. Guarino, and J. M. Moreno, “Algebras and non-geometric flux vacua,” Journal of High Energy Physics, vol. 2008, no. 12, article no. 050, 2008.
- A. Guarino and G. J. Weatherill, “Non-geometric flux vacua, s-duality and algebraic geometry,” Journal of High Energy Physics, vol. 2009, no. 2, article 042, 2009.
- L. Anderson, Y.-H. He, and A. Lukas, “Monad bundles in heterotic string compactifications,” Journal of High Energy Physics, vol. 2008, no. 7, article 104, 2008.
- L. B. Anderson, Y. H. He, and A. Lukas, “Heterotic compactification, an algorithmic approach,” Journal of High Energy Physics, vol. 2007, no. 7, article no. 049, 2007.
- P. Kaura and A. Misra, “On the existence of non-supersymmetric black hole attractors for two-parameter Calabi-Yau's and attractor equations,” Fortschritte der Physik, vol. 54, no. 12, pp. 1109–1141, 2006.
- S. Raby and A. Wingerter, “Can string theory predict the Weinberg angle?” Physical Review D, vol. 76, no. 8, Article ID 086006, 2007.
- V. Braun, T. Brelidze, M. R. Douglas, and B. A. Ovrut, “Calabi-Yau metrics for quotients and complete intersections,” Journal of High Energy Physics, vol. 2008, no. 5, article 080, 2008.
- V. Braun, T. Brelidze, M. R. Douglas, and B. A. Ovrut, “Eigenvalues and eigenfunctions of the scalar Laplace operator on Calabi-Yau manifolds,” Journal of High Energy Physics, vol. 2008, no. 7, article 120, 2008.
- P. Candelas and R. Davies, “New Calabi-Yau manifolds with small Hodge numbers,” Fortschritte der Physik, vol. 58, no. 4-5, pp. 383–466, 2010.
- B. Buchberger, An algorithm for finding the bases elements of the residue class ring modulo a zero dimensional polynomial ideal, Ph.D. thesis, University of Innsbruck, Austria, 1965.
- B. Buchberger, “An algorithmical criterion for the solvability of algebraic systems of equations,” Aequationes Mathematicae, vol. 4, no. 3, pp. 374–383, 1970 (German).
- D. Grayson and M. Stillman, “Macaulay 2, a software system for research in algebraic geometry,” http://www.math.uiuc.edu/Macaulay2/.
- G.-M. Greuel, G. Pfister, and H. Schönemann, “Singular: a computer algebra system for polynomial computations,” Centre for Computer Algebra, University of Kaiserslautern, 2001, http://www.singular.uni-kl.de/.
- M. Kauers and V. Levandovskyy, “Singular.m,” http://www.risc.uni-linz.ac.at/research/combinat/software/Singular/.
- J. Shelton, W. Taylor, and B. Wecht, “Nongeometric flux compactifications,” Journal of High Energy Physics, no. 10, pp. 2057–2080, 2005.
- M. Stillman, “Tools for computing primary decompositions and applications to ideals associated to Bayesian networks,” in Solving Polynomial Equations: Foundations, Algorithms, and Applications, A. Dickenstein and I. Z. Emiris, Eds., Springer, Berlin, Germany, 2005.
- P. Gianni, B. Trager, and G. Zacharias, “Gröbner bases and primary decomposition of polynomial ideals,” Journal of Symbolic Computation, vol. 6, pp. 149–167, 1988.
- D. Eisenbud, C. Huneke, and W. Vasconcelos, “Direct methods for primary decomposition,” Inventiones Mathematicae, vol. 110, no. 1, pp. 207–235, 1992.
- T. Shimoyama and K. Yokoyama, “Localization and primary decomposition of polynomial ideals,” Journal of Symbolic Computation, vol. 22, no. 3, pp. 247–277, 1996.
- A. Micu, E. Palti, and P. M. Saffin, “M-theory on seven-dimensional manifolds with SU(3) structure,” Journal of High Energy Physics, vol. 2006, no. 5, article 048, 2006.
- M. A. Luty and W. Taylor, “Varieties of vacua in classical supersymmetric gauge theories,” Physical Review D, vol. 53, no. 6, pp. 3399–3405, 1996.
- H. M. Möller and F. Mora, “Upper and lower bounds for the degree of Gröbner bases,” in Proceedings of the International Symposium on Symbolic and Algebraic Computation (EUROSAM '84), Lecture Notes in Comput. Sci. 174, pp. 172–183, Cambridge, UK, 1984.