About this Journal Submit a Manuscript Table of Contents
ISRN Algebra
VolumeΒ 2011Β (2011), Article IDΒ 247403, 11 pages
doi:10.5402/2011/247403
Research Article

A New Proof of the Existence of Free Lie Algebras and an Application

Dipartimento di Matematica, Università degli Studi di Bologna, Piazza di Porta San Donato 5, 40126 Bologna, Italy

Received 23 March 2011; Accepted 20 April 2011

Academic Editors: K.Β Dekimpe and J.Β Kakol

Copyright Β© 2011 Andrea Bonfiglioli and Roberta Fulci. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

The existence of free Lie algebras is usually derived as a consequence of the Poincaré-Birkhoff-Witt theorem. Moreover, in order to prove that (given a set 𝑋 and a field 𝕂 of characteristic zero) the Lie algebra β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) of the Lie polynomials in the letters of 𝑋 (over the field 𝕂 ) is a free Lie algebra generated by 𝑋 , all available proofs use the embedding of a Lie algebra 𝔀 into its enveloping algebra 𝒰 ( 𝔀 ) . The aim of this paper is to give a much simpler proof of the latter fact without the aid of the cited embedding nor of the Poincaré-Birkhoff-Witt theorem. As an application of our result and of a theorem due to Cartier (1956), we show the relationships existing between the theorem of Poincaré-Birkhoff-Witt, the theorem of Campbell-Baker-Hausdorff, and the existence of free Lie algebras.

1. Introduction

We begin with the main definition we will be concerned with in this paper. In the sequel, 𝕂 will denote a fixed field, and all linear objects (linear maps, algebras, spans, etc.) will be tacitly meant with respect to 𝕂 .

Definition 1.1. Let 𝑋 be any nonempty set. We say that 𝐿 is a free Lie algebra generated by 𝑋 if 𝐿 is a Lie algebra, and there exists a map πœ™ ∢ 𝑋 β†’ 𝐿 satisfying the following property. For every Lie algebra 𝔀 and for every map 𝑓 ∢ 𝑋 β†’ 𝔀 , there exists exactly one Lie algebra morphism 𝑓 πœ™ such that the diagram below commutes 247403.fig.00a(1.1) When 𝑋 βŠ‚ 𝐿 set-theoretically and the above property is satisfied with the inclusion map πœ„ ∢ 𝑋 β†ͺ 𝐿 replacing πœ™ , 𝐿 will be called a free Lie algebra over 𝑋 .
It is easy to see that a free Lie algebra generated by 𝑋 —if it exists—is unique up to (Lie algebra) isomorphism. The existence of a free Lie algebra generated by a set can be proved in a standard way (see [1, Chapitre II, Section  2, n.1] and [2, Section  0.2]). As for the existence of a free Lie algebra over a set, it is natural to expect that (given a field 𝕂 of characteristic zero) the Lie 𝕂 -algebra of the Lie polynomials in the letters of 𝑋 is a free Lie algebra over 𝑋 . To prove this fact, however, more profound results are required, as we will explain below.
In the sequel, if 𝑉 is a vector space, we denote by 𝒯 ( 𝑉 ) its tensor algebra and by β„’ ( 𝑉 ) the smallest Lie-subalgebra of 𝒯 ( 𝑉 ) containing 𝑉 . The aim of this paper is to prove the following result.

Theorem 1.2. Let 𝕂 be a field of characteristic zero, and let 𝕂 ⟨ 𝑋 ⟩ be the free vector space generated by 𝑋 , then β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) is a free L i e algebra over 𝑋 .

Classically, the above theorem is derived from the theorem of Poincaré, Birkhoff, and Witt (henceforth referred to as PBW), which we here recall for the reading convenience and to fix some notation.

Theorem of Poincaré-Birkhoff-Witt
Let 𝕂 be a field of characteristic zero, and let 𝔀 be a Lie algebra. We denote by 𝒰 ( 𝔀 ) the universal enveloping algebra of 𝔀 and by 1 the unit element of 𝒰 ( 𝔀 ) . Let πœ‹ denote the natural projection πœ‹ ∢ 𝒯 ( 𝔀 ) β†’ 𝒰 ( 𝔀 ) , and let us set πœ‡ ∢ = πœ‹ | 𝔀 . Suppose that 𝔀 is endowed with an indexed (linear) basis { π‘₯ 𝑖 } 𝑖 ∈ 𝐼 , where 𝐼 is totally ordered by the relation β‰Ό , then the following elements form a linear basis for 𝒰 ( 𝔀 ) : ξ€· π‘₯ 1 , πœ‡ 𝑖 1 ξ€Έ ξ€· π‘₯ β‹― πœ‡ 𝑖 𝑛 ξ€Έ , w h e r e 𝑛 ∈ β„• , 𝑖 1 , … , 𝑖 𝑛 ∈ 𝐼 , 𝑖 1 β‰Ό β‹― β‰Ό 𝑖 𝑛 . ( 1 . 2 )
As a by-product of this theorem, we have the following crucial result.

Corollary 1.3. The function πœ‡ ∢ 𝔀 β†’ 𝒰 ( 𝔀 ) is injective.

The above corollary is the essential tool one needs in order to prove Theorem 1.2. To the best of our knowledge, all the books using free Lie algebras prove Theorem 1.2 by making use of Theorem PBW (or, precisely, of Corollary 1.3); see Bourbaki [1, Chapitre II, Section 3, n.1, Théorème 1] (where it employed [3, Chapitre I, Section 2, n.7, Corollaire 3 du Théorème 1] which is the PBW Theorem); see Reutenauer [2, Theorem 0.5] (where the injectivity of 𝔀 β†ͺ 𝒰 ( 𝔀 ) is used); see also Hochschild [4, Chapter X, Section 2], Humphreys [5, Chapter V, Section 17.5], Jacobson [6, Chapter V, Section 4], and Varadarajan [7, Section 3.2].

For the sake of completeness, let us briefly recall the argument which uses PBW.

Lemma 1.4. Let 𝑉 be a vector space, then β„’ ( 𝑉 ) satisfies the following property: for every L i e algebra 𝔀 and for every linear map 𝐹 ∢ 𝑉 β†’ 𝔀 , there exists exactly one L i e algebra morphism 𝐹 ∢ β„’ ( 𝑉 ) β†’ 𝔀 prolonging 𝐹 .

Proof. The universal property of 𝒯 ( 𝑉 ) allows us to prove the existence of exactly one morphism of associative algebras  𝐹 such that the following diagram commutes: 247403.fig.00b(1.3) It is easily seen that  𝐹 ( 𝑑 ) ∈ πœ‡ ( 𝔀 ) for every 𝑑 ∈ β„’ ( 𝑉 ) , so that we can compose the restriction of  𝐹 to β„’ ( 𝑉 ) with πœ‡ βˆ’ 1 ∢ πœ‡ ( 𝔀 ) β†’ 𝔀 (the function πœ‡ βˆ’ 1 is well posed on πœ‡ ( 𝔀 ) thanks to Corollary 1.3). The map 𝐹 ∢ β„’ ( 𝑉 ) ⟢ 𝔀 , 𝐹 = πœ‡ βˆ’ 1 ∘ ξ‚€  𝐹 | β„’ ( 𝑉 )  ( 1 . 4 ) satisfies all the requirements in the statement of the lemma.

Let us now prove Theorem 1.2 with the aid of Lemma 1.4 above. Let 𝑋 be any set and let 𝔀 , 𝑓 be as in Definition 1.1. Let us denote by πœ™ the inclusion map 𝑋 β†ͺ β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) . We apply Lemma 1.4 with 𝑉 = 𝕂 ⟨ 𝑋 ⟩ and 𝐹 obtained by prolonging the function 𝑓 by linearity on 𝑉 , then the Lie algebra morphism 𝐹 ∢ β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) β†’ 𝔀 is easily seen to satisfy the requirements of 𝑓 πœ™ in Definition 1.1, and Theorem 1.2 is proved.

The aim of this paper is to provide a new proof of Theorem 1.2, independently of PBW and of Corollary 1.3. This will be done in Sections 2 and 3.

The remainder of the paper (Section 4) provides a motivation for the first part. The occasion to search for a proof of Theorem 1.2 which is alternative to the usual one arose when—in our studies in monograph [8]—we came across a nonstandard demonstration of Theorem PBW due to Cartier [9]. Cartier’s proof uses the Theorem of Campbell, Baker, and Hausdorff (CBH, shortly) in order to prove PBW. In its turn, CBH is usually proved by means of Theorem PBW (see, e.g., Bourbaki [1, Chapitre II, Section  3, n.1, Corollaire 2], Hochschild [4, Proposition  2.1], Jacobson [6, Theorem  9, Chapter V], and Serre [10, 11]). So Cartier shows that, beside the usual argument P B W ⟹ C B H , ( 1 . 5 ) surprisingly, the reverse path can be followed too.

In order to make this reverse path “ C B H β‡’ P B W ” fully consistent, it is necessary to provide a proof of CBH which is independent of PBW. (Seemingly, apart from Hausdorff's original argument [12], an algebraic proof of CBH independent of PBW was not available at the time of Cartier's paper [9], dated 1956. Twelve years later, Eichler [13] gave another proof, using only free Lie algebras, and then seven years after Eichler, Djoković [14] provided another one, using formal power series computations. More recently, Reutenauer [2, Section  3.4] has proved CBH with a rigorous algebraic formalization of the early ideas by Pascal, Baker, Campbell, and Hausdorff. See [8, Chapters 1, 4] for more details on all these topics.) Cartier provides this proof in [9], making use of free Lie algebras generated by a finite set. As long as the existence of free Lie algebras is again a consequence of PBW, clearly it cannot be exploited to prove Theorem PBW itself. Hence, in [9] a proof of the existence of free Lie algebras generated by finite sets is given, independently of PBW, and relying only on the ideas contained in the classical proof of the theorem of Dynkin, Specht, and Wever.

By making use of these ideas, we here provide a proof, with no prerequisites, of the existence of free Lie algebras over arbitrary sets, thus generalizing the result in [9]. We hope that a new proof, which is alternative to all those presented in books, is welcome, especially since it makes no use of such a deep result as the Theorem of Poincaré, Birkhoff, and Witt. As an application, we are able to highlight the full interdependence of PBW and CBH and the existence of free Lie algebras (see Section 4), which—to the best of our knowledge—has never been pointed out in the specialized literature so far (see Theorem 4.1 and Corollary 4.3).

2. The Free Lie Algebra Generated by a Set

In the present short section, included for the sake of completeness and to fix the notation used throughout, we recall an argument, which dates back to Bourbaki [1, Chapitre II, Section 2, n.1] (see also Reutenauer [2, Section 0.2]) proving directly the existence of free Lie algebras generated by a set.

For 𝑛 ∈ β„• , 𝑛 β‰₯ 1 , consider the set 𝑀 𝑛 ( 𝑋 ) (also denoted by 𝑀 𝑛 , for short) of all (roughly speaking) the noncommutative, nonassociative words of length 𝑛 on the elements of 𝑋 . The 𝑀 𝑛 s can be defined inductively by means of disjoint unions (denoted by ∐ ) of Cartesian products in the following way: 𝑀 1 ( 𝑋 ) ∢ = 𝑋 , 𝑀 2 ( 𝑋 ) ∢ = 𝑋 Γ— 𝑋 , 𝑀 3  𝑀 ( 𝑋 ) ∢ = ( 𝑋 Γ— 𝑋 ) Γ— 𝑋 𝑋 Γ— ( 𝑋 Γ— 𝑋 ) , 𝑛  ( 𝑋 ) ∢ = 𝑖 + 𝑗 = 𝑛 𝑀 𝑖 ( 𝑋 ) Γ— 𝑀 𝑗 ( 𝑋 ) , 𝑛 β‰₯ 2 . ( 2 . 1 ) Let ∐ 𝑀 ( 𝑋 ) ∢ = 𝑛 β‰₯ 1 𝑀 𝑛 ( 𝑋 ) . We can define a (noncommutative, nonassociative) binary operation on 𝑀 ( 𝑋 ) as follows. For any 𝑀 , 𝑀 β€² ∈ 𝑀 ( 𝑋 ) , say, 𝑀 ∈ 𝑀 𝑛 and 𝑀 β€² ∈ 𝑀 𝑛 ξ…ž , we denote by 𝑀 β‹… 𝑀 β€² the unique element of 𝑀 𝑛 + 𝑛 ξ…ž corresponding to ( 𝑀 , 𝑀 β€² ) in the canonical injections 𝑀 𝑛 Γ— 𝑀 𝑛 ξ…ž βŠ‚ 𝑀 𝑛 + 𝑛 ξ…ž βŠ‚ 𝑀 ( 𝑋 ) . This binary operation endows 𝑀 ( 𝑋 ) with the structure of a magma, called the free magma generated by 𝑋 . Let us now set L i b ( 𝑋 ) ∢ = 𝕂 ⟨ 𝑀 ( 𝑋 ) ⟩ , the free vector space generated by the free magma 𝑀 ( 𝑋 ) . (The free vector space generated by a set 𝑀 is here thought of as the 𝕂 -vector space of the functions from 𝑀 to 𝕂 vanishing outside a finite set; equivalently, it will be treated as the set of formal (finite) linear combinations of elements of 𝑀 , where 𝑀 is a basis for 𝕂 ⟨ 𝑀 ⟩ .) The canonical map 𝑀 ( 𝑋 ) β†ͺ L i b ( 𝑋 ) (sending 𝑀 ∈ 𝑀 ( 𝑋 ) to the characteristic function of { 𝑀 } in 𝑀 ( 𝑋 ) ) will be simply denoted by set inclusion. The operation on 𝑀 ( 𝑋 ) extends by linearity to an operation on L i b ( 𝑋 ) turning it into a nonassociative algebra, called the free (nonassociative) algebra generated by 𝑋 . We have the following result, whose proof is standard and hence omitted.

Lemma 2.1. Let 𝑋 be a set, then, for every 𝕂 -algebra 𝐴 and every function 𝑓 ∢ 𝑋 β†’ 𝐴 , there exists a unique algebra morphism 𝑓 ∢ L i b ( 𝑋 ) β†’ 𝐴 prolonging 𝑓 .

Given an (not necessarily associative) algebra ( 𝑀 , βˆ— ) , we say that 𝑆 βŠ† 𝑀 is a (two-sided) magma ideal in 𝑀 if 𝑆 is a vector subspace of 𝑀 such that 𝑠 βˆ— π‘š and π‘š βˆ— 𝑠 belong to 𝑆 , for every 𝑠 ∈ 𝑆 and every π‘š ∈ 𝑀 . Moreover, if 𝐻 βŠ† 𝑀 is any set, the smallest (two-sided) magma ideal in 𝑀 containing 𝐻 will be said to be the magma ideal generated by 𝐻 in 𝑀 .

Let us now consider the algebra L i b ( 𝑋 ) and 𝐴 ∢ = { 𝑄 ( π‘Ž ) = π‘Ž β‹… π‘Ž , 𝐽 ( π‘Ž , 𝑏 , 𝑐 ) = π‘Ž β‹… ( 𝑏 β‹… 𝑐 ) + 𝑏 β‹… ( 𝑐 β‹… π‘Ž ) + 𝑐 β‹… ( π‘Ž β‹… 𝑏 ) ∣ π‘Ž , 𝑏 , 𝑐 ∈ L i b ( 𝑋 ) } . ( 2 . 2 ) We henceforth denote by π”ž the magma ideal generated by 𝐴 in L i b ( 𝑋 ) . We next consider the quotient vector space L i e ( 𝑋 ) ∢ = L i b ( 𝑋 ) / π”ž and the natural projection πœ‹ ∢ L i b ( 𝑋 ) β†’ L i e ( 𝑋 ) . Then the map [ πœ‹ ] L i e ( 𝑋 ) Γ— L i e ( 𝑋 ) ⟢ L i e ( 𝑋 ) , ( πœ‹ ( π‘Ž ) , πœ‹ ( 𝑏 ) ) ⟼ ( π‘Ž ) , πœ‹ ( 𝑏 ) ∢ = πœ‹ ( π‘Ž β‹… 𝑏 ) , π‘Ž , 𝑏 ∈ L i b ( 𝑋 ) ( 2 . 3 ) is well posed, and it endows L i e ( 𝑋 ) with a 𝕂 -algebra structure, which turns out to be a Lie algebra over 𝕂 (see the very definition of 𝐴 ).

Proposition 2.2. Let 𝑋 be any set and, with the above notation, let us set πœ‘ ∢ = πœ‹ | 𝑋 ∢ 𝑋 β†’ L i e ( 𝑋 ) then (1) L i e ( 𝑋 ) , together with the map πœ‘ , is a free L i e algebra generated by 𝑋 (according to Definition 1.1);(2)the set { πœ‘ ( π‘₯ ) } π‘₯ ∈ 𝑋 is independent in L i e ( 𝑋 ) , whence πœ‘ is injective; (3)the set πœ‘ ( 𝑋 )    L i e -generates L i e ( 𝑋 ) , that is, the smallest L i e subalgebra of L i e ( 𝑋 ) containing πœ‘ ( 𝑋 ) , coincides with L i e ( 𝑋 ) .

The proof of this proposition is derived by collecting together various results appearing in [1, Chapitre II, Section 2], with the additional care of transposing them to a nonassociative setting (see, e.g., [8, Theorem 2.54] for all the details).

3. The Isomorphism L i e ( 𝑋 ) ≃ β„’ ( 𝕂 ⟨ 𝑋 ⟩ )

From now on, we turn to prove Theorem 1.2 without using neither Theorem PBW nor Corollary 1.3. The arguments are more delicate than those in the preceding section.

We fix throughout a field 𝕂 of characteristic zero. We denote by β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) the smallest Lie subalgebra of 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) (the tensor algebra of the vector space 𝕂 ⟨ 𝑋 ⟩ ) containing 𝑋 . Let πœ‘ be as in Proposition 2.2. Being L i e ( 𝑋 ) a free Lie algebra generated by 𝑋 , there exists a unique Lie algebra morphism Ξ¦ ∢ L i e ( 𝑋 ) ⟢ β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) s u c h t h a t Ξ¦ ( πœ‘ ( π‘₯ ) ) = π‘₯ , f o r e v e r y π‘₯ ∈ X . ( 3 . 1 ) Our main task is to show that Ξ¦ is an isomorphism, without using PBW theorem. This will immediately prove that β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) is a free Lie algebra over 𝑋 , according to Definition 1.1. We will do this by means of some auxiliary functions.

Lemma 3.1. With the above notation, one has the grading ⨁ L i e ( 𝑋 ) = 𝑛 β‰₯ 1 𝐡 𝑛 , where 𝐡 1 = s p a n ( πœ‘ ( 𝑋 ) ) , and, for any 𝑛 β‰₯ 2 , 𝐡 𝑛 = ξ€Ί 𝐡 1 , 𝐡 𝑛 βˆ’ 1 ξ€» ξ€½ [ πœ‘ ] = s p a n ( π‘₯ ) , 𝑦 ∢ π‘₯ ∈ 𝑋 , 𝑦 ∈ 𝐡 𝑛 βˆ’ 1 ξ€Ύ . ( 3 . 2 )

Proof. It is easy to prove the grading ⨁ L i b ( 𝑋 ) = 𝑛 L i b 𝑛 ( 𝑋 ) , where L i b 𝑛 ( 𝑋 ) is the span of 𝑀 𝑛 ( 𝑋 ) . On the other hand, a simple argument shows that π”ž is also the magma ideal generated by the elements ξ€· 𝑀 β‹… 𝑀 , 𝑀 + 𝑀 ξ…ž ξ€Έ β‹… ξ€· 𝑀 + 𝑀 ξ…ž ξ€Έ βˆ’ 𝑀 β‹… 𝑀 βˆ’ 𝑀 ξ…ž β‹… 𝑀 ξ…ž , ξ€· 𝑀 𝑀 β‹… ξ…ž β‹… 𝑀 ξ…ž ξ…ž ξ€Έ + 𝑀 ξ…ž β‹… ξ€· 𝑀 ξ…ž ξ…ž ξ€Έ β‹… 𝑀 + 𝑀 ξ…ž ξ…ž β‹… ξ€· 𝑀 β‹… 𝑀 ξ…ž ξ€Έ , ( 3 . 3 ) with 𝑀 , 𝑀 ξ…ž , 𝑀 ξ…ž ξ…ž ∈ 𝑀 ( 𝑋 ) . As a consequence, we have an analogous grading ⨁ π”ž = 𝑛 π”ž 𝑛 , with π”ž 𝑛 βŠ† L i b 𝑛 ( 𝑋 ) , for every 𝑛 ∈ β„• . This gives ⨁ L i e ( 𝑋 ) = 𝑛 𝐢 𝑛 , where 𝐢 𝑛 = L i b 𝑛 ( 𝑋 ) / π”ž is spanned by the 𝑛 -degree commutators of the elements of πœ‘ ( 𝑋 ) (bracketing is taken in arbitrary order). In its turn, we obviously have 𝐢 𝑛 = 𝐡 𝑛 , where 𝐡 𝑛 is the span of the “nested” brackets ξ€Ί πœ‘ ξ€· π‘₯ 1 ξ€Έ β‹― ξ€Ί πœ‘ ξ€· π‘₯ 𝑛 βˆ’ 1 ξ€Έ ξ€· π‘₯ , πœ‘ 𝑛 β‹― ξ€» ξ€Έ ξ€» , f o r π‘₯ 1 , … , π‘₯ 𝑛 ∈ X . ( 3 . 4 ) Finally, it is a simple proof to check that [ 𝐡 𝑛 , 𝐡 π‘š ] βŠ† 𝐡 𝑛 + π‘š for every 𝑛 , π‘š .

Thanks to Lemma 3.1, the following auxiliary map is well posed:   𝛿 ∢ L i e ( 𝑋 ) ⟢ L i e ( 𝑋 ) , 𝛿 𝑛 𝑏 𝑛 ξƒͺ  ∢ = 𝑛 𝑛 𝑏 𝑛 ξ€· 𝑏 𝑛 ∈ 𝐡 𝑛 ξ€Έ , βˆ€ 𝑛 ∈ β„• . ( 3 . 5 ) In the remainder of the section, for any vector space 𝑉 , we denote by E n d ( 𝑉 ) the set of the endomorphisms of 𝑉 . By the universal properties of the free vector space 𝕂 ⟨ 𝑋 ⟩ and of tensor algebras, there exists a unique morphism of unital associative algebras πœƒ ∢ 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) β†’ E n d ( L i e ( 𝑋 ) ) , such that πœƒ ( π‘₯ ) = a d πœ‘ ( π‘₯ ) , for every π‘₯ ∈ 𝑋 .

Finally, there exists a unique linear map 𝑔 ∢ 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) β†’ L i e ( 𝑋 ) such that, for every π‘₯ 1 , … , π‘₯ π‘˜ ∈ 𝑋 and every π‘˜ β‰₯ 2 , it holds 𝑔 ( 1 𝕂 ) = 0 , 𝑔 ( π‘₯ 1 ) = πœ‘ ( π‘₯ 1 ) , and 𝑔 ( π‘₯ 1 βŠ— β‹― βŠ— π‘₯ π‘˜ ) = [ πœ‘ ( π‘₯ 1 ) β‹― [ πœ‘ ( π‘₯ π‘˜ βˆ’ 1 ) , πœ‘ ( π‘₯ π‘˜ ) ] ] .

The diagram below gives an idea of the setting we are working in 247403.fig.00c(3.6)

We are now ready for the following result (see also Cartier [9, Lemma 1, page 242], where finite 𝑋 s are considered); part (c) of the lemma is the core of a result due to Dynkin [15].

In the following statement, we denote by 𝒯 + ( 𝕂 ⟨ 𝑋 ⟩ ) the ideal of 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) whose elements have vanishing zero-degree component.

Lemma 3.2. With all the above notation, one has the following: (a) 𝛿 is a derivation of the L i e algebra L i e ( 𝑋 ) and 𝛿 ∘ πœ‘ ≑ πœ‘ on 𝑋 , (b) 𝑔 ( π‘₯ 𝑦 ) = πœƒ ( π‘₯ ) ( 𝑔 ( 𝑦 ) ) , for every π‘₯ ∈ 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) , 𝑦 ∈ 𝒯 + ( 𝕂 ⟨ 𝑋 ⟩ ) , (c) πœƒ ∘ Ξ¦ ≑ a d on L i e ( 𝑋 ) , that is, πœƒ ( Ξ¦ ( πœ‰ ) ) ( πœ‚ ) = [ πœ‰ , πœ‚ ] , for every πœ‰ , πœ‚ ∈ L i e ( 𝑋 ) ,(d) 𝑔 ∘ Ξ¦ ≑ 𝛿 on L i e ( 𝑋 ) .

Proof. (a) Take elements ⨁ 𝑑 , 𝑑 β€² ∈ L i e ( 𝑋 ) = 𝑛 β‰₯ 1 𝐡 𝑛 (see Lemma 3.1), βˆ‘ 𝑑 = 𝑛 𝑑 𝑛 , 𝑑 ξ…ž = βˆ‘ 𝑛 𝑑 ξ…ž 𝑛 , with 𝑑 𝑛 , 𝑑 ξ…ž 𝑛 ∈ 𝐡 𝑛 for every 𝑛 β‰₯ 1 , and 𝑑 𝑛 , 𝑑 ξ…ž 𝑛 are definitively equal to 0, then we have 𝛿 ξ€· ξ€Ί 𝑑 , 𝑑 ξ…ž  ξ€» ξ€Έ = 𝛿 𝑛 β‰₯ 1  𝑖 + 𝑗 = 𝑛 ξ€Ί 𝑑 𝑖 , 𝑑 ξ…ž 𝑗 ξ€» =  𝑛 β‰₯ 1 𝑛  𝑖 + 𝑗 = 𝑛 ξ€Ί 𝑑 𝑖 , 𝑑 ξ…ž 𝑗 ξ€» =  𝑛 β‰₯ 1  𝑖 + 𝑗 = 𝑛 ( ξ€Ί 𝑑 𝑖 + 𝑗 ) 𝑖 , 𝑑 ξ…ž 𝑗 ξ€» =  𝑛 β‰₯ 1  𝑖 + 𝑗 = 𝑛 ξ€Ί 𝑖 𝑑 𝑖 , 𝑑 ξ…ž 𝑗 ξ€» +  𝑛 β‰₯ 1  𝑖 + 𝑗 = 𝑛 ξ€Ί 𝑑 𝑖 , 𝑗 𝑑 ξ…ž 𝑗 ξ€» =   𝑖 𝑖 𝑑 𝑖 ,  𝑗 𝑑 ξ…ž 𝑗 ξƒ­ +   𝑖 𝑑 𝑖 ,  𝑗 𝑗 𝑑 ξ…ž 𝑗 ξƒ­ = ξ€Ί 𝛿 ( 𝑑 ) , 𝑑 ξ…ž ξ€» + ξ€Ί ξ€· 𝑑 𝑑 , 𝛿 ξ…ž . ξ€Έ ξ€» ( 3 . 7 ) Moreover, from the definition of the 𝐡 𝑛 s in Lemma 3.1, we have πœ‘ ( 𝑋 ) βŠ‚ 𝐡 1 , so 𝛿 ( πœ‘ ( π‘₯ ) ) = πœ‘ ( π‘₯ ) , f o r e v e r y π‘₯ ∈ X . ( 3 . 8 )
 (b) An inductive argument: if π‘₯ = π‘˜ ∈ 𝕂 = 𝒯 0 ( 𝑉 ) , (b) is trivially true, 𝑔 ( π‘˜ 𝑦 ) = π‘˜ 𝑔 ( 𝑦 ) (since 𝑔 is linear) and πœƒ ( π‘˜ ) ( 𝑔 ( 𝑦 ) ) = π‘˜ I d L i e ( 𝑋 ) ( 𝑔 ( 𝑦 ) ) = π‘˜ 𝑔 ( 𝑦 ) . ( 3 . 9 ) Thus, we are left to prove (b) when both π‘₯ , 𝑦 belong to 𝒯 + ( 𝕂 ⟨ 𝑋 ⟩ ) ; moreover, by linearity, we can assume without loss of generality that π‘₯ = 𝑣 1 βŠ— β‹― βŠ— 𝑣 π‘˜ and 𝑦 = 𝑀 1 βŠ— β‹― βŠ— 𝑀 β„Ž with β„Ž , π‘˜ β‰₯ 1 , and the 𝑣 s and 𝑀 s are elements of 𝑋 , 𝑔 ξ€· 𝑣 ( π‘₯ 𝑦 ) = 𝑔 1 βŠ— β‹― βŠ— 𝑣 π‘˜ βŠ— 𝑀 1 βŠ— β‹― βŠ— 𝑀 β„Ž ξ€Έ ξ€· πœ‘ ξ€· 𝑣 = a d 1 ξ€· πœ‘ ξ€· 𝑣 ξ€Έ ξ€Έ ∘ β‹― ∘ a d π‘˜ πœ‘ ξ€· 𝑀 ξ€Έ ξ€Έ ξ€· ξ€Ί 1 ξ€Έ ξ€Ί πœ‘ ξ€· 𝑀 , … β„Ž βˆ’ 1 ξ€Έ ξ€· 𝑀 , πœ‘ β„Ž … ξ€· 𝑣 ξ€Έ ξ€» ξ€» ξ€Έ = πœƒ 1 βŠ— β‹― βŠ— 𝑣 π‘˜ 𝑔 ξ€· 𝑀 ξ€Έ ξ€· 1 βŠ— β‹― βŠ— 𝑀 β„Ž ξ€Έ ξ€Έ = πœƒ ( π‘₯ ) ( 𝑔 ( 𝑦 ) ) . ( 3 . 1 0 )
 (c) Let πœ‰ ∈ L i e ( 𝑋 , then (c) follows if we show that πœƒ ( Ξ¦ ( πœ‰ ) ) and a d ( πœ‰ ) coincide (note that they are both endomorphisms of L i e ( 𝑋 ) ). In its turn, this is equivalent to the identity of πœƒ ∘ Ξ¦ with the map a d ∢ L i e ( 𝑋 ) ⟢ E n d ( L i e ( 𝑋 ) ) , πœ‰ ⟼ a d ( πœ‰ ) . ( 3 . 1 1 ) Now observe that both a d and πœƒ ∘ Ξ¦ are Lie algebra morphisms (indeed, Ξ¦ is a Lie algebra morphism by construction, and πœƒ is a Lie algebra morphism since it is a unital associative algebra morphism). Hence, the equality of a d and πœƒ ∘ Ξ¦ follows if we prove that they are equal on a system of Lie generators for L i e ( 𝑋 ) , namely, on πœ‘ ( 𝑋 ) (recall Proposition 2.2-(3.8)), for every π‘₯ ∈ 𝑋 , we indeed have ( πœƒ ∘ Ξ¦ ) ( πœ‘ ( π‘₯ ) ) = πœƒ ( π‘₯ ) = a d ( πœ‘ ( π‘₯ ) ) . ( 3 . 1 2 )
 (d) It suffices to show that 𝑔 ∘ Ξ¦ is a derivation of L i e ( 𝑋 ) ; we have [ ] [ ] = [ ] + [ ] . ( 𝑔 ∘ Ξ¦ ) πœ‰ , πœ‚ = 𝑔 ( Ξ¦ ( πœ‰ ) , Ξ¦ ( πœ‚ ) ) = 𝑔 ( Ξ¦ ( πœ‰ ) Ξ¦ ( πœ‚ ) βˆ’ Ξ¦ ( πœ‚ ) Ξ¦ ( πœ‰ ) ) = πœƒ ( Ξ¦ ( πœ‰ ) ) 𝑔 ( Ξ¦ ( πœ‚ ) ) βˆ’ πœƒ ( Ξ¦ ( πœ‚ ) ) 𝑔 ( Ξ¦ ( πœ‰ ) ) πœ‰ , 𝑔 ( Ξ¦ ( πœ‚ ) ) 𝑔 ( Ξ¦ ( πœ‰ ) ) , πœ‚ ( 3 . 1 3 )

In the third and fourth equalities, we used (b) and (c), respectively. Since 𝑔 ∘ Ξ¦ and 𝛿 are derivations of L i e ( 𝑋 ) coinciding on πœ‘ ( 𝑋 ) , (d) follows.

Theorem 3.3. Let 𝕂 be field of characteristic zero.
If Ξ¦ is as in (3.1), Ξ¦ is an isomorphism of L i e algebras.

Proof. From the very definition of 𝛿 and the fact that 𝕂 has characteristic zero, it follows that 𝛿 is injective.
From the identity 𝑔 ∘ Ξ¦ ≑ 𝛿 (see (d) in Lemma 3.2) and the injectivity of 𝛿 , we immediately infer the injectivity of Ξ¦ . Since Ξ¦ is clearly surjective (indeed, L i e ( 𝑋 ) and β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) are Lie generated by πœ‘ ( 𝑋 ) and 𝑋 , respectively), the theorem is proved.

In order to explicitly show the relationship of the arguments we employed to prove Theorem 1.2 with the so-called Theorem of Dynkin, Specht, and Wever, we seize the opportunity to show that the latter is implicitly contained in Theorem 3.3 and Lemma 3.2. (See e.g., ([4], [18], Proposition  2.2), ([6], [20], Chapter  V, Section  4, Theorem  8), and ([2], [26], Theorem  1.4), ([10, 11], [29], Chapter  IV, Section  8, LA  4.15). For the original proofs, see [1517].) Despite its well-known importance, we state it as a corollary of the former results.

Corollary 3.4 (Dynkin, Specht, and Wever). Let 𝕂 be a field of characteristic zero. Consider the linear map 𝑃 ∢ 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) β†’ β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) such that 𝑃 ξ€· 1 𝕂 ξ€Έ ξ€· π‘₯ = 0 , 𝑃 1 ξ€Έ = π‘₯ 1 , 𝑃 ξ€· π‘₯ 1 βŠ— β‹― βŠ— π‘₯ π‘˜ ξ€Έ = π‘˜ βˆ’ 1 ξ€Ί π‘₯ 1 ξ€Ί π‘₯ , … π‘˜ βˆ’ 1 , π‘₯ π‘˜ ξ€» … ξ€» , βˆ€ π‘˜ β‰₯ 2 , ( 3 . 1 4 ) for any π‘₯ 1 , … , π‘₯ π‘˜ ∈ 𝑋 . Then 𝑃 is a projection, that is, 𝑃 is surjective and it is the identity on β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) .
Consequently, one has the following characterization of L i e elements: β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) = { 𝑑 ∈ 𝒯 ( 𝕂 ⟨ 𝑋 ⟩ ) ∣ 𝑃 ( 𝑑 ) = 𝑑 } . ( 3 . 1 5 )

Proof. The well-posedness and surjectivity of 𝑃 are obvious. To prove that 𝑃 is the identity on β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) , it suffices to test it on a homogeneous bracket, say 𝑑 = [ π‘₯ 1 , … , [ π‘₯ π‘˜ βˆ’ 1 , π‘₯ π‘˜ ] … ] , with π‘˜ ∈ β„• and π‘₯ 1 , … , π‘₯ π‘˜ ∈ 𝑋 . It is clear that it holds 𝑃 ( 𝑑 ) = π‘˜ βˆ’ 1 ( Ξ¦ ∘ 𝑔 ) ( 𝑑 ) . As a consequence, being Ξ¦ invertible (by Theorem 3.3), 𝑃 ( 𝑑 ) = π‘˜ βˆ’ 1 ξ€· Ξ¦ ∘ 𝑔 ∘ Ξ¦ ∘ Ξ¦ βˆ’ 1 ξ€Έ ( 𝑑 ) = π‘˜ βˆ’ 1 ξ€· Ξ¦ ∘ 𝛿 ∘ Ξ¦ βˆ’ 1 ξ€Έ ( 𝑑 ) = π‘˜ βˆ’ 1 Ξ¦ ξ€· π‘˜ Ξ¦ βˆ’ 1 ξ€Έ ( 𝑑 ) = 𝑑 . ( 3 . 1 6 ) In the second equality, we used (d) of Lemma 3.2; in the third, we used the definition of 𝛿 and the fact that Ξ¦ βˆ’ 1 ( 𝑑 ) = [ πœ‘ ( π‘₯ 1 ) , … , [ πœ‘ ( π‘₯ π‘˜ βˆ’ 1 ) , πœ‘ ( π‘₯ π‘˜ ) ] … ] ∈ 𝐡 π‘˜ .

4. The Relationship between the Theorems of PBW and of CBH

The deep intertwinement between PBW and CBH and the existence of free Lie algebras can be visualized in the following chain of equivalent statements.

Theorem 4.1. Consider the following six statements (all linear structures are understood to be over a field of characteristic zero): (a)the set { πœ‡ ( π‘₯ 𝑖 ) } 𝑖 ∈ 𝐼 is independent in 𝒰 ( 𝔀 ) ;(b)any L i e algebra 𝔀 can be embedded in its enveloping algebra 𝒰 ( 𝔀 ) ; (c)for every set 𝑋 , there exists a free L i e algebra over 𝑋 ;(d)for every finite set 𝑋 , there exists a free L i e algebra over 𝑋 ;(e)free L i e algebras over finite sets do exist, and the Campbell-Baker-Hausdorff theorem holds;(f)the Poincaré-Birkhoff-Witt theorem holds. Then these results can be proved each by the aid of the other in the following circular sequence: ( a ) ⟹ ( b ) ⟹ ( c ) ⟹ ( d ) ⟹ ( e ) ⟹ ( f ) ⟹ ( a ) . ( 4 . 1 )

Remark 4.2. Notice that statement (c) was proved in Section 3 above, without any prerequisite nor the aid of any of the other statements in Theorem 4.1. In particular, the isomorphism of L i e ( 𝑋 ) (the free Lie algebra generated by 𝑋 ) with β„’ ( 𝕂 ⟨ 𝑋 ⟩ ) (the Lie algebra of the Lie polynomials in the letters of 𝑋 ) can be proved independently of (b) and (f), as we announced.

Proof. ( a ) β‡’ ( b ) : this is obvious by the definition πœ‡ ( π‘₯ 𝑖 ) = πœ‹ ( π‘₯ 𝑖 ) ,
( b ) β‡’ ( c ) : this is the usual approach to the existence of free Lie algebras over a set recalled in the Introduction,
( c ) β‡’ ( d ) : this is obvious,
( d ) β‡’ ( e ) : the derivation of the Campbell-Baker-Hausdorff theorem from the existence of free Lie algebras over finite sets appears in Eichler's proof [13],
( e ) β‡’ ( f ) : the proof of PBW from CBH (with the joint use of free Lie algebras over finite sets) is contained in Cartier's paper [9],
( f ) β‡’ ( a ) : this is obvious.

As by-products of the previous theorem, we highlight the following corollaries, containing some probably unexpected facts.

Corollary 4.3. The following are consequences of Theorem 4.1. Again, all linear structures are on a field of characteristic zero. (a)The linear independence of the πœ‡ ( π‘₯ 𝑖 ) s is sufficient (besides being necessary) for the system in (1.2) to form a basis of 𝒰 ( 𝔀 ) . (b)The sole embedding 𝔀 β†ͺ 𝒰 ( 𝔀 ) proves the Poincaré-Birkhoff-Witt theorem. (c)The existence of free L i e algebras over finite sets is sufficient for the existence of all free L i e algebras over arbitrary sets. (d)The existence of free L i e algebras proves the Poincaré-Birkhoff-Witt theorem (not only the reverse fact is true).

Acknowledgment

The authors would like to thank F. Caselli for useful remarks on a former version of the paper.

References

  1. N. Bourbaki, β€œÉléments de Mathématique, Fasc. XXXVII: Groupes et Algèbres de Lie, Chap. II: Algèbres de Lie, Chap. III: Groupes de Lie,” in Actualités Scientifiques et Industrielles, vol. 1349, Hermann, Paris, France, 1972.
  2. C. Reutenauer, Free Lie Algebras, vol. 7 of London Mathematical Society Monographs, Clarendon Press, Oxford, UK, 1993.
  3. N. Bourbaki, β€œÉléments de Mathématique, Fasc. XXVI: Groupes et Algèbres de Lie, Chap. I: Algèbres de Lie,” in Actualités Scientifiques et Industrielles, vol. 1285, Hermann, Paris, France, 1960.
  4. G. P. Hochschild, The Structure of Lie Groups, Holden-Day, San Francisco, Calif, USA, 1965.
  5. J. E. Humphreys, Introduction to Lie Algebras and Representation Theory, vol. 9 of Graduate Texts in Mathematics, Springer, New York, NY, USA, 1980.
  6. N. Jacobson, Lie Algebras, vol. 10 of Interscience Tracts in Pure and Applied Mathematics, Interscience Publishers, John Wiley and Sons, New York, NY, USA, 1962.
  7. V. S. Varadarajan, Lie groups, Lie Algebras, and Their Representations, vol. 102 of Graduate Texts in Mathematics, Springer, New York, NY, USA, 1984.
  8. A. Bonfiglioli and R. Fulci, Topics in Noncommutative Algebra. The Theorem of Campbell, Baker, Hausdorff and Dynkin, Lecture Notes in Mathematics, Springer, Berlin, Germany, 2011.
  9. P. Cartier, β€œDémonstration algébrique de la formule de Hausdorff,” Bulletin de la Société Mathématique de France, vol. 84, pp. 241–249, 1956. View at Zentralblatt MATH
  10. J. P. Serre, Lie Algebras and Lie Groups, Lectures Given at Harvard University, W. A. Benjamin, New York, NY, USA, 1st edition, 1964.
  11. J. P. Serre, Lie Algebras and Lie Groups, vol. 1500 of Lecture Notes in Mathematics, Springer-Verlag, Berlin, Germany, 2nd edition, 1965.
  12. F. Hausdorff, β€œDie symbolische exponentialformel in der gruppentheorie,” Leipzig Berichte, vol. 58, pp. 19–48, 1906.
  13. M. Eichler, β€œA new proof of the Baker-Campbell-Hausdorff formula,” Journal of the Mathematical Society of Japan, vol. 20, pp. 23–25, 1968. View at Publisher Β· View at Google Scholar Β· View at Zentralblatt MATH
  14. D. Ž. Djoković, β€œAn elementary proof of the Baker-Campbell-Hausdorff-Dynkin formula,” Mathematische Zeitschrift, vol. 143, no. 3, pp. 209–211, 1975. View at Publisher Β· View at Google Scholar Β· View at Zentralblatt MATH
  15. E. B. Dynkin, β€œCalculation of the coefficients in the Campbell-Hausdorff formula,” Doklady Akademii Nauk SSSR, vol. 57, pp. 323–326, 1947.
  16. W. Specht, β€œDie linearen Beziehungen zwischen höheren Kommutatoren,” Mathematische Zeitschrift, vol. 51, pp. 367–376, 1948. View at Publisher Β· View at Google Scholar Β· View at Zentralblatt MATH
  17. F. Wever, β€œOperatoren in lieschen ringen,” Journal für die Reine und Angewandte Mathematik, vol. 187, pp. 44–55, 1949. View at Zentralblatt MATH