Misplaced Pages

Noether's second theorem

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Physics theorem for symmetries of action

In mathematics and theoretical physics, Noether's second theorem relates symmetries of an action functional with a system of differential equations. The theorem is named after its discoverer, Emmy Noether.

The action S of a physical system is an integral of a so-called Lagrangian function L, from which the system's behavior can be determined by the principle of least action. Specifically, the theorem says that if the action has an infinite-dimensional Lie algebra of infinitesimal symmetries parameterized linearly by k arbitrary functions and their derivatives up to order m, then the functional derivatives of L satisfy a system of k differential equations.

Noether's second theorem is sometimes used in gauge theory. Gauge theories are the basic elements of all modern field theories of physics, such as the prevailing Standard Model.

Mathematical formulation

First variation formula

Suppose that we have a dynamical system specified in terms of m {\textstyle m} independent variables x = ( x 1 , , x m ) {\textstyle x=(x^{1},\dots ,x^{m})} , n {\textstyle n} dependent variables u = ( u 1 , , u n ) {\textstyle u=(u^{1},\dots ,u^{n})} , and a Lagrangian function L ( x , u , u ( 1 ) , u ( r ) ) {\textstyle L(x,u,u_{(1)}\dots ,u_{(r)})} of some finite order r {\textstyle r} . Here u ( k ) = ( u i 1 . . . i k σ ) = ( d i 1 d i k u σ ) {\textstyle u_{(k)}=(u_{i_{1}...i_{k}}^{\sigma })=(d_{i_{1}}\dots d_{i_{k}}u^{\sigma })} is the collection of all k {\textstyle k} th order partial derivatives of the dependent variables. As a general rule, latin indices i , j , k , {\textstyle i,j,k,\dots } from the middle of the alphabet take the values 1 , , m {\textstyle 1,\dots ,m} , greek indices take the values 1 , , n {\textstyle 1,\dots ,n} , and the summation convention apply to them. Multiindex notation for the latin indices is also introduced as follows. A multiindex I {\textstyle I} of length k {\textstyle k} is an ordered list I = ( i 1 , , i k ) {\displaystyle I=(i_{1},\dots ,i_{k})} of k {\textstyle k} ordinary indices. The length is denoted as | I | = k {\textstyle \left|I\right|=k} . The summation convention does not directly apply to multiindices since the summation over lengths needs to be displayed explicitly, e.g. | I | = 0 r f I g I = f g + f i g i + f i j g i j + + f i 1 . . . i r g i 1 . . . i r . {\displaystyle \sum _{|I|=0}^{r}f_{I}g^{I}=fg+f_{i}g^{i}+f_{ij}g^{ij}+\dots +f_{i_{1}...i_{r}}g^{i_{1}...i_{r}}.} The variation of the Lagrangian with respect to an arbitrary variation δ u σ {\textstyle \delta u^{\sigma }} of the dependent variables is δ L = L u σ δ u σ + L u i σ δ u i σ + + L u i 1 . . . i r σ δ u i 1 . . . i r σ = | I | = 0 r L u I σ δ u I σ , {\displaystyle \delta L={\frac {\partial L}{\partial u^{\sigma }}}\delta u^{\sigma }+{\frac {\partial L}{\partial u_{i}^{\sigma }}}\delta u_{i}^{\sigma }+\dots +{\frac {\partial L}{\partial u_{i_{1}...i_{r}}^{\sigma }}}\delta u_{i_{1}...i_{r}}^{\sigma }=\sum _{|I|=0}^{r}{\frac {\partial L}{\partial u_{I}^{\sigma }}}\delta u_{I}^{\sigma },} and applying the inverse product rule of differentiation we get δ L = E σ δ u σ + d i ( | I | = 0 r 1 P σ i I δ u I σ ) {\displaystyle \delta L=E_{\sigma }\delta u^{\sigma }+d_{i}\left(\sum _{|I|=0}^{r-1}P_{\sigma }^{iI}\delta u_{I}^{\sigma }\right)} where E σ = L u σ d i L u i σ + + ( 1 ) r d i 1 d i r L u i 1 . . . i r σ = | I | = 0 r ( 1 ) | I | d I L u I σ {\displaystyle E_{\sigma }={\frac {\partial L}{\partial u^{\sigma }}}-d_{i}{\frac {\partial L}{\partial u_{i}^{\sigma }}}+\dots +(-1)^{r}d_{i_{1}}\dots d_{i_{r}}{\frac {\partial L}{\partial u_{i_{1}...i_{r}}^{\sigma }}}=\sum _{|I|=0}^{r}(-1)^{|I|}d_{I}{\frac {\partial L}{\partial u_{I}^{\sigma }}}} are the Euler-Lagrange expressions of the Lagrangian, and the coefficients P σ I {\textstyle P_{\sigma }^{I}} (Lagrangian momenta) are given by P σ I = | J | = 0 r | I | ( 1 ) | J | d J L u I J σ {\displaystyle P_{\sigma }^{I}=\sum _{|J|=0}^{r-|I|}(-1)^{|J|}d_{J}{\frac {\partial L}{\partial u_{IJ}^{\sigma }}}}

Variational symmetries

A variation δ u σ = X σ ( x , u , u ( 1 ) , ) {\textstyle \delta u^{\sigma }=X^{\sigma }(x,u,u_{(1)},\dots )} is an infinitesimal symmetry of the Lagrangian L {\textstyle L} if δ L = 0 {\textstyle \delta L=0} under this variation. It is an infinitesimal quasi-symmetry if there is a current K i = K i ( x , u , ) {\textstyle K^{i}=K^{i}(x,u,\dots )} such that δ L = d i K i {\textstyle \delta L=d_{i}K^{i}} .

It should be remarked that it is possible to extend infinitesimal (quasi-)symmetries by including variations with δ x i 0 {\displaystyle \delta x^{i}\neq 0} as well, i.e. the independent variables are also varied. However such symmetries can always be rewritten so that they act only on the dependent variables. Therefore, in the sequel we restrict to so-called vertical variations where δ x i = 0 {\displaystyle \delta x^{i}=0} .

For Noether's second theorem, we consider those variational symmetries (called gauge symmetries) which are parametrized linearly by a set of arbitrary functions and their derivatives. These variations have the generic form δ λ u σ = R a σ λ a + R a σ , i λ i a + + R a σ , i 1 . . . i s λ i 1 . . . i s a = | I | = 0 s R a σ , I λ I a , {\displaystyle \delta _{\lambda }u^{\sigma }=R_{a}^{\sigma }\lambda ^{a}+R_{a}^{\sigma ,i}\lambda _{i}^{a}+\dots +R_{a}^{\sigma ,i_{1}...i_{s}}\lambda _{i_{1}...i_{s}}^{a}=\sum _{|I|=0}^{s}R_{a}^{\sigma ,I}\lambda _{I}^{a},} where the coefficients R a σ , I {\displaystyle R_{a}^{\sigma ,I}} can depend on the independent and dependent variables as well as the derivatives of the latter up to some finite order, the λ a = λ a ( x ) {\displaystyle \lambda ^{a}=\lambda ^{a}(x)} are arbitrarily specifiable functions of the independent variables, and the latin indices a , b , {\displaystyle a,b,\dots } take the values 1 , , q {\displaystyle 1,\dots ,q} , where q {\displaystyle q} is some positive integer.

For these variations to be (exact, i.e. not quasi-) gauge symmetries of the Lagrangian, it is necessary that δ λ L = 0 {\displaystyle \delta _{\lambda }L=0} for all possible choices of the functions λ a ( x ) {\displaystyle \lambda ^{a}(x)} . If the variations are quasi-symmetries, it is then necessary that the current also depends linearly and differentially on the arbitrary functions, i.e. then δ λ L = d i K λ i {\displaystyle \delta _{\lambda }L=d_{i}K_{\lambda }^{i}} , where K λ i = K a i λ a + K a i , j λ j a + K a i , j 1 j 2 λ j 1 j 2 a {\displaystyle K_{\lambda }^{i}=K_{a}^{i}\lambda ^{a}+K_{a}^{i,j}\lambda _{j}^{a}+K_{a}^{i,j_{1}j_{2}}\lambda _{j_{1}j_{2}}^{a}\dots } For simplicity, we will assume that all gauge symmetries are exact symmetries, but the general case is handled similarly.

Noether's second theorem

The statement of Noether's second theorem is that whenever given a Lagrangian L {\textstyle L} as above that admits gauge symmetries δ λ u σ {\displaystyle \delta _{\lambda }u^{\sigma }} parametrized linearly by q {\displaystyle q} arbitrary functions and their derivatives, then there exist q {\displaystyle q} linear differential relations between the Euler-Lagrange equations of L {\textstyle L} .

Combining the first variation formula together with the fact that the variations δ λ u σ {\textstyle \delta _{\lambda }u^{\sigma }} are symmetries, we get 0 = E σ δ λ u σ + d i W λ i , W λ i = | I | = 0 r P σ i I δ λ u σ , {\displaystyle 0=E_{\sigma }\delta _{\lambda }u^{\sigma }+d_{i}W_{\lambda }^{i},\quad W_{\lambda }^{i}=\sum _{|I|=0}^{r}P_{\sigma }^{iI}\delta _{\lambda }u^{\sigma },} where on the first term proportional to the Euler-Lagrange expressions, further integrations by parts can be performed as E σ δ λ u σ = | I | = 0 s E σ R a σ , I λ I a = Q a λ a + d i ( | I | = 0 s 1 Q a i I λ I a ) , {\displaystyle E_{\sigma }\delta _{\lambda }u^{\sigma }=\sum _{|I|=0}^{s}E_{\sigma }R_{a}^{\sigma ,I}\lambda _{I}^{a}=Q_{a}\lambda ^{a}+d_{i}\left(\sum _{|I|=0}^{s-1}Q_{a}^{iI}\lambda _{I}^{a}\right),} where Q a I = | J | = 0 s | I | ( 1 ) | J | d J ( E σ R a σ , I J ) , {\displaystyle Q_{a}^{I}=\sum _{|J|=0}^{s-|I|}(-1)^{|J|}d_{J}\left(E_{\sigma }R_{a}^{\sigma ,IJ}\right),} in particular for | I | = 0 {\textstyle |I|=0} , Q a = E σ R a σ d i ( E σ R a σ , i ) + + ( 1 ) s d i 1 d i s ( E σ R a σ , i 1 . . . i s ) = | I | = 0 s ( 1 ) | I | d I ( E σ R a σ , I ) . {\displaystyle Q_{a}=E_{\sigma }R_{a}^{\sigma }-d_{i}\left(E_{\sigma }R_{a}^{\sigma ,i}\right)+\dots +(-1)^{s}d_{i_{1}}\dots d_{i_{s}}\left(E_{\sigma }R_{a}^{\sigma ,i_{1}...i_{s}}\right)=\sum _{|I|=0}^{s}(-1)^{|I|}d_{I}\left(E_{\sigma }R_{a}^{\sigma ,I}\right).} Hence, we have an off-shell relation 0 = Q a λ a + d i S λ i , {\displaystyle 0=Q_{a}\lambda ^{a}+d_{i}S_{\lambda }^{i},} where S λ i = H λ i + W λ i , {\textstyle S_{\lambda }^{i}=H_{\lambda }^{i}+W_{\lambda }^{i},} with H λ i = | I | = 0 s 1 Q a i I λ I a {\textstyle H_{\lambda }^{i}=\sum _{|I|=0}^{s-1}Q_{a}^{iI}\lambda _{I}^{a}} . This relation is valid for any choice of the gauge parameters λ a ( x ) {\textstyle \lambda ^{a}(x)} . Choosing them to be compactly supported, and integrating the relation over the manifold of independent variables, the integral total divergence terms vanishes due to Stokes' theorem. Then from the fundamental lemma of the calculus of variations, we obtain that Q a 0 {\displaystyle Q_{a}\equiv 0} identically as off-shell relations (in fact, since the Q a {\displaystyle Q_{a}} are linear in the Euler-Lagrange expressions, they necessarily vanish on-shell). Inserting this back into the initial equation, we also obtain the off-shell conservation law d i S λ i = 0 {\displaystyle d_{i}S_{\lambda }^{i}=0} .

The expressions Q a {\displaystyle Q_{a}} are differential in the Euler-Lagrange expressions, specifically we have Q a = D a [ E ] = | I | = 0 s ( 1 ) | I | d I ( E σ R a σ , I ) = | I | = 0 s F a σ , I d I E σ , {\displaystyle Q_{a}={\mathcal {D}}_{a}=\sum _{|I|=0}^{s}(-1)^{|I|}d_{I}\left(E_{\sigma }R_{a}^{\sigma ,I}\right)=\sum _{|I|=0}^{s}F_{a}^{\sigma ,I}d_{I}E_{\sigma },} where F a σ , I = | J | = 0 s | I | ( | I | + | J | | I | ) ( 1 ) | I | + | J | d J R a σ , I J . {\displaystyle F_{a}^{\sigma ,I}=\sum _{|J|=0}^{s-|I|}{\binom {|I|+|J|}{|I|}}(-1)^{|I|+|J|}d_{J}R_{a}^{\sigma ,IJ}.} Hence, the equations 0 = D a [ E ] {\displaystyle 0={\mathcal {D}}_{a}} are q {\textstyle q} differential relations to which the Euler-Lagrange expressions are subject to, and therefore the Euler-Lagrange equations of the system are not independent.

Converse result

A converse of the second Noether them can also be established. Specifically, suppose that the Euler-Lagrange expressions E σ {\displaystyle E_{\sigma }} of the system are subject to q {\displaystyle q} differential relations 0 = D a [ E ] = | I | = 0 s F a σ , I d I E σ . {\displaystyle 0={\mathcal {D}}_{a}=\sum _{|I|=0}^{s}F_{a}^{\sigma ,I}d_{I}E_{\sigma }.} Letting λ = ( λ 1 , , λ q ) {\textstyle \lambda =(\lambda ^{1},\dots ,\lambda ^{q})} be an arbitrary q {\textstyle q} -tuple of functions, the formal adjoint of the operator D a {\textstyle {\mathcal {D}}_{a}} acts on these functions through the formula E σ ( D + ) σ [ λ ] λ a D a [ E ] = d i B λ i , {\displaystyle E_{\sigma }({\mathcal {D}}^{+})^{\sigma }-\lambda ^{a}{\mathcal {D}}_{a}=d_{i}B_{\lambda }^{i},} which defines the adjoint operator ( D + ) σ {\displaystyle ({\mathcal {D}}^{+})^{\sigma }} uniquely. The coefficients of the adjoint operator are obtained through integration by parts as before, specifically ( D + ) σ [ λ ] = | I | = 0 s R a σ , I λ I a , {\displaystyle ({\mathcal {D}}^{+})^{\sigma }=\sum _{|I|=0}^{s}R_{a}^{\sigma ,I}\lambda _{I}^{a},} where R a σ , I = | J | = 0 s | I | ( 1 ) | I | + | J | ( | I | + | J | | I | ) d J F a σ , I J . {\displaystyle R_{a}^{\sigma ,I}=\sum _{|J|=0}^{s-|I|}(-1)^{|I|+|J|}{\binom {|I|+|J|}{|I|}}d_{J}F_{a}^{\sigma ,IJ}.} Then the definition of the adjoint operator together with the relations 0 = D a [ E ] {\displaystyle 0={\mathcal {D}}_{a}} state that for each q {\textstyle q} -tuple of functions λ {\displaystyle \lambda } , the value of the adjoint on the functions when contracted with the Euler-Lagrange expressions is a total divergence, viz. E σ ( D + ) σ [ λ ] = d i B λ i , {\displaystyle E_{\sigma }({\mathcal {D}}^{+})^{\sigma }=d_{i}B_{\lambda }^{i},} therefore if we define the variations δ λ u σ := ( D + ) σ [ λ ] = | I | = 0 s R a σ , I λ I a , {\displaystyle \delta _{\lambda }u^{\sigma }:=({\mathcal {D}}^{+})^{\sigma }=\sum _{|I|=0}^{s}R_{a}^{\sigma ,I}\lambda _{I}^{a},} the variation δ λ L = E σ δ λ u σ + d i W λ i = d i ( B λ i + W λ i ) {\displaystyle \delta _{\lambda }L=E_{\sigma }\delta _{\lambda }u^{\sigma }+d_{i}W_{\lambda }^{i}=d_{i}\left(B_{\lambda }^{i}+W_{\lambda }^{i}\right)} of the Lagrangian is a total divergence, hence the variations δ λ u σ {\textstyle \delta _{\lambda }u^{\sigma }} are quasi-symmetries for every value of the functions λ a {\displaystyle \lambda ^{a}} .

See also

Notes

  1. Noether, Emmy (1918), "Invariante Variationsprobleme", Nachr. D. König. Gesellsch. D. Wiss. Zu Göttingen, Math-phys. Klasse, 1918: 235–257
    Translated in Noether, Emmy (1971). "Invariant variation problems". Transport Theory and Statistical Physics. 1 (3): 186–207. arXiv:physics/0503066. Bibcode:1971TTSP....1..186N. doi:10.1080/00411457108231446. S2CID 119019843.

References

Further reading


Stub icon

This mathematical physics-related article is a stub. You can help Misplaced Pages by expanding it.

Stub icon

This article about theoretical physics is a stub. You can help Misplaced Pages by expanding it.

Categories: