Talk:Sturm-Liouville theory

Numbered equations

Hi Daniel,

I had to transform the templates provided in the WP article for numbered equations into plain old wikimarkup. I attempted to bring these templates over from WP, but they called many other templates and when I got them all over, the combined result didn't work. I decided to use a simple bit of html that defined a span with right justification and also defined an anchor. To reference the equation you then only need to insert a mediawiki markup referencing the anchor. It wouldn't be hard to turn this all into two templates if you think that would be useful. Dan Nessett 19:28, 2 September 2009 (UTC)

Redacted comment. I see it now.

Error

It seems to me that the article contains an error. Consider the definition:

Lu={1 \over w(x)}\left(-{d \over dx}\left[p(x){du \over dx}\right]+q(x)u\right)

Contrary to what is implied in the article, the operator L thus defined is not self-adjoint, unless 1/w(x) commutes with the operator to its right. This is in general not the case. The proper way to transform is (L in the next equation is w(x) times L in the previous equation):

Lu=\lambda \,w\;u\;\Longrightarrow \;w^{-1/2}Lw^{-1/2}w^{1/2}u=\lambda \,w^{1/2}u\;\Longrightarrow \;{\tilde {L}}\,{\tilde {u}}=\lambda \,{\tilde {u}}

with

{\tilde {L}}\equiv w^{-1/2}Lw^{-1/2}\quad {\hbox{and}}\quad {\tilde {u}}\equiv w^{1/2}u

Since w(x) is positive-definite w(x)^−½ is well-defined and real. The operator ${\tilde {L}}$ is self-adjoint.

--Paul Wormer 08:09, 14 October 2009 (UTC)

I am getting out of my field of expertise here, but let me ask some questions. First, the section defines L and then posits, " L gives rise to a self-adjoint operator." I am not sure what this means, but one way of interpreting it is L itself is not self-adjoint, but some transformation of it is. That is, L may not itself have real eigenvalues. This statement is followed by, "This can be seen formally by using integration by parts twice, where the boundary terms vanish by virtue of the boundary conditions. It then follows that the eigenvalues of a Sturm–Liouville operator are real and that eigenfunctions of L corresponding to different eigenvalues are orthogonal." This is, admittedly, vague. However, later in the section is the statement, "As a consequence of the Arzelà–Ascoli theorem this integral operator is compact and existence of a sequence of eigenvalues α_n which converge to 0 and eigenfunctions which form an orthonormal basis follows from the spectral theorem for compact operators." So, on the surface, it seems the integral operator plays some role in the argument.

That said, let me note that from the perspective of someone who has very little understanding of S-L theory, this section is obscure and would benefit from a complete rewrite. Dan Nessett 17:09, 14 October 2009 (UTC)

Paul, the factor 1 / w is cancelled by w in the definition of the inner product (I think). Peter Schmitt 19:07, 14 October 2009 (UTC)

[unindent]

Let me answer first Dan and then Peter. I forget about the real function q and define the operator Λ for real p(x)

\Lambda \;{\stackrel {\mathrm {def} }{=}}\;-{\frac {d}{dx}}p(x){\frac {d}{dx}}

The operator acts on a space of functions u, v, for which holds (integration with weight 1)

\int _{a}^{b}\;u(x){\frac {dv(x)}{dx}}\,dx=-\int _{a}^{b}\;{\frac {du(x)}{dx}}v(x)\,dx+\left[u(x)v(x)\right]_{a}^{b}=-\int _{a}^{b}\;{\frac {du(x)}{dx}}v(x)\,dx.

Here it was used that u and v satisfy periodic boundary conditions (are equal on the boundaries a and b). Then on the space of functions with this boundary condition the derivative is anti-Hermitian (anti-self-adjoint):

\langle u\;|\;{\frac {dv}{dx}}\rangle =-\langle {\frac {du}{dx}}\;|\;v\rangle \;\Longrightarrow \;{\frac {d}{dx}}^{\dagger }=-{\frac {d}{dx}}.

Apply the rule valid for arbitrary operators (capitals) and functions (lower case):

\left(AuB\right)^{\dagger }=B^{\dagger }{\bar {u}}A^{\dagger }

where the bar indicates complex conjugate. Then clearly, because p(x) is real, we have

\Lambda ^{\dagger }=-\left({\frac {d}{dx}}\right)^{\dagger }p(x)\left({\frac {d}{dx}}\right)^{\dagger }=\Lambda

so that Λ is Hermitian (self-adjoint). If we divide by real w on the left we get

\left({\frac {1}{w(x)}}\Lambda \right)^{\dagger }=\Lambda {\frac {1}{w(x)}}.

Unless Λ and w commute (and they don't) it is clear that the new operator is non-Hermitian.

Why do I integrate with weight 1 and not with weight w? This is because the original S-L problem has the form

\Lambda v(x)=\lambda w(x)v(x),\qquad \lambda \in \mathbb {R} .

Multiply by u(x) and integrate with weight 1

\int _{a}^{b}\;u(x)\Lambda v(x)dx=\lambda \int _{a}^{b}u(x)w(x)v(x)dx,

which is what we want. If we would introduce weight w it would appear squared on the right, which is what we don't want.

Two more comments.

I noticed the error because I'm very familiar with the generalized matrix eigenvalue problem (I sometimes felt during my working life that all I did was setting up and solving such problems.)

\mathbf {H} \mathbf {C} =\mathbf {S} \mathbf {C} {\boldsymbol {\epsilon }}\quad {\hbox{with}}\quad {\boldsymbol {\epsilon }}=\mathrm {diag} (\epsilon _{1},\ldots ,\epsilon _{n})

The matrix H is Hermitian, S is Hermitian and positive-definite (has non-zero positive eigenvalues) and ε contains the eigenvalues on its diagonal. Beginning students sometimes divide this problem by S, but that is not the way to solve this. One way is transforming by S^−½. Exactly as I did before.

The second comment: I own the classic two-volume book by Courant and Hilbert (2nd German edition). In volume I, chapter V, §3.3 they discuss the S-L problem and introduce the transformation (in their notation) z(x) = v(x) √ρ [where ρ(x) ≡ w(x)]. They give the transformed S-L operator Λ without proof. It took me about two hours to show that their form is indeed equal to the form that I started this discussion with:

{\tilde {\Lambda }}\,z(x)=\left[\rho ^{-1/2}\Lambda \rho ^{-1/2}\right]z(x)=-\left[{\frac {d}{dx}}{\frac {p(x)}{\rho (x)}}{\frac {d}{dx}}\right]z(x)\;-\;{\frac {1}{\sqrt {\rho (x)}}}{\frac {d{\big (}p(x)f(x){\big )}}{dx}}

with

f(x)\equiv {\frac {d\left({\frac {z(x)}{\sqrt {\rho (x)}}}\right)}{dx}}

It took me so long because Courant-Hilbert use a notation in which it is very unclear how far a differential must work. I hope that I avoided this unclarity by introducing f(x). The second term of the transformed Λ is a function, no differentiations are left "free" to act to the right.

--Paul Wormer 09:20, 15 October 2009 (UTC)

Paul, I freely admit that you know much more about partial differential equations and differential operators than I do. This is one of the topics I tried to avoid most of the time. And I believe that your transformation works (though I did not check it in detail).

But I think you misunderstood me. The operator is said to be selfadjoint with respect to the inner product

\langle f,g\rangle =\int _{a}^{b}{\overline {f(x)}}g(x)w(x)\,dx.

not the inner product

\langle f,g\rangle =\int _{a}^{b}{\overline {f(x)}}g(x)\,dx.

and in the first definition the 1 / w cancels with the w on both sides (your L):

\langle u\mid {\frac {1}{w}}Lv\rangle =\langle {\frac {1}{w}}Lu\mid v\rangle

You may want to check this in the book (p.121) cited in the bibliography (accidently, by one of my colleagues here in Vienna).

Peter Schmitt 12:21, 15 October 2009 (UTC)

Remark: In any case, this shows that the article needs either improvement or (preferably?) a "fresh" approach. Peter Schmitt 12:49, 15 October 2009 (UTC)

Peter, this weight function (measure) is tricky. I know of its existence (in group theory it is Haar measure and in the theory of orthogonal polynomials it gives the different kinds of such polynomials). But look at the Sturm-Liouville eigenvalue equation

\Lambda \;\,v(x)=\lambda \;w(x)\;v(x)

Suppose you are right, then projection of left and right hand side by u(x) is done as follows

\int \,{\overline {u(x)}}\Lambda \,v(x)w(x)dx=\lambda \int \,{\overline {u(x)}}\,v(x)\,w(x)^{2}\,dx.

I cannot prove that this is incorrect, but the squared weight does not feel right. To me it seems that the weight is already taken care of by its appearance in the S-L eigenvalue equation. If we take unit weight (as Courant-Hilbert do), then your definition of inner product appears automatically on the right hand side:

\int \,{\overline {u(x)}}\Lambda \,v(x)dx=\lambda \int \,{\overline {u(x)}}\,v(x)\,w(x)\,dx.

If I may make the matrix analogy (Hermitian overlap matrix W):

\mathbf {H} \mathbf {v} =\lambda \mathbf {W} \mathbf {v}

Then you project as

(\mathbf {W} \mathbf {u} )^{\dagger }\mathbf {H} \mathbf {v} =\lambda (\mathbf {W} \mathbf {u} )^{\dagger }\mathbf {W} \mathbf {v} =\lambda \mathbf {u} ^{\dagger }\mathbf {W} ^{2}\mathbf {v}

and I project as:

\mathbf {u} ^{\dagger }\mathbf {H} \mathbf {v} =\lambda \mathbf {u} ^{\dagger }\mathbf {W} \mathbf {v}

where on the right hand side we find the inner product of u and v in an non-orthonormal basis. In brief, integration with unit weight seems to me the appropriate inner product in the context of Sturm-Liouville theory, where the non-unit w appears already in the equation. However, I cannot prove it, it is my gut feeling (and I may call upon the authority of Courant and Hilbert, although I realize that that is an invalid argument).

--Paul Wormer 13:46, 15 October 2009 (UTC)

The above discussion is EXACTLY the reason why I joinged CZ! Argue merits instead of evil reverts and malicious attacks. :) David E. Volk 14:03, 15 October 2009 (UTC)

I think it is supposed to be read as

\int \,{\overline {u(x)}}\;\left({\frac {1}{w}}\Lambda \right)v(x)\;(w(x)dx)=\int \,{\overline {u(x)}}\;\Lambda v(x)\;dx=\int \,{\overline {\Lambda u(x)}}\;v(x)\,dx=\int \,{\overline {\left({\frac {1}{w}}\Lambda \right)u(x)}}\;v(x)\;(w(x)dx)

showing that

\Lambda /w

(with weight) is selfadjoint because

\Lambda

is selfadjoint without weight, and

\left({\frac {1}{w}}\Lambda \right)v(x)=\lambda \;v(x)

It is not a question of correct and incorrect, but rather of "traditional" and "modern". It is the same vector space, but with another measure for orthogonality. Peter Schmitt 14:41, 15 October 2009 (UTC)

[unindent]

Peter, I'm not sure what you mean by

{\overline {\left({\frac {1}{w}}\Lambda \right)u(x)}}

I suspect that I would write this as

\left({\frac {1}{w}}\Lambda \right)^{\dagger }{\overline {u(x)}}=\Lambda {\frac {1}{w}}{\overline {u(x)}}.

If that's correct then Λ acts on 1/w and 1/w does no longer cancel the weight w appearing in the volume element.

Do you agree with the following?

\left({\frac {1}{w}}\Lambda \right)^{\dagger }=\Lambda ^{\dagger }\left({\frac {1}{w}}\right)^{\dagger }=\Lambda {\frac {1}{w}},

This follows, because (IMHO): Λ^† = Λ and w is real, i.e., (1/w)^† = 1/w. Further (or do you disagree with the following inequality Peter?):

{\frac {1}{w}}\Lambda \neq \Lambda {\frac {1}{w}}\;\Longrightarrow \;{\frac {1}{w}}\Lambda \neq \left({\frac {1}{w}}\Lambda \right)^{\dagger }.

It is as with matrices: even if H and W are Hermitian, W⁻¹H is not Hermitian (unless W⁻¹ and H commute).

To David: I would like to correct what I see as an error, but I'm not emotional about it, and I just want to make sure that I am not mistaken before I edit the main article.

--Paul Wormer 16:19, 15 October 2009 (UTC)

I would like to comment on this discussion from a completely different perspective. As I have stated numerous times, I am unfamiliar with S-L theory. So when I read the article I read it in the role of a student. My goal is to better understand the material. The discussion in this section of the talk page has helped me to do that.

But, it probably is not appropriate to put this kind of detailed exposition in an encyclopedia article. That suggests to me that there is a place for more in depth discussions in a subpage associated with the article. One option is a tutorial on S-L theory. Another would be to put this kind of discussion in an appendix subpage. But note, doing this somewhat changes the goals of CZ. This goal is expanded beyond that of building an encyclopedia to building an information resource. It becomes more like a library. I think this is a topic that the draft charter committee should examine, since the concept of subpages seems to move us in that direction anyway. Dan Nessett 16:38, 15 October 2009 (UTC)

Now we come to the point: I read it as

{\frac {1}{w}}\Lambda =\Lambda {\frac {1}{w}}\;\Longrightarrow \;{\frac {1}{w}}\Lambda ^{\dagger }=\left({\frac {1}{w}}\Lambda \right)^{\dagger }

Remember, in the article L is defined as $\Lambda /w$ , simply as scalar factor for each x(similar to the multipliction with a diagonal matrix). Since w is real these factors remain unchanged in the adjoint operator

Peter Schmitt 17:50, 15 October 2009 (UTC)

I don't understand what you're saying here. I repeat once more my arguments: the operator Λ is a differential operator, 1 / w(x) is a function of x, hence the two do not commute (if they would quantum theory would not exist). The notation

\Lambda /w

is ambiguous, is it

(1/w)\Lambda ,\quad \Lambda (1/w),\quad {\hbox{or}}\quad -[p'(1/w)'+p(1/w)'']\;?

A diagonal matrix does not commute with an arbitrary square matrix (except if the diagonal matrix is c E). This is the content of Schur's lemma.

--Paul Wormer 07:44, 17 October 2009 (UTC)

You are right, Paul. Mentioning diagonal matrices was wrong (I was thinking of cE). But I still don't see your problem. Applying

(\Lambda /w)

on u means applying

(\Lambda )

and then dividing the resulting fuction (pointwise) by w. Since w is real this remains the same if you use the adjoint operator. You divide after the (adjoint)

\Lambda

is applied on v. It need not and is not meant to commute as an operator. (In this sense it is like a scalar, a scalar function that is cancelled by the weight of the inner product.) (Moreover, I trust the book - link above - by my colleague who is a specialist in differential equations.) Peter Schmitt 09:54, 17 October 2009 (UTC)

Making sure credit is given where credit is due

Since I have decided to no longer contribute to this article, due to my lack of expertise in the underlying field, I want to get on record who is responsible for the orthogonality proof on the Proofs subpage while I still remember to do it. I am recording this on the article talk page, since from the history of Proofs subpage the correct credit is not evident.

The proof of the orthogonality relation was developed by a friend and former colleague of mine at LLNL, John G. Fletcher. While I converted it from MS Word to mediawiki markup, I had nothing to do with the proof's logic. I state this since readers of the history of the proof page might come to an incorrect conclusion by virtue of the fact that I am recorded as the person who added the proof to the page. Dan Nessett 18:02, 16 October 2009 (UTC)

Talk:Sturm-Liouville theory

Numbered equations

Error

Making sure credit is given where credit is due

Navigation menu

Search