Whenever I want to learn a new subject, I announce a graduate course in it, since the best way to learn is by teaching. But even better than teaching humans is teaching computers, i.e. program! Since computers will not let you wave your hands and wing it.
– Doron Zeilberger (1999)
Suppose that we are given an -dimensional first-order vector ordinary differential equation (ODE)
where is a vector field. A solution of this ODE is a function
whose first derivative,
, is equal to
. Elementary, my dear Watson!
Despite the fact that the notation is rather pervasive, it is dangerous, in my humble opinion. One should keep in mind that such notation is an abbreviation. How could it not be an abbreviation? Note that
is a function, and that the argument of
is a vector in
, not a function! In other words, writing
results in a type error, and a compiler / interpreter with type checking would never accept the declaration
. We, fallible humans, accept it because we know (do we?) that it is an abbreviation, and because it is tradition. Who are we to challenge a centuries-old tradition?
If is an abbreviation, what exactly is it an abbreviation for? It is an abbreviation for the following
where the symbol denotes equality of vectors. Note that the argument of
is the vector
(i.e., function
evaluated at
), not the function
. But, what does the equation above say? It says the following:
Proposition: Given a vector field , there exists a function
, whose first derivative is
, such that for every
we have that
.
This proposition may be true or false. Not all differential equations have solutions, after all, which is why the study of existence of solutions is (was?) an active area of research. Note the presence of the words “for every” in the proposition above. Do you see any universal quantifier, , anywhere? You do not, and that is due to the fact that
is also an abbreviation! The universal quantifier is implicit. If we make it explicit, we obtain
where ranges over
. The universally quantified formula above is the conjunction of infinitely many equations. However, do note that the words “there exists” also appear in the proposition, which suggests that an existential quantifier is missing. Therefore, the non-abbreviated notation would be as follows
where ranges over the set of all (continuous?) functions from
to
, and
ranges over
.
__________
A better notation
So far, I have criticized the abbreviated notation for differential equations. I consider it harmful, as confusing a function with its evaluation
is an atrocious crime against types. To make this post more “constructive”, I will take the liberty of proposing a better notation.
We start by noting that can be written in the form
, where the symbol
denotes function composition. We now introduce a differential operator
, which maps functions to functions, so that we obtain the first derivative of function
via
. Hence,
, which allows us to rewrite
in the following form
where ranges over
. The universally quantified formula above states that functions
and
, both of which from
to
evaluate to the same values for all possible choices of the input
, i.e., functions
and
are equal. The formula above can thus be rewritten more compactly as
.
Finally, we conclude that is equivalent to the following existentially quantified formula
where the symbol denotes equality of functions. This new notation may not be as compact as
, but at least now the RHS does not result in type error. Criticism would be most welcome.
Tags: Abbreviated Notation, Crimes against types, Differential Equations, Mathematical Notation, ODEs, Ordinary Differential Equations
June 5, 2012 at 20:06 |
Totally agree. I had same gripes. Wrote fair amount about it too.
June 7, 2012 at 10:14 |
Thanks for sharing. You may want to take a look at this:
Some thoughts on differential equation notation
June 7, 2012 at 21:06 |
Thanks a lot for this resource! btw, you wrote this blog right? am making a link to here, and would add the proper name if you don’t mind. thanks.
March 16, 2013 at 07:25 |
Thanks for the post. Math grammar is a very interesting topic. (Mathematicians, AFAIK, generally don’t think so.)
However I have some criticism. I think that even the “moderate” use of quantifiers could be almost as harmful as the lousy notation.
You could (and did) get rid of
by introducing the abstract
in place of
. When you have a statement about functions you don’t have to care about their parameters at all. And that is good, because this is one of those abstractions that make math powerful.
And I think that the
quantifier is unnecessary too. Without it, you’d got the equation. It is not a definition, and it is not an assignment. It is just another mathematical object, like a function, or a vector, or a vector field, and it is valuable by itself.
ODE is not a statement that “something exists”. There could be only a trivial solution to ODE, or no solution at all. The absence of solutions doesn’t necessarily make this particular ODE uninteresting.
March 16, 2013 at 16:49 |
Thanks for your constructive comment. I must say that I agree with your criticism. In retrospect, I believe I focused too much on the existence of solutions. Deciding the existence of solutions is merely one interesting problem that arises in the study of ODEs, but it’s not the only one.
Hence, I could just rewrite the ODE
in the form
. No quantifiers required. It is not a big change, but at least it type-checks.
March 16, 2013 at 22:10 |
Type-checking is really important. It could be very difficult to understand the author’s reasoning if there’s an unannounced notation abuse, and the material is new to the reader.
Alas, I’ve seen a lot of experienced people who don’t really see the difference between “function $f$” and “function $f(x)$”. I would even say it’s a bit frightening.
“$\dot{x} = f(x)$” seems weird whatever meaning one puts into $x$. This $x$ could be a function, but traditional math notation then implies that $f(x)$ is a single vector from a vector field $f$, picked in point $x$. But if $x$ is point, then type-checking for the LHS fails: derivative of a point is meaningless (zero at best).
The notation “$\dot{x} = f \circ x$” (or “$D(x) = f \circ x$”) looks more consistent from the geometrical point of view, becuse it emphasises that $x\colon \mathbb{R} \to \mathbb{R}^n$ is a curve in the space $\mathbb{R}^n$, and both $f \circ x$ and $\dot{x}$ are curves in the corresponding tangent space $T \mathbb{R}^n$.