Given a smooth real function f, we can approximate it as a sum of polynomials as

f(x+h)=f(x)+hf′(x)+h22!f″

where \lim_{h\to0} R_n(h)=0.There are multiple ways to derive this result. Some are already discussed in the answers to the question “Where do the factorials come from in the taylor series?“.

An easy way to see why the 1/k! factors must be there is to observe that computing \partial_h^k f(x+h)\rvert_{h=0}, we need the 1/k! factors to balance the k! factors arising from \partial_h^k h^k=k! in order to get a consistent result on the left- and right-hand sides.

However, even though algebraically it is very clear why we need these factorials, I don’t have any intuition as to why they should be there. Is there any geometrical (or similarly intuitive) argument to see where they come from?

**Answer**

Yes. There is a *geometric* explanation.

For simplicity, let me take x=0 and h=1.

By the Fundamental Theorem of Calculus (FTC),

f(1)=f(0)+\int_{0}^{1}dt_1\ f'(t_1)\ .

Now use the FTC for the f'(t_1) inside the integral, which gives

f'(t_1)=f'(0)+\int_{0}^{t_1}dt_2\ f”(t_2)\ ,

and insert this in the previous equation.

We then get

f(1)=f(0)+f'(0)+\int_{0}^{1}dt_1\int_{0}^{t_1}dt_2 f”(t_2)\ .

Keep iterating this, using the FTC to rewrite the last integrand, each time invoking a new variable t_k.

At the end of the day, one obtains

f(1)=\sum_{k=0}^{n}\int_{\Delta_k} dt_1\cdots dt_k\ f^{(k)}(0)\ +\ {\rm remainder}

where \Delta_k is the simplex

\{(t_1,\ldots,t_k)\in\mathbb{R}^k\ |\ 1>t_1>\cdots>t_k>0\}\ .

For example \Delta_{2} is a triangle in the plane, and \Delta_3 is a tetrahedron in 3D, etc.

The \frac{1}{k!} is just the *volume* of \Delta_k. Indeed, by a simple change of variables (renaming), the volume is the same for all k! simplices of the form

\{(t_1,\ldots,t_k)\in\mathbb{R}^k\ |\ 1>t_{\sigma(1)}>\cdots>t_{\sigma(k)}>0\}

where \sigma is a permutation of \{1,2,\ldots,k\}.

Putting all these simplices together essentially reproduces the cube [0,1]^k which of course has volume 1.

**Exercise:** Recover the usual formula for the integral remainder using the above method.

**Remark 1:** As Sangchul said in the comment, the method is related to the notion of ordered exponential. In a basic course on ODEs, one usually sees the notion of fundamental solution \Phi(t) of a linear system of differential equations X'(t)=A(t)X(t). One can rewrite the equation for \Phi(t) in integral form and do the same iteration as in the above method with the result

\Phi(s)=\sum_{k=0}^{\infty}\int_{s\Delta_k} dt_1\cdots dt_k\ A(t_1)\cdots A(t_k)\ .

It is only when the matrices A(t) for different times *commute*, that one can use the above permutation and cube reconstruction, in order to write the above series as an exponential. This happens in one dimension and also when A(t) is time independent, i.e., for the two textbook examples where one has explicit formulas.

**Remark 2:** The method I used for the Taylor expansion is related to how Newton approached the question using divided differences.

The relation between Newton’s iterated divided differences and the iterated integrals I used is provide by the Hermite-Genocchi formula.

**Remark 3:** These iterated integrals are also useful in proving some combinatorial identities, see this MO answer:

https://mathoverflow.net/questions/74102/rational-function-identity/74280#74280

They were also used by K.T. Chen in topology, and they also feature in the theory of rough paths developed by Terry Lyons.

The best I can do, as far as (hopefully) nice figures, is the following.

The simplex \Delta_1

has one-dimensional volume, i.e., length =1=\frac{1}{1!}.

The simplex \Delta_2

has two-dimensional volume, i.e., area =\frac{1}{2}=\frac{1}{2!}.

The simplex \Delta_3

has three-dimensional volume, i.e., just volume =\frac{1}{6}=\frac{1}{3!}.

For obvious reasons, I will stop here.

**Attribution***Source : Link , Question Author : glS , Answer Author : Abdelmalek Abdesselam*