1.4 Dividing FPSs
1.4.1 Inverses in commutative rings
Let \(L\) be a commutative ring. Let \(a\in L\). Then:
(a) An inverse (or multiplicative inverse) of \(a\) means an element \(b\in L\) such that \(ab=ba=1\).
(b) We say that \(a\) is invertible in \(L\) (or a unit of \(L\)) if \(a\) has an inverse.
Let \(L\) be a commutative ring. Let \(a\in L\). Then, there is at most one inverse of \(a\).
Let \(b\) and \(c\) be two inverses of \(a\). Since \(b\) is an inverse of \(a\), we have \(ab=ba=1\). Since \(c\) is an inverse of \(a\), we have \(ac=ca=1\). Now, \(b\left(ac\right) = b\cdot 1 = b\) and \(\left(ba\right)c = 1\cdot c = c\). However, \(b\left(ac\right) = \left(ba\right)c\) by associativity. Hence, \(b = b\left(ac\right) = \left(ba\right)c = c\).
Let \(L\) be a commutative ring. Let \(a\in L\). Assume that \(a\) is invertible. Then:
(a) The inverse of \(a\) is called \(a^{-1}\).
(b) For any \(b\in L\), the product \(b\cdot a^{-1}\) is called \(\frac{b}{a}\) (or \(b/a\)).
(c) For any negative integer \(n\), we define \(a^{n}\) to be \(\left( a^{-1}\right)^{-n}\). Thus, the \(n\)-th power \(a^{n}\) is defined for each \(n\in \mathbb {Z}\).
- AlgebraicCombinatorics.divByUnit b a = b * ↑a⁻¹
Let \(L\) be a commutative ring. Then:
(a) Any invertible element \(a\in L\) satisfies \(a^{-1}=1/a\).
(b) For any invertible elements \(a,b\in L\), the element \(ab\) is invertible as well, and satisfies \(\left(ab\right)^{-1}=b^{-1}a^{-1}=a^{-1}b^{-1}\).
(c) If \(a\in L\) is invertible, and if \(n\in \mathbb {Z}\) is arbitrary, then \(a^{-n}=\left(a^{-1}\right)^{n}=\left(a^{n}\right)^{-1}\).
(d) Laws of exponents hold for negative exponents as well: If \(a\) and \(b\) are invertible elements of \(L\), then
(e) Laws of fractions hold: If \(a\) and \(c\) are two invertible elements of \(L\), and if \(b\) and \(d\) are any two elements of \(L\), then
(f) Division undoes multiplication: If \(a,b,c\) are three elements of \(L\) with \(a\) being invertible, then the equality \(\frac{c}{a}=b\) is equivalent to \(c=ab\).
Exercise (see, e.g., [ 19s , solution to Exercise 4.1.1 ] for a proof of parts (c) and (d) in the special case where \(L=\mathbb {C}\); essentially the same argument works in the general case).
1.4.2 Inverses in \(K[[x]]\)
Let \(a\in K[[x]]\). Then, the FPS \(a\) is invertible in \(K[[x]]\) if and only if its constant term \([x^{0}]a\) is invertible in \(K\).
\(\Longrightarrow \): If \(ab=1\), then \([x^0]a\cdot [x^0]b = [x^0](ab)=1\) (by Lemma 1.87).
\(\Longleftarrow \): If \([x^0]a = a_0\) is invertible, define a sequence \((b_0, b_1, b_2, \ldots )\) recursively by \(b_0 = a_0^{-1}\) (Lemma 1.113) and \(b_n = -a_0^{-1}\sum _{k=1}^n a_k b_{n-k}\) for \(n \geq 1\) (Lemma 1.114). Then \(ab = 1\).
Assume that \(K\) is a field. Let \(a\in K[[x]]\). Then, the FPS \(a\) is invertible in \(K[[x]]\) if and only if \([x^{0}]a\neq 0\).
1.4.3 Coefficient formulas for inverses
Assume that \(K\) is a field. Let \(f\in K[[x]]\). Then the constant term of \(f^{-1}\) equals the inverse of the constant term of \(f\):
Direct consequence of the definition of the inverse of an FPS.
Assume that \(K\) is a field. Let \(f\in K[[x]]\). For each \(n\in \mathbb {N}\),
This recurrence allows computing the coefficients of \(f^{-1}\) one by one.
Follows from the recurrence relation for the coefficients of \(f^{-1}\) obtained by expanding the equation \(f\cdot f^{-1}=1\) and solving for \([x^{n+1}]f^{-1}\).
Assume that \(K\) is a field. Let \(f\in K[[x]]\) be an invertible FPS. Then the inverse of \(f\) obtained from the invertibility proof equals \(f^{-1}\) (the inverse as defined in the power series ring).
Both satisfy \(f\cdot g = 1\), and inverses are unique (Theorem 1.108).
Assume that \(K\) is a field. Let \(f\in K[[x]]\) be an invertible FPS. Then
Assume that \(K\) is a field. Let \(f\in K[[x]]\) be an invertible FPS. For each \(n\in \mathbb {N}\),
1.4.4 Newton’s binomial formula
The FPS \(1+x\in K[[x]]\) is invertible, and its inverse is
Direct computation: \((1+x)\cdot \sum _{n\geq 0}(-1)^n x^n = 1\) by telescoping (all powers of \(x\) other than \(1\) cancel out).
For each \(n\in \mathbb {Z}\), we have
For \(n\geq 0\), this is the standard binomial theorem (the sum \(\sum _{k\in \mathbb {N}}\binom {n}{k}x^{k}\) reduces to \(\sum _{k=0}^{n}\binom {n}{k}x^{k}\) since \(\binom {n}{k}=0\) for \(k {\gt} n\) by Proposition 1.8).
For \(n{\lt}0\), we have \(-n\in \mathbb {N}\), so Corollary 1.122 (applied to \(-n\) instead of \(n\)) yields \(\left(1+x\right)^{-(-n)} = \sum _{k\in \mathbb {N}}\binom {-(-n)}{k}x^k\), which rewrites as \(\left(1+x\right)^n = \sum _{k\in \mathbb {N}}\binom {n}{k}x^k\).
Let \(n\in \mathbb {C}\) and \(k\in \mathbb {Z}\). Then,
If \(k{\lt}0\), then both \(\binom {-n}{k}\) and \(\binom {k+n-1}{k}\) are \(0\). For \(k\geq 0\), expand using the definition and observe the numerators differ by a factor of \((-1)^k\).
For each \(n\in \mathbb {N}\), we have
By induction on \(n\).
Induction base: \(\left(1+x\right)^{-0} = 1\), and \(\sum _{k\in \mathbb {N}} (-1)^k \binom {0+k-1}{k} x^k = 1\) since \(\binom {k-1}{k} = 0\) for all \(k {\gt} 0\) (by Proposition 1.8).
Induction step: Assume the formula holds for \(n = j\). We must prove it for \(n = j+1\). We have \(\left(1+x\right)^{-(j+1)} = \left(1+x\right)^{-j} \cdot \left(1+x\right)^{-1}\). Using the induction hypothesis and multiplying by \(1+x\), it suffices to show
Expanding the right hand side, using Pascal’s identity \(\binom {j+k}{k} = \binom {j+k-1}{k-1} + \binom {j+k-1}{k}\) (Proposition 1.5), and collecting terms (the \(\binom {j+k-1}{k-1}\) terms telescope), we obtain \(\sum _{k\in \mathbb {N}} (-1)^k \binom {j+k-1}{k} x^k = \left(1+x\right)^{-j}\) by the induction hypothesis.
For each \(n\in \mathbb {N}\), we have
1.4.5 Dividing by \(x\)
Let \(a=\left(a_{0},a_{1},a_{2},\ldots \right)\) be an FPS whose constant term \(a_{0}\) is \(0\). Then, \(\frac{a}{x}\) is defined to be the FPS \(\left(a_{1},a_{2},a_{3},\ldots \right)\).
- AlgebraicCombinatorics.PowerSeries.divByX a x✝ = PowerSeries.mk fun (n : ℕ) => (PowerSeries.coeff (n + 1)) a
Let \(a\in K[[x]]\) with \([x^0]a=0\). Then for each \(n\in \mathbb {N}\),
By definition, \(\frac{a}{x}=(a_1,a_2,a_3,\ldots )\), so its \(n\)-th coefficient is \(a_{n+1}=[x^{n+1}]a\).
Let \(a,b\in K[[x]]\) be two FPSs. Then, \(a=xb\) if and only if \(\left[x^{0}\right]a=0\) and \(b=\frac{a}{x}\).
Follows directly from the definitions.
Helpers for division by \(x\)
Let \(a\in K[[x]]\) with \([x^0]a=0\). Then \(a = x\cdot \frac{a}{x}\).
Follows from Proposition 1.125: setting \(b=\frac{a}{x}\), the right-to-left direction gives \(a=xb\).
For any FPS \(b\in K[[x]]\), we have \(\frac{xb}{x}=b\).
For each \(n\in \mathbb {N}\), \([x^n]\frac{xb}{x}=[x^{n+1}](xb)=[x^n]b\) (by Lemma 1.124 and the coefficient formula for \(xb\)).
1.4.6 A few lemmas
Let \(a\in K[[x]]\) be an FPS with \(\left[x^{0}\right]a=0\). Then, there exists an \(h\in K[[x]]\) such that \(a=xh\).
The FPS \(\frac{a}{x}\) is well-defined (since \(a_0 = 0\)). Set \(h = \frac{a}{x}\); then \(a = x \cdot \frac{a}{x} = xh\) (by Lemma 1.126).
Let \(k\in \mathbb {N}\). Let \(a\in K[[x]]\) be any FPS. Then, the first \(k\) coefficients of the FPS \(x^{k}a\) are \(0\).
For \(m{\lt}k\), we have \([x^m](x^k a) = \sum _{i=0}^{m} [x^i](x^k)\cdot [x^{m-i}]a = 0\) since \([x^i](x^k) = 0\) for all \(i\leq m {\lt} k\).
Let \(k\in \mathbb {N}\). Let \(f\in K[[x]]\) be any FPS. Then, the first \(k\) coefficients of \(f\) are \(0\) if and only if \(f\) is a multiple of \(x^{k}\).
\(\Longleftarrow \): By Lemma 1.129.
\(\Longrightarrow \): If \(f_n = 0\) for each \(n \in \{ 0, 1, \ldots , k-1\} \), then \(f = \sum _{n \geq k} f_n x^n = x^k \sum _{n \geq k} f_n x^{n-k}\).
Let \(a,f,g\in K[[x]]\) be three FPSs. Let \(n\in \mathbb {N}\). Assume that \([x^{m}]f=[x^{m}]g\) for each \(m\in \{ 0,1,\ldots ,n\} \). Then, \([x^{m}](af)=[x^{m}](ag)\) for each \(m\in \{ 0,1,\ldots ,n\} \).
Let \(m \in \{ 0,1,\ldots ,n\} \). Each \(j \in \{ 0,1,\ldots ,m\} \) satisfies \(j \leq m \leq n\) and thus \([x^j]f = [x^j]g\). Hence, \([x^m](af) = \sum _{j=0}^m [x^{m-j}]a \cdot [x^j]f = \sum _{j=0}^m [x^{m-j}]a \cdot [x^j]g = [x^m](ag)\).
Let \(u,v\in K[[x]]\) be two FPSs such that \(v\) is a multiple of \(u\). Let \(n\in \mathbb {N}\). Assume \([x^{m}]u=0\) for each \(m\in \{ 0,1,\ldots ,n\} \). Then \([x^{m}]v=0\) for each \(m\in \{ 0,1,\ldots ,n\} \).
Write \(v=ua\). We have \([x^m]u = 0 = [x^m]0\) for each \(m \in \{ 0,1,\ldots ,n\} \). Lemma 1.131 (applied to \(f=u\) and \(g=0\)) yields \([x^m](au) = [x^m](a \cdot 0) = 0\) for each \(m \in \{ 0,1,\ldots ,n\} \). Since \(v = ua = au\), we get \([x^m]v = 0\).
Let \(a,b,c,d\in K[[x]]\) be four FPSs. Let \(n\in \mathbb {N}\). Assume \([x^{m}]a=[x^{m}]b\) and \([x^{m}]c=[x^{m}]d\) for each \(m\in \{ 0,1,\ldots ,n\} \). Then \([x^{m}](ac)=[x^{m}](bd)\) for each \(m\in \{ 0,1,\ldots ,n\} \).
The FPS \(ac - bc = (a-b)c\) is a multiple of \(a-b\), and \([x^m](a-b) = 0\) for each \(m \in \{ 0,1,\ldots ,n\} \). By Lemma 1.132, \([x^m](ac - bc) = 0\), so \([x^m](ac) = [x^m](bc)\).
Similarly, \(bc - bd = b(c-d) = (c-d)b\) is a multiple of \(c-d\), and \([x^m](c-d) = 0\). By Lemma 1.132, \([x^m](bc - bd) = 0\), so \([x^m](bc) = [x^m](bd)\).
Combining: \([x^m](ac) = [x^m](bc) = [x^m](bd)\).