| advertise add site services publishers database health videos | ![]() | about toolbar stats live show health store more stuff JOIN/LOGIN |
ASMB - American Society for Matrix Biology - Matrix Biology Journal asmb.net | Lumbar Matrix Scan Training || Spine Matrix spinematrix.com | Matrix Institute The Matrix Model matrixinstitute.org | Cal Matrix by Metagenics: Cal Matrix 180 tablets 204 milligrams... acuatlanta.net |
In mathematics, matrix calculus is a specialized notation for doing multivariable calculus, especially over spaces of matrices, where it defines the matrix derivative. This notation is well-suited to describing systems of differential equations, and taking derivatives of matrix-valued functions with respect to matrix variables. This notation is commonly used in statistics and engineering, while the tensor index notation is preferred in physics.
[edit] NoticeThis article uses another definition for vector and matrix calculus than the form often encountered within the field of estimation theory and pattern recognition. The resulting equations will therefore appear to be transposed when compared to the equations used in textbooks within these fields. [edit] NotationLet M(n,m) denote the space of real n×m matrices with n rows and m columns, such matrices will be denoted using bold capital letters: A, X, Y, etc. An element of M(n,1), that is, a column vector, is denoted with a boldface lowercase letter: a, x, y, etc. An element of M(1,1) is a scalar, denoted with lowercase italic typeface: a, t, x, etc. XT denotes matrix transpose, tr(X) is trace, and det(X) is the determinant. All functions are assumed to be of differentiability class C1 unless otherwise noted. Generally letters from first half of the alphabet (a, b, c, …) will be used to denote constants, and from the second half (t, x, y, …) to denote variables. [edit] Vector calculusMain article: Vector calculus Because the space M(n,1) is identified with the Euclidean space Rn and M(1,1) is identified with R, the notations developed here can accommodate the usual operations of vector calculus.
[edit] Matrix calculus
For the purposes of defining derivatives of simple functions, not much changes with matrix spaces; the space of n×m matrices is isomorphic to the vector space Rnm. The three derivatives familiar from vector calculus have close analogues here, though beware the complications that arise in the identities below.
[edit] Identities
Note that matrix multiplication is not commutative, so in these identities, the order must not be changed.
[edit] Examples
[edit] Derivative of linear functionsThis section lists some commonly used vector derivative formulas for linear equations evaluating to a vector. [edit] Derivative of quadratic functionsThis section lists some commonly used vector derivative formulas for quadratic matrix equations evaluating to a scalar. Related to this is the derivative of the Euclidean norm: [edit] Derivative of matrix tracesThis section shows examples of matrix differentiation of common trace equations. [edit] Derivative of matrix determinant[edit] Relation to other derivativesThe matrix derivative is a convenient notation for keeping track of partial derivatives for doing calculations. The Fréchet derivative is the standard way in the setting of functional analysis to take derivatives with respect to vectors. In the case that a matrix function of a matrix is Fréchet differentiable, the two derivatives will agree up to translation of notations. As is the case in general for partial derivatives, some formulae may extend under weaker analytic conditions than the existence of the derivative as approximating linear mapping. [edit] UsagesMatrix calculus is used for deriving optimal stochastic estimators, often involving the use of Lagrange multipliers. This includes the derivation of: [edit] AlternativesThe tensor index notation with its Einstein summation convention is very similar to the matrix calculus, except one writes only a single component at a time. It has the advantage that one can easily manipulate arbitrarily high rank tensors, whereas tensors of rank higher than two are quite unwieldy with matrix notation. Note that a matrix can be considered simply a tensor of rank two. [edit] See also[edit] External links
|
| ↑ top of page ↑ | about thumbshots |