Tensors¶

Definition / Introduction¶

In the context of data science and deep learning (and some physics/engineering fields), a tensor can be thought of as a generalization of scalars, vectors, and matrices to potentially more dimensions.
It's a multi-dimensional array of numbers (scalars).
While tensors have a more rigorous definition in physics and differential geometry involving basis vector transformations, in ML/DL we primarily use them as containers for numerical data with multiple axes.

The rank of a tensor is its number of axes or dimensions.
Rank 0 Tensor: A Scalar (a single number). Has 0 axes. Shape: ()
Rank 1 Tensor: A Vector (a 1D array of numbers). Has 1 axis. Shape: (n,)
Rank 2 Tensor: A Matrix (a 2D array of numbers). Has 2 axes (rows, columns). Shape: (m, n)
Rank 3 Tensor: A 3D array of numbers (like a cube or cuboid). Has 3 axes. Shape: (d, m, n).
- Example: A color image might be represented as a Rank 3 tensor (height, width, color_channels=3). A batch of grayscale images could be (batch_size, height, width).
Rank n Tensor: An n-dimensional array of numbers. Has \(n\) axes. Shape: \((d_1, d_2, ..., d_n)\).

A tensor \(\mathbf{T}\) requires multiple indices to access its elements. For a [[Tensor Rank|rank 3 tensor]], an element would be \(T_{ijk}\) (element at index \(i\) on axis 0, \(j\) on axis 1, \(k\) on axis 2).
Deep learning libraries (TensorFlow, PyTorch, JAX) use tensors as their fundamental data structure.

The shape of a tensor is a tuple of integers defining the size of the array along each dimension (axis).
Example Shapes:
- Scalar: ()
- Vector (length 5): (5,)
- Matrix (3 rows, 4 cols): (3, 4)
- [[Tensor Rank|Rank 3]] Tensor (e.g., 10 images, 28x28 pixels): (10, 28, 28)

Input Data (Deep Learning): Tensors are the standard way to represent batches of data.
- Vector Data: (batch_size, num_features) - [[Tensor Rank|Rank 2]]
- Time Series / Sequence Data (NLP): (batch_size, sequence_length, num_features) - [[Tensor Rank|Rank 3]]
- Grayscale Images: (batch_size, height, width) or (batch_size, height, width, 1) - [[Tensor Rank|Rank 3]] or [[Tensor Rank|Rank 4]]
- Color Images: (batch_size, height, width, num_channels=3) - [[Tensor Rank|Rank 4]]
- Video Data: (batch_size, num_frames, height, width, num_channels) - [[Tensor Rank|Rank 5]]
Model Parameters: Weights and biases in neural network layers can be organized into tensors of various ranks.
Intermediate Activations: The outputs of layers in a neural network are tensors.

Tensors generalize Scalars, Vectors, and Matrices.
Operations like addition, scalar multiplication, and more complex tensor operations (e.g., tensor product, contraction, reshaping) are defined on tensors. These are implemented efficiently in libraries like NumPy, TensorFlow, and PyTorch.

A tensor is a multi-dimensional array of numbers; a generalization of scalars, vectors, and matrices.
[[Tensor Rank|Rank]] (or order) refers to the number of dimensions/axes.
Shape defines the size along each dimension.
The fundamental data structure for representing data and parameters in Deep Learning.