Mean Squared Error (MSE) is a common loss function for regression tasks and a measure of the quality of an estimator. It is defined as the average of the squared differences between predicted values (\hat{y}) and actual values (y): MSE = (1/n) * Σ (y_i - \hat{y}_i)^2. Squaring the error amplifies larger errors, making the loss more sensitive to outliers. MSE is differentiable, which is convenient for gradient-based optimization: the gradient with respect to each prediction is ∂MSE/∂\hat{y}_i = -(2/n)(y_i - \hat{y}_i). The square root of MSE is the RMSE (Root Mean Squared Error), which is in the same units as the target and therefore easier to interpret. MSE is also related to variance: the expected MSE of an estimator decomposes into Bias^2 + Variance + irreducible noise (σ^2). In model training, minimizing MSE makes the optimal prediction for a given input the conditional mean of the target distribution; if the noise is Gaussian, minimizing MSE is additionally equivalent to maximum-likelihood estimation. While simple and widely used, MSE may not be ideal when outliers are prevalent (MAE can be more robust in those cases) or when one cares about relative rather than absolute errors.
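The definitions above can be sketched in a few lines of NumPy. The function and variable names here (`mse`, `mse_gradient`, `y`, `y_hat`) are illustrative choices, not from any particular library; the gradient follows directly from differentiating (1/n) * Σ (y_i - \hat{y}_i)^2 with respect to each \hat{y}_i:

```python
import numpy as np

def mse(y_true, y_pred):
    """Mean Squared Error: average of the squared residuals."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    residuals = y_true - y_pred
    return np.mean(residuals ** 2)

def mse_gradient(y_true, y_pred):
    """Gradient of MSE with respect to the predictions:
    d/d y_pred_i [ (1/n) * sum_j (y_j - y_pred_j)^2 ] = -(2/n) * (y_i - y_pred_i)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return -2.0 * (y_true - y_pred) / y_true.size

# Example: residuals are [0.5, -0.5, 0.0, -1.0], so MSE = 1.5 / 4 = 0.375
y = [3.0, -0.5, 2.0, 7.0]
y_hat = [2.5, 0.0, 2.0, 8.0]
print(mse(y, y_hat))           # 0.375
print(np.sqrt(mse(y, y_hat)))  # RMSE, in the same units as y
```

Note how the single residual of -1.0 contributes 1.0 of the total 1.5 squared error, illustrating how squaring weights larger errors more heavily.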