A-Z of Machine Learning and Computer Vision Terms

  • This is some text inside of a div block.
  • This is some text inside of a div block.
  • This is some text inside of a div block.
  • This is some text inside of a div block.
  • This is some text inside of a div block.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
PyTorch
PyTorch
Q
Q
Quantum Machine Learning
Quantum Machine Learning
Query Strategy (Active Learning)
Query Strategy (Active Learning)
Query Synthesis Methods
Query Synthesis Methods
R
R
RAG Architecture
RAG Architecture
ROC (Receiver Operating Characteristic) Curve
ROC (Receiver Operating Characteristic) Curve
Random Forest
Random Forest
Recall (Sensitivity or True Positive Rate)
Recall (Sensitivity or True Positive Rate)
Recurrent Neural Network (RNN)
Recurrent Neural Network (RNN)
Region-Based CNN (R-CNN)
Region-Based CNN (R-CNN)
Regression (Regression Analysis)
Regression (Regression Analysis)
Regularization Algorithms
Regularization Algorithms
Reinforcement Learning
Reinforcement Learning
Responsible AI
Responsible AI
S
S
Scale Imbalance
Scale Imbalance
Scikit-Learn
Scikit-Learn
Segment Anything Model (SAM)
Segment Anything Model (SAM)
Selective Sampling
Selective Sampling
Self-Supervised Learning
Self-Supervised Learning
Semantic Segmentation
Semantic Segmentation
Semi-supervised Learning
Semi-supervised Learning
Sensitivity and Specificity of Machine Learning
Sensitivity and Specificity of Machine Learning
Sentiment Analysis
Sentiment Analysis
Sliding Window Attention
Sliding Window Attention
Stream-Based Selective Sampling
Stream-Based Selective Sampling
Supervised Learning
Supervised Learning
Support Vector Machine (SVM)
Support Vector Machine (SVM)
Surrogate Model
Surrogate Model
Synthetic Data
Synthetic Data
T
T
Tabular Data
Tabular Data
Text Generation Inference
Text Generation Inference
Training Data
Training Data
Transfer Learning
Transfer Learning
Transformers (Transformer Networks)
Transformers (Transformer Networks)
Triplet Loss
Triplet Loss
True Positive Rate (TPR)
True Positive Rate (TPR)
Type I Error (False Positive)
Type I Error (False Positive)
Type II Error (False Negative)
Type II Error (False Negative)
U
U
Unsupervised Learning
Unsupervised Learning
V
V
Variance (Model Variance)
Variance (Model Variance)
Variational Autoencoders
Variational Autoencoders
W
W
Weak Supervision
Weak Supervision
Weight Decay (L2 Regularization)
Weight Decay (L2 Regularization)
X
X
XAI (Explainable AI)
XAI (Explainable AI)
XGBoost
XGBoost
Y
Y
YOLO (You Only Look Once)
YOLO (You Only Look Once)
Yolo Object Detection
Yolo Object Detection
Z
Z
Zero-Shot Learning
Zero-Shot Learning
C

Computer Vision

Computer vision is a field of artificial intelligence that focuses on enabling computers and systems to see, interpret, and understand visual information from the world​. It sits at the intersection of image processing, pattern recognition, and machine learning. In essence, computer vision algorithms take digital images or videos as input and produce some form of output that can be an understanding of the scene – for example, identifying what objects are present, where they are, what action is happening, or even generating a description in words. If “AI enables computers to think”, then one often says “computer vision enables computers to see”​. This involves mimicking aspects of human vision, but also going beyond it in speed and precision for certain tasks.At its core, computer vision involves processing an image at the pixel level and then extracting higher-level features and information​. Key tasks in computer vision include: image classification – determining what main object or category an image belongs to (e.g., does this image contain a cat or a dog?)​, object detection – locating and classifying multiple objects within an image (e.g., finding all instances of cars, pedestrians, etc., in a street photo); segmentation – partitioning the image into regions corresponding to different objects or classes (semantic segmentation labels each pixel by class, instance segmentation further separates individual object instances); pose estimation – finding the pose of persons or objects, such as the positions of human joints in an image; face recognition – identifying or verifying persons in images; and many more specialized problems (like optical character recognition, depth estimation, image captioning, etc.). Modern computer vision heavily relies on deep learning. In particular, convolutional neural networks (CNNs) revolutionized vision in the 2010s by automatically learning hierarchical feature detectors from data (edges -> textures -> parts -> objects), rather than requiring handcrafted filters. For example, a CNN-based model can learn to detect the presence of a cat in an image by training on thousands of cat vs. non-cat images, implicitly learning the visual features that distinguish cats (like fur patterns, face shape, etc.). Vision transformers (ViT and its variants) have also emerged as an alternative architecture, modeling images with self-attention mechanisms. These models now achieve superhuman performance on certain benchmarks of image classification.Computer vision technology has far-reaching applications. In autonomous vehicles, vision systems detect lane markings, traffic signs, and pedestrians to navigate safely. In medical imaging, computer vision algorithms assist in analyzing X-rays, MRIs, or histopathology slides to detect anomalies or disease (e.g., tumor detection). In security and surveillance, vision is used for motion detection, intruder detection, and face recognition.

Explore Our Products

Lightly One

Data Selection & Data Viewer

Get data insights and find the perfect selection strategy

Learn More

Lightly Train

Self-Supervised Pretraining

Leverage self-supervised learning to pretrain models

Learn More

Lightly Edge

Smart Data Capturing on Device

Find only the most valuable data directly on devide

Learn More

Ready to Get Started?

Experience the power of automated data curation with Lightly

Learn More