Text generation inference generally refers to the process of generating text from a trained language model given some input, and more specifically to frameworks that serve such models efficiently. In particular, Text Generation Inference (TGI) is the name of an open-source toolkit for deploying and serving large language models, released by Hugging Face. TGI provides a high-performance inference server (written in Rust and Python) that is optimized for text generation workloads: it supports features such as model sharding, batch scheduling of requests, and streaming token output, enabling deployment of LLMs (like BLOOM and GPT-style models) in production with low latency. The goal is to maximize throughput and hardware utilization when many generation requests arrive concurrently.

More broadly, discussions of text generation inference often concern how a model like GPT-3 or GPT-4 is used at inference time: feeding it a prompt and decoding the output text using strategies such as greedy decoding, beam search, or nucleus (top-p) sampling. Specialized inference engines (like Hugging Face's TGI or OpenAI's hosted inference) matter because large models are resource-intensive; they ensure that the model produces text responses efficiently and can scale to many users.
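To make the decoding strategies concrete, here is a minimal sketch of nucleus (top-p) sampling in pure Python, operating on a toy next-token probability distribution. The function name `nucleus_sample` and the example vocabulary are illustrative, not part of any particular library's API; real inference servers apply the same idea to full vocabulary-sized logit tensors on the GPU.

```python
import random

def nucleus_sample(probs, p=0.9, rng=None):
    """Sample a token index from the smallest set of tokens whose
    cumulative probability reaches p (nucleus / top-p sampling)."""
    rng = rng or random.Random()
    # Order token indices by descending probability.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    cumulative, nucleus = 0.0, []
    for i in order:
        nucleus.append(i)
        cumulative += probs[i]
        if cumulative >= p:
            break  # the nucleus now covers probability mass >= p
    # Renormalize within the nucleus and draw one token.
    total = sum(probs[i] for i in nucleus)
    weights = [probs[i] / total for i in nucleus]
    return rng.choices(nucleus, weights=weights, k=1)[0]

# Toy next-token distribution over a 5-token vocabulary.
probs = [0.5, 0.3, 0.1, 0.07, 0.03]
token = nucleus_sample(probs, p=0.8, rng=random.Random(0))
```

With `p=0.8` only the two most likely tokens (cumulative mass 0.5 + 0.3) form the nucleus, so the sample is always token 0 or 1; greedy decoding is the limiting case where the nucleus shrinks to the single most likely token.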