Unsupervised learning is a machine learning paradigm in which algorithms discover hidden patterns or structures in data that has no predefined labels. Instead of learning from correct answers, the model identifies inherent groupings, densities, or low-dimensional representations within the data. Common applications include customer segmentation, anomaly detection, topic modeling, and dimensionality reduction.
| Technique | Goal | Algorithm Example | Typical Use Case |
|---|---|---|---|
| Clustering | Group similar points | K-Means, DBSCAN | Customer segmentation |
| Dimensionality Reduction | Compress features | PCA, t-SNE, UMAP | Data visualization |
| Density Estimation | Model data distribution | GMM, KDE | Anomaly detection |
| Association Rules | Find co-occurrence rules | Apriori, FP-Growth | Market basket analysis |
| Generative Modeling | Learn data distribution | VAE, GAN | Synthetic data generation |
Wikimedia Commons, CC BY-SA
Machine learning is a branch of artificial intelligence in which systems learn from data to improve their performance on tasks without being explicitly programmed for each task. It works by identifying statistical patterns in training data and using those patterns to make predictions or decisions on new, unseen data. Machine learning powers applications ranging from spam filters and recommendation engines to medical diagnosis and autonomous vehicles.
Supervised learning is a machine learning approach where a model is trained on a labeled dataset, meaning each training example is paired with the correct output (label). The model learns a mapping from inputs to outputs by minimizing the difference between its predictions and the true labels. It is the most widely used ML paradigm and underpins applications such as image recognition, speech transcription, and credit scoring.
A neural network is a computational model loosely inspired by the structure of biological brains, consisting of layers of interconnected nodes (neurons) that process and transform data. Each neuron computes a weighted sum of its inputs, applies a non-linear activation function, and passes the result to the next layer. Neural networks are the foundation of modern AI and are capable of learning highly complex patterns in images, text, audio, and tabular data.
"Unsupervised" is formed by the prefix "un-" (Old English, meaning not) plus "supervised" (from Latin "supervidere," to oversee). The contrast with supervised learning was formalized by researchers including Geoff Hinton and Yann LeCun in the 1980s.