Self Supervised and Semi Supervised Learning - tech9tel/ai GitHub Wiki

๐Ÿคน Self-Supervised & Semi-Supervised Learning

Explore the middle-ground approaches between supervised and unsupervised learning. These methods use unlabeled data in smart ways to boost learning.

๐Ÿ“˜ Definition:

  • Self-Supervised Learning: Leverages unlabeled data by generating labels from the data itself.
  • Semi-Supervised Learning: Combines a small amount of labeled data with a large amount of unlabeled data.

๐ŸŽฏ Goal: Reduce the need for expensive manual labeling.


๐Ÿ”„ Self-Supervised Learning

Learn from data without human-labeled examples.

  • ๐Ÿงฉ Definition: Learns from unlabeled data by creating labels from the data itself.
  • ๐Ÿง  Analogy: Like solving a jigsaw puzzle without the box cover โ€” the learner figures it out through structure and clues.
  • ๐Ÿ“˜ Technical Insight: A pretext task is designed (e.g., predicting missing words in a sentence) so the model learns representations that are useful for downstream tasks.
  • ๐Ÿ” Example Tasks: Image colorization, sentence completion, contrastive learning.

๐ŸŒ“ Semi-Supervised Learning

Blend of a few labeled samples + many unlabeled ones.

  • ๐Ÿท๏ธ Definition: Uses a small set of labeled data and a large set of unlabeled data to improve learning accuracy.
  • ๐Ÿง  Analogy: A student with a few solved examples (labeled) learns to solve the rest on their own (unlabeled).
  • ๐Ÿ“˜ Technical Insight: The model first learns from the labeled data, then generalizes patterns using the structure of unlabeled data.
  • ๐Ÿ” Example Use Cases: Text classification with limited annotations, medical imaging with a few labeled scans.

๐Ÿ“Š Machine Learning Types โ€“ Summary Table

Learning Type Input Data Labels Used? Key Idea Common Use Cases Examples
๐Ÿง  Supervised Labeled data โœ… Yes Learn mapping from input to known output Classification, Regression Spam detection, House price prediction
๐Ÿ” Unsupervised Unlabeled data โŒ No Find hidden patterns or structure Clustering, Dimensionality Reduction Customer segmentation, PCA
๐ŸŽฎ Reinforcement Agent in environment โŒ No (rewards) Learn actions via rewards & penalties Game AI, Robotics, Self-driving cars AlphaGo, Robot arm training
๐Ÿงฉ Self-Supervised Raw data (generates own labels) โš ๏ธ Indirect Predict parts of data from other parts Pretraining LLMs, Contrastive Learning BERT, SimCLR, GPT Pretraining
๐ŸŒ“ Semi-Supervised Small labeled + large unlabeled dataset โš ๏ธ Partial Use few labels + structure from data NLP, Medical imaging, Fraud Detection Pseudo-labeling, Mean Teacher