Self Supervised and Semi Supervised Learning - tech9tel/ai GitHub Wiki
๐คน Self-Supervised & Semi-Supervised Learning
Explore the middle-ground approaches between supervised and unsupervised learning. These methods use unlabeled data in smart ways to boost learning.
๐ Definition:
- Self-Supervised Learning: Leverages unlabeled data by generating labels from the data itself.
- Semi-Supervised Learning: Combines a small amount of labeled data with a large amount of unlabeled data.
๐ฏ Goal: Reduce the need for expensive manual labeling.
๐ Self-Supervised Learning
Learn from data without human-labeled examples.
- ๐งฉ Definition: Learns from unlabeled data by creating labels from the data itself.
- ๐ง Analogy: Like solving a jigsaw puzzle without the box cover โ the learner figures it out through structure and clues.
- ๐ Technical Insight: A pretext task is designed (e.g., predicting missing words in a sentence) so the model learns representations that are useful for downstream tasks.
- ๐ Example Tasks: Image colorization, sentence completion, contrastive learning.
๐ Semi-Supervised Learning
Blend of a few labeled samples + many unlabeled ones.
- ๐ท๏ธ Definition: Uses a small set of labeled data and a large set of unlabeled data to improve learning accuracy.
- ๐ง Analogy: A student with a few solved examples (labeled) learns to solve the rest on their own (unlabeled).
- ๐ Technical Insight: The model first learns from the labeled data, then generalizes patterns using the structure of unlabeled data.
- ๐ Example Use Cases: Text classification with limited annotations, medical imaging with a few labeled scans.
๐ Machine Learning Types โ Summary Table
Learning Type | Input Data | Labels Used? | Key Idea | Common Use Cases | Examples |
---|---|---|---|---|---|
๐ง Supervised | Labeled data | โ Yes | Learn mapping from input to known output | Classification, Regression | Spam detection, House price prediction |
๐ Unsupervised | Unlabeled data | โ No | Find hidden patterns or structure | Clustering, Dimensionality Reduction | Customer segmentation, PCA |
๐ฎ Reinforcement | Agent in environment | โ No (rewards) | Learn actions via rewards & penalties | Game AI, Robotics, Self-driving cars | AlphaGo, Robot arm training |
๐งฉ Self-Supervised | Raw data (generates own labels) | โ ๏ธ Indirect | Predict parts of data from other parts | Pretraining LLMs, Contrastive Learning | BERT, SimCLR, GPT Pretraining |
๐ Semi-Supervised | Small labeled + large unlabeled dataset | โ ๏ธ Partial | Use few labels + structure from data | NLP, Medical imaging, Fraud Detection | Pseudo-labeling, Mean Teacher |