Home - Paper-Reading-Study/2025 GitHub Wiki
2025
May 2025
- Group Normalization
- Continuous Thought Machines
- AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
- FunSearch: Making new discoveries in mathematical sciences using Large Language Models
- Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
- All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
- Proximal Policy Optimization
- Layers at Similar Depths Generate Similar Activations Across LLM Architectures
- Trust Region Policy Optimization
Apr 2025
- Dropout: A Simple Way to Prevent Neural Networks from Overfitting
- Neural Machine Translation by Jointly Learning to Align and Translate
- Solving Olympiad Geometry Without Human Demonstrations (AlphaGeometry)
- Welcome to the Era of Experience
- Inference-Time Scaling for Generalist Reward Modeling
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- On the Biology of a Large Language Model
- KAN: Kolmogorov-Arnold Networks
Mar 2025
- The Llama 3 Herd of Models
- Denoising Diffusion Probabilistic Models
- Neural Discrete Representation Learning (VQ-VAE)
- Deep Dive into LLMs like ChatGPT | Youtube
Feb 2025
- Generative Adversarial Nets
- Auto-Encoding Variational Bayes (VAE)
- NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
- Direct Preference Optimization: Your Language Model is Secretly a Reward Model
- RoFormer: Enhanced Transformer with Rotary Position Embedding
- Mamba: Linear-Time Sequence Modeling with Selective State Spaces
- Robust Speech Recognition via Large-Scale Weak Supervision (Whisper)
- Learning Transferable Visual Models From Natural Language Supervision (CLIP)