Content mirrored for search engine indexing from:

https://github.com/Paper-Reading-Study/2025/wiki/Home

📅 Last Modified: Mon, 19 May 2025 13:07:22 GMT

Home - Paper-Reading-Study/2025 GitHub Wiki

2025

Prompt Template

May 2025

Group Normalization
Continuous Thought Machines
AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
FunSearch: Making new discoveries in mathematical sciences using Large Language Models
Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Proximal Policy Optimization
Layers at Similar Depths Generate Similar Activations Across LLM Architectures
Trust Region Policy Optimization

Apr 2025

Dropout: A Simple Way to Prevent Neural Networks from Overfitting
Neural Machine Translation by Jointly Learning to Align and Translate
Solving Olympiad Geometry Without Human Demonstrations (AlphaGeometry)
Welcome to the Era of Experience
Inference-Time Scaling for Generalist Reward Modeling
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
On the Biology of a Large Language Model
KAN: Kolmogorov-Arnold Networks

Mar 2025

The Llama 3 Herd of Models
Denoising Diffusion Probabilistic Models
Neural Discrete Representation Learning (VQ-VAE)
Deep Dive into LLMs like ChatGPT | YouTube

Feb 2025

Generative Adversarial Nets
Auto-Encoding Variational Bayes (VAE)
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
RoFormer: Enhanced Transformer with Rotary Position Embedding
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Robust Speech Recognition via Large-Scale Weak Supervision (Whisper)
Learning Transferable Visual Models From Natural Language Supervision (CLIP)

Jan 2025

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (ViT)
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Training Large Language Models to Reason in a Continuous Latent Space
Attention is All You Need
