LLM - gusenov/kb GitHub Wiki
- Cake is a Rust framework for distributed inference of large models like LLama3 based on Candle.
- YouTube / @TheOfficialACM / Understanding the LLM Development Cycle: Building, Training, and Finetuning
- middleware.io / What is LLM Observability? By Vivek Tilva
MCP (Model Context Protocol)
- The GitHub Blog / What the heck is MCP and why is everyone talking about it? TL;DR: It’s an open standard for connecting LLMs to data and tools.
- GitHub / tadata-org/fastapi_mcp A zero-configuration tool for automatically exposing FastAPI endpoints as Model Context Protocol (MCP) tools.
- KDnuggets.com / Building A Simple MCP Server Give your LLMs the extra ability to fetch live stock prices, compare them, and provide historical analysis by implementation tools within the MCP Server.
From scratch, reinventing a wheel
- freeCodeCamp.org
- Train Your Own LLM
- Understand and Code DeepSeek V3 DeepSeek V3 is a popular Large-Language Model that's a good one to learn if you're interested in AI. And in this course, you'll learn how it works. It covers DeepSeek's Multi-Head Latent Attention mechanism, Mixture of Experts architecture, and lots more.
- YouTube / Build a Large Language Model (From Scratch) by Sebastian Raschka
- TowardsDataScience.com / Training LLM, from Scratch, in Rust
- Medium / Towards Data Science / How to Build Your Own AI: Creating an LLM from Scratch 🤯 by Leo Anello
- GitHub
- naklecha/llama3-from-scratch llama3 implementation one matrix multiplication at a time
- jaymody/picoGPT An unnecessarily tiny implementation of GPT-2 in NumPy.
Benchmarks
- SWE-bench Can Language Models Resolve Real-World GitHub Issues?
- LLMARENA / Оценка систем LLM: основные метрики, бенчмарки и лучшие практики
Courses
- TowardsDataScience.com / The Large Language Model Course
- DeepLearning.AI / Getting Structured LLM Output