智能助手网
Tag aggregation: Llama


hnrss.org · 2026-04-15 22:01:34+08:00 · tech

I'm a software engineer who works with LLMs professionally (Forward Deployed Engineer at TrueFoundry). Over the past year I built up implementations of five LLM architectures from scratch and wrote a book around them.

The progression:
- Ch1: Vanilla encoder-decoder transformer (English to Hindi translation)
- Ch2: GPT-2 124M from scratch, loads real OpenAI pretrained weights
- Ch3: Llama 3.2-3B by swapping 4 components of GPT-2 (LayerNorm to RMSNorm, learned PE to RoPE, GELU to SwiGLU, MHA to GQA), loads Meta's pretrained weights
- Ch4: KV cache, MQA, GQA (inference optimisation)
- Ch5: DeepSeek MLA (absorption trick, decoupled RoPE), DeepSeekMoE, Multi-Token Prediction, FP8 quantisation

All code is open source: https://github.com/S1LV3RJ1NX/mal-code

The book provides the explanations, derivations, diagrams, and narrative: https://leanpub.com/adventures-with-llms (free sample available)

I wrote it because most resources stop at GPT-2 and I wanted something that covered what's actually in production models today. Happy to answer questions about any of the implementations.

Comments URL: https://news.ycombinator.com/item?id=47779084
Points: 2
# Comments: 0
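For readers unfamiliar with the four component swaps named in Ch3, here is a minimal pure-Python sketch of what each one computes on a single vector. It is not taken from the linked repository or the book. All function names, the default `eps` and `base` values, the interleaved pairing used in `rope`, and the head counts in the usage lines are assumptions chosen for illustration only.

```python
import math

def layer_norm(x, eps=1e-5):
    """LayerNorm without learned scale/shift: subtract mean, divide by std."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]

def rms_norm(x, weight=None, eps=1e-5):
    """RMSNorm: no mean subtraction, divide by the root mean square only."""
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    w = weight or [1.0] * len(x)  # learned per-channel scale, identity here
    return [wi * v / rms for wi, v in zip(w, x)]

def rope(x, pos, base=10000.0):
    """Rotary position embedding: rotate each (even, odd) pair by an angle
    that depends on the token position. Interleaved pairing is an assumption;
    some implementations pair the first half with the second half instead."""
    d = len(x)
    out = list(x)
    for i in range(0, d, 2):
        theta = pos * base ** (-i / d)
        c, s = math.cos(theta), math.sin(theta)
        out[i] = x[i] * c - x[i + 1] * s
        out[i + 1] = x[i] * s + x[i + 1] * c
    return out

def silu(v):
    """SiLU (swish) activation: v * sigmoid(v)."""
    return v / (1.0 + math.exp(-v))

def swiglu(gate, up):
    """SwiGLU gating: SiLU of the gate projection times the up projection.
    The linear projections producing `gate` and `up` are omitted."""
    return [silu(g) * u for g, u in zip(gate, up)]

def gqa_kv_head(q_head, n_q_heads, n_kv_heads):
    """Grouped-query attention: which shared KV head a query head reads."""
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size

# Illustrative usage with made-up sizes (8 query heads sharing 2 KV heads).
normed = rms_norm([3.0, 4.0])
rotated = rope([1.0, 0.0, 0.0, 1.0], pos=3)
kv_index = gqa_kv_head(5, n_q_heads=8, n_kv_heads=2)  # query head 5 uses KV head 1
```

Setting `n_kv_heads` equal to `n_q_heads` in `gqa_kv_head` recovers ordinary multi-head attention, and setting it to 1 gives the MQA case mentioned in Ch4.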

linux.do · 2026-04-13 18:33:29+08:00 · tech

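Yesterday I saw a fellow forum member post a question about Ollama's subscription, and the icons and interface design in the plan screenshot were quite interesting. So I collected the subscription pages of various AI vendors, each with its own style, for everyone to enjoy.

- Cute style: Ollama. Classic white background, plus a little icon of a llama reading a book for each plan.
- Humanistic style: Claude. Warm-toned background with hand-drawn humanistic icons.
- Rigorous style: ChatGPT. Black background; each plan has only a text description and no icon.
- All-in-one bundle style: Google Gemini. White background with Google's custom font. The plans cover too many benefits to fit on one screen, yet the core benefits have been cut back severely.
- Black-and-gold style: GLM (overseas). Black background with gold and silver lighting. The Pro plan has a building-block icon with silver lighting and a small diamond icon. The Max plan's logo is the classic atom icon with gold lighting and a gold crown.
- Promotional style: Zhipu (China domestic). Instantly recognizable as the classic cloud-vendor promotion page, complete with PDD (Pinduoduo) style copy.
- Festive style: MiniMax. A bright red promotional illustration at the top, paired with a relatively restrained plan layout.
- Support-ticket style: Alibaba Cloud Bailian Coding Plan. New purchases currently offer only the Pro plan, so the earlier multi-plan interface has been taken down.
- Musical style: Kimi (Moonshot AI). Hovering over different plans displays a different musical staff, which is a nice touch.

The following is Gemini's explanation of the English names of the different plans: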