~ nghia-pham.dev _

grep -r "llm"

70 posts tagged llm

LLM from Zero: Series Plan Apr 22, 2026 ~6 min read
AI Agents from Zero: Series Plan May 18, 2026 ~5 min read
AI Coding Providers Series: Choosing the right plan for your workload Apr 21, 2026 ~1 min read
Which AI coding plan should you buy? Researching 5 major providers (2026-04) Apr 21, 2026 ~11 min read
Does Vietnamese cost more than 2x the tokens? The data says otherwise Apr 21, 2026 ~14 min read
Does Vietnamese really cost 2x+ tokens in LLM prompts? Data from 5626 real messages Apr 21, 2026 ~13 min read
How LLMs work: a mental model for devs Apr 22, 2026 ~16 min read
Calculus for LLMs: gradient, chain rule, backprop intuition Apr 22, 2026 ~9 min read
Linear algebra for LLMs: vector, matrix, dot product Apr 22, 2026 ~13 min read
A minimal neural network: perceptron and MLP from zero Apr 22, 2026 ~12 min read
Probability for LLMs: softmax, cross-entropy, perplexity Apr 22, 2026 ~11 min read
Build a BPE tokenizer from scratch (following Karpathy's minbpe) Apr 22, 2026 ~12 min read
Attention mechanism: Query, Key, Value intuition Apr 22, 2026 ~11 min read
Embeddings: word2vec, contextual, and positional encoding (RoPE) Apr 22, 2026 ~11 min read
Multi-head attention: why split into multiple heads Apr 22, 2026 ~13 min read
nanoGPT: recreating GPT from scratch in 300 lines of PyTorch Apr 22, 2026 ~12 min read
Self-attention: coding it from scratch in NumPy Apr 22, 2026 ~10 min read
Transformer block: attention + MLP + layer norm + residual Apr 22, 2026 ~13 min read
Tokenization: BPE, WordPiece, SentencePiece Apr 22, 2026 ~14 min read
Local AI agents: old pattern, new blast radius May 11, 2026 ~10 min read
Distributed training: DP, DDP, FSDP, pipeline parallel May 17, 2026 ~9 min read
DPO and RLHF: alignment with preference data May 17, 2026 ~9 min read
Evaluation: MMLU, GSM8K, HumanEval, custom benchmark May 17, 2026 ~10 min read
Hands-on: fine-tune Llama-3 on a Vietnamese dataset with LoRA, $20 GPU May 17, 2026 ~9 min read
KV cache and PagedAttention: increasing inference throughput May 17, 2026 ~10 min read
LLM Agents: ReAct, tool use, planning, multi-step reasoning May 17, 2026 ~10 min read
Long context: RoPE scaling, YaRN, ALiBi extrapolation May 17, 2026 ~9 min read
LoRA and QLoRA: parameter-efficient fine-tuning May 17, 2026 ~9 min read
Mixed precision (FP16, BF16) and gradient checkpointing May 17, 2026 ~9 min read
Mixture of Experts (MoE): Mixtral, DeepSeek architecture May 17, 2026 ~9 min read
Quantization: INT8, INT4, GGUF, AWQ, and BitNet 1.58-bit May 17, 2026 ~8 min read
RAG: retrieval-augmented generation from vector DB to prompt May 17, 2026 ~10 min read
Reasoning models: o1, R1, chain-of-thought training May 17, 2026 ~9 min read
Scaling laws: Chinchilla, compute-optimal, data efficient May 17, 2026 ~8 min read
Serving frameworks: vLLM, llama.cpp, Ollama, bitnet.cpp compared May 17, 2026 ~9 min read
SFT: supervised fine-tuning with an instruction dataset May 17, 2026 ~8 min read
Training loop: forward, backward, optimizer, lr schedule May 17, 2026 ~9 min read
30 LLM posts written with agents in 1 month: what worked, what didn't, ~0.5M tokens May 18, 2026 ~9 min read
What is an agent: LLM plus tools plus memory plus loop May 18, 2026 ~8 min read
Control loop: ReAct, agentic loop, stopping conditions May 18, 2026 ~10 min read
Chain-of-Thought versus structured reasoning May 18, 2026 ~10 min read
Build an agent from scratch: 100 lines of Python with the Anthropic SDK May 18, 2026 ~10 min read
Memory for agents: context window, scratchpad, summarization May 18, 2026 ~10 min read
Plan-and-Execute: separating planning from execution May 18, 2026 ~11 min read
Tree of Thoughts and tree search for agents May 18, 2026 ~10 min read
Agent communication: shared state versus message passing May 18, 2026 ~10 min read
Eval for agents: trace, replay, golden set, regression May 18, 2026 ~12 min read
Cost and latency: token budget, streaming, prompt caching May 18, 2026 ~12 min read
Failure modes: hallucination, infinite loop, hijacking May 18, 2026 ~12 min read
On-call for agents: monitoring, alerts, rollback, A/B test May 18, 2026 ~14 min read
Security: prompt injection, tool sandboxing, secrets May 18, 2026 ~12 min read
Case study: Anthropic SDK agents and Claude Code agents May 18, 2026 ~10 min read
Browser automation for agents: Playwright and computer use May 18, 2026 ~11 min read
Code execution sandbox: subprocess, Docker, e2b May 18, 2026 ~11 min read
LangGraph, CrewAI, AutoGen: a framework comparison May 18, 2026 ~10 min read
MCP (Model Context Protocol): standardizing the tool layer May 18, 2026 ~9 min read
Multi-agent patterns: supervisor, handoff, debate May 18, 2026 ~11 min read
RAG for agents: retrieval inside the loop, not QA May 18, 2026 ~10 min read
ReAct: thought, action, observation cycle May 18, 2026 ~11 min read
Self-reflection: critic, verifier, retry pattern May 18, 2026 ~10 min read
Specialized agent roles: planner, executor, reviewer May 18, 2026 ~10 min read
Tool design: schema, validation, idempotency May 18, 2026 ~11 min read
Tool use basics: function calling, JSON schema, error handling May 18, 2026 ~14 min read
Hermes Agent: self-learning AI, persistent memory, runs on a $5 VPS May 18, 2026 ~12 min read
OpenClaw: the open-source agent framework leading 2026 May 18, 2026 ~11 min read
Comparing LLM API prices: DeepSeek, MiniMax, Doubao, Kimi, and a few traps when calculating cost Jun 3, 2026 ~8 min read
DeepSeek V4 Flash vs Claude Haiku 4.5: cheaper doesn't necessarily mean it's the one to pick Jun 8, 2026 ~10 min read
Xiaomi MiMo v2.5: a Chinese LLM newcomer worth trying Jun 9, 2026 ~6 min read
DeepSeek V4 Flash in opencode: 18 days, 431 sessions, $22 Jun 11, 2026 ~8 min read
DeepSeek V4 Pro in practice: 18 days, 431 sessions, $22 Jun 11, 2026 ~6 min read

$ echo "built with Astro"

© 2026 Nghia Pham | RSS | GitHub | nghia-pham.com