grep -r "machine-learning"
24 posts tagged machine-learning
- LLM from zero: Series Plan
- How LLMs work: a mental model for devs
- Calculus for LLMs: gradient, chain rule, backprop intuition
- Linear algebra for LLMs: vector, matrix, dot product
- Minimal neural network: perceptron, MLP from zero
- Probability for LLMs: softmax, cross-entropy, perplexity
- Tokenization: BPE, WordPiece, SentencePiece
- Distributed training: DP, DDP, FSDP, pipeline parallel
- DPO and RLHF: alignment with preference data
- Evaluation: MMLU, GSM8K, HumanEval, custom benchmark
- Hands-on: fine-tune Llama-3 on a Vietnamese dataset, LoRA, $20 GPU
- KV cache and PagedAttention: increasing inference throughput
- LLM Agents: ReAct, tool use, planning, multi-step reasoning
- Long context: RoPE scaling, YaRN, ALiBi extrapolation
- LoRA and QLoRA: parameter-efficient fine-tuning
- Mixed precision (FP16, BF16) and gradient checkpointing
- Mixture of Experts (MoE): Mixtral, DeepSeek architecture
- Quantization: INT8, INT4, GGUF, AWQ and BitNet 1.58-bit
- RAG: retrieval-augmented generation from vector DB to prompt
- Reasoning models: o1, R1, chain-of-thought training
- Scaling laws: Chinchilla, compute-optimal, data efficiency
- Serving frameworks: vLLM, llama.cpp, Ollama, bitnet.cpp compared
- SFT: supervised fine-tuning with an instruction dataset
- Training loop: forward, backward, optimizer, lr schedule