- LLM Fun: Building a Q&A Bot of Myself
- Application development with large language models
- Future of this Blog
- Bayesian Learning via Stochastic Gradient Langevin Dynamics and Bayes by Backprop
- Training compute-optimal Perceiver AR language models
- Handy ChatGPT Prompts
- A gentle introduction to Rotary Position Embedding
- An Introduction to Stochastic Calculus
- Is LaMDA Sentient/Sapient/Conscious?
- Mimicking DeepMind's Chinchilla with GPT-3
- Normalizing Flows with Real NVP
- Black Swan events and the new Cold War
- Fault-tolerant spot instance training with PyTorch Lightning on SageMaker
- Multi-node, multi-GPU training with PyTorch Lightning on SageMaker
- Handy GPT-3 Prompts