- Training compute-optimal Perceiver AR language models
- A gentle introduction to Rotary Position Embedding
- An Introduction to Stochastic Calculus
- Is LaMDA Sentient/Sapient/Conscious?
- Mimicking DeepMind's Chinchilla with GPT-3
- Normalizing Flows with Real NVP
- Black Swan events and the new Cold War
- Fault-tolerant spot instance training with PyTorch Lightning on SageMaker
- Multi-node, multi-GPU training with PyTorch Lightning on SageMaker
- Handy GPT-3 Prompts
- Hamiltonian Monte Carlo
- Until Next Time
- CLIP Prompt Engineering for Generative Art
- Lossless Compression with Latent Variable Models using Bits-Back Coding
- Messing with GPT-Neo