Cameron R. Wolfe, Ph.D. in Towards Data Science · A Practitioners Guide to Retrieval Augmented Generation (RAG) · How basic techniques can be used to build powerful applications with LLMs… · 27 min read · Mar 26, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · The Basics of AI-Powered (Vector) Search · How the modern AI boom has completely revolutionized search applications… · 32 min read · Mar 18, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Explaining ChatGPT to Anyone in <20 Minutes · Distilling the core components of generative LLMs into an accessible framework… · 14 min read · Mar 14, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More · Training a specialized LLM over your own data is easier than you think… · 31 min read · Mar 9, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · The Story of RLHF: Origins, Motivations, Techniques, and Modern Applications · How learning from human feedback revolutionized generative language models… · 31 min read · Feb 29, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Proximal Policy Optimization (PPO): The Key to LLM Alignment · Modern policy gradient algorithms and their application to language models… · 18 min read · Feb 15, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Policy Gradients: The Foundation of RLHF · Understanding policy optimization and how it is used in reinforcement learning · 15 min read · Feb 6, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Basics of Reinforcement Learning for LLMs · Understanding the problem formulation and basic algorithms for RL · 18 min read · Jan 31, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · RLAIF: Reinforcement Learning from AI Feedback · Making alignment via RLHF more scalable by automating human feedback… · 18 min read · Jan 23, 2024
Cameron R. Wolfe, Ph.D. in Towards Data Science · Supervised Fine-Tuning (SFT) with Large Language Models · Understanding how SFT works from idea to a working implementation… · 15 min read · Jan 16, 2024