Published inTDS ArchiveA Practitioners Guide to Retrieval Augmented Generation (RAG)How basic techniques can be used to build powerful applications with LLMs…Mar 26, 20246Mar 26, 20246
Published inTDS ArchiveThe Basics of AI-Powered (Vector) SearchHow the modern AI boom has completely revolutionized search applications…Mar 18, 20242Mar 18, 20242
Published inTDS ArchiveExplaining ChatGPT to Anyone in <20 MinutesDistilling the core components of generative LLMs into an accessible framework…Mar 14, 2024Mar 14, 2024
Published inTDS ArchiveEasily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and MoreTraining a specialized LLM over your own data is easier than you think…Mar 9, 20241Mar 9, 20241
Published inTDS ArchiveThe Story of RLHF: Origins, Motivations, Techniques, and Modern ApplicationsHow learning from human feedback revolutionized generative language models…Feb 29, 2024Feb 29, 2024
Published inTDS ArchiveProximal Policy Optimization (PPO): The Key to LLM AlignmentModern policy gradient algorithms and their application to language models…Feb 15, 2024Feb 15, 2024
Published inTDS ArchivePolicy Gradients: The Foundation of RLHFUnderstanding policy optimization and how it is used in reinforcement learningFeb 6, 2024Feb 6, 2024
Published inTDS ArchiveBasics of Reinforcement Learning for LLMsUnderstanding the problem formulation and basic algorithms for RLJan 31, 20241Jan 31, 20241
Published inTDS ArchiveRLAIF: Reinforcement Learning from AI FeedbackMaking alignment via RLHF more scalable by automating human feedback…Jan 23, 20241Jan 23, 20241
Published inTDS ArchiveSupervised Fine-Tuning (SFT) with Large Language ModelsUnderstanding how SFT works from idea to a working implementation…Jan 16, 20243Jan 16, 20243