Published in TDS Archive | A Practitioners Guide to Retrieval Augmented Generation (RAG) | How basic techniques can be used to build powerful applications with LLMs… | Mar 26, 2024
Published in TDS Archive | The Basics of AI-Powered (Vector) Search | How the modern AI boom has completely revolutionized search applications… | Mar 18, 2024
Published in TDS Archive | Explaining ChatGPT to Anyone in <20 Minutes | Distilling the core components of generative LLMs into an accessible framework… | Mar 14, 2024
Published in TDS Archive | Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More | Training a specialized LLM over your own data is easier than you think… | Mar 9, 2024
Published in TDS Archive | The Story of RLHF: Origins, Motivations, Techniques, and Modern Applications | How learning from human feedback revolutionized generative language models… | Feb 29, 2024
Published in TDS Archive | Proximal Policy Optimization (PPO): The Key to LLM Alignment | Modern policy gradient algorithms and their application to language models… | Feb 15, 2024
Published in TDS Archive | Policy Gradients: The Foundation of RLHF | Understanding policy optimization and how it is used in reinforcement learning | Feb 6, 2024
Published in TDS Archive | Basics of Reinforcement Learning for LLMs | Understanding the problem formulation and basic algorithms for RL | Jan 31, 2024
Published in TDS Archive | RLAIF: Reinforcement Learning from AI Feedback | Making alignment via RLHF more scalable by automating human feedback… | Jan 23, 2024
Published in TDS Archive | Supervised Fine-Tuning (SFT) with Large Language Models | Understanding how SFT works from idea to a working implementation… | Jan 16, 2024