Published inTowards Data ScienceA Practitioners Guide to Retrieval Augmented Generation (RAG)How basic techniques can be used to build powerful applications with LLMs…Mar 26, 20246Mar 26, 20246
Published inTowards Data ScienceThe Basics of AI-Powered (Vector) SearchHow the modern AI boom has completely revolutionized search applications…Mar 18, 20242Mar 18, 20242
Published inTowards Data ScienceExplaining ChatGPT to Anyone in <20 MinutesDistilling the core components of generative LLMs into an accessible framework…Mar 14, 2024Mar 14, 2024
Published inTowards Data ScienceEasily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and MoreTraining a specialized LLM over your own data is easier than you think…Mar 9, 20241Mar 9, 20241
Published inTowards Data ScienceThe Story of RLHF: Origins, Motivations, Techniques, and Modern ApplicationsHow learning from human feedback revolutionized generative language models…Feb 29, 2024Feb 29, 2024
Published inTowards Data ScienceProximal Policy Optimization (PPO): The Key to LLM AlignmentModern policy gradient algorithms and their application to language models…Feb 15, 2024Feb 15, 2024
Published inTowards Data SciencePolicy Gradients: The Foundation of RLHFUnderstanding policy optimization and how it is used in reinforcement learningFeb 6, 2024Feb 6, 2024
Published inTowards Data ScienceBasics of Reinforcement Learning for LLMsUnderstanding the problem formulation and basic algorithms for RLJan 31, 20241Jan 31, 20241
Published inTowards Data ScienceRLAIF: Reinforcement Learning from AI FeedbackMaking alignment via RLHF more scalable by automating human feedback…Jan 23, 20241Jan 23, 20241
Published inTowards Data ScienceSupervised Fine-Tuning (SFT) with Large Language ModelsUnderstanding how SFT works from idea to a working implementation…Jan 16, 20243Jan 16, 20243