{"product_id":"build-a-large-language-model-from-scratch-sebastian-raschka","title":"Build a Large Language Model (From Scratch) – Sebastian Raschka |9781633437166|","description":"\u003cp data-path-to-node=\"7\"\u003e\u003cb data-path-to-node=\"7\" data-index-in-node=\"0\"\u003eBuild a Large Language Model (From Scratch)\u003c\/b\u003e is the definitive, hands-on guide for engineers who want to demystify the technology behind ChatGPT and Claude. Written by renowned educator Sebastian Raschka, this book takes you on a step-by-step journey through the creation of a fully functional LLM using Python and PyTorch. By building every component from the ground up, you gain an unparalleled understanding of how these massive models process language and generate human-like text.\u003c\/p\u003e\n\u003ch3 data-path-to-node=\"8\"\u003eAbout the Book\u003c\/h3\u003e\n\u003cp data-path-to-node=\"9\"\u003eThis Manning publication provides a clear, technical blueprint for the \"Transformer\" architecture that powers the modern AI revolution. \u003cb data-path-to-node=\"9\" data-index-in-node=\"136\"\u003eBuild a Large Language Model (From Scratch)\u003c\/b\u003e avoids the \"black box\" approach by requiring you to code the attention mechanisms, data loaders, and training loops yourself. Raschka’s pedagogical style simplifies complex concepts like BPE tokenization and multi-head self-attention, making this the perfect resource for developers ready to transition from AI consumers to AI creators.\u003c\/p\u003e\n\u003ch3 data-path-to-node=\"10\"\u003eWhat You’ll Learn \/ Why Read\u003c\/h3\u003e\n\u003cp data-path-to-node=\"11\"\u003e\u003cb data-path-to-node=\"11\" data-index-in-node=\"0\"\u003eBuild a Large Language Model (From Scratch)\u003c\/b\u003e teaches you the full lifecycle of an LLM. You will learn how to prepare massive datasets for training, implement the core Transformer layers, and load pre-trained weights into your custom-built architecture. The book also covers the essential final steps: fine-tuning your model for specific tasks and using human feedback to improve its performance. This is an essential read for software architects, data scientists, and curious programmers who want to stay at the cutting edge of the AI landscape.\u003c\/p\u003e\n\u003ch3 data-path-to-node=\"12\"\u003eAuthor Bio\u003c\/h3\u003e\n\u003cp data-path-to-node=\"13\"\u003eSebastian Raschka, PhD, is a machine learning researcher and the author of several bestselling books on Python and AI. He is known for his work in the open-source community and his ability to explain high-level research in a way that is accessible to practical developers.\u003c\/p\u003e\n\u003ch3 data-path-to-node=\"14\"\u003eProduct Details\u003c\/h3\u003e\n\u003cul data-path-to-node=\"15\"\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,0,0\"\u003e\u003cb data-path-to-node=\"15,0,0\" data-index-in-node=\"0\"\u003eAuthor:\u003c\/b\u003e Sebastian Raschka\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,1,0\"\u003e\u003cb data-path-to-node=\"15,1,0\" data-index-in-node=\"0\"\u003ePublisher:\u003c\/b\u003e Manning Publications\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,2,0\"\u003e\u003cb data-path-to-node=\"15,2,0\" data-index-in-node=\"0\"\u003eLanguage:\u003c\/b\u003e English\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,3,0\"\u003e\u003cb data-path-to-node=\"15,3,0\" data-index-in-node=\"0\"\u003eFormat:\u003c\/b\u003e Paperback\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,4,0\"\u003e\u003cb data-path-to-node=\"15,4,0\" data-index-in-node=\"0\"\u003eISBN-13:\u003c\/b\u003e 978-1633437166\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,5,0\"\u003e\u003cb data-path-to-node=\"15,5,0\" data-index-in-node=\"0\"\u003eGenre:\u003c\/b\u003e Computers \/ Artificial Intelligence \/ Data Science\u003c\/p\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cp data-path-to-node=\"15,6,0\"\u003e\u003cb data-path-to-node=\"15,6,0\" data-index-in-node=\"0\"\u003ePages:\u003c\/b\u003e 350+ Pages\u003c\/p\u003e\n\u003c\/li\u003e\n\u003c\/ul\u003e\n\u003ch3 data-path-to-node=\"16\"\u003eWhy Buy from Nybookshub\u003c\/h3\u003e\n\u003cp data-path-to-node=\"17\"\u003e\u003cb data-path-to-node=\"17\" data-index-in-node=\"0\"\u003eAI Researchers\u003c\/b\u003e and engineers choose Nybookshub because we provide 100% authentic editions from Manning Publications. In a field as complex as Large Language Models, the clarity of code blocks and the precision of architecture diagrams are critical to your success; we ensure you receive a verified printing that is as durable as it is detailed. Our global shipping network ensures that Sebastian Raschka’s essential guide reaches innovators from Silicon Valley to Bangalore. At Nybookshub, we empower the creators of the next generation of intelligence.\u003c\/p\u003e\n\u003ch3 data-path-to-node=\"18\"\u003eQuestions \u0026amp; Answers\u003c\/h3\u003e\n\u003cp data-path-to-node=\"19\"\u003e\u003cb data-path-to-node=\"19\" data-index-in-node=\"0\"\u003eDo I need a supercomputer to follow this book?\u003c\/b\u003e No, the book is designed so that the core components can be built and tested on standard consumer hardware (laptops with modest GPUs or cloud-based environments).\u003c\/p\u003e\n\u003cp data-path-to-node=\"20\"\u003e\u003cb data-path-to-node=\"20\" data-index-in-node=\"0\"\u003eIs the code written in PyTorch or TensorFlow?\u003c\/b\u003e The book focuses entirely on PyTorch, which is the industry standard for LLM research and development.\u003c\/p\u003e\n\u003cp data-path-to-node=\"21\"\u003e\u003cb data-path-to-node=\"21\" data-index-in-node=\"0\"\u003eDoes Nybookshub ship this internationally?\u003c\/b\u003e Yes, we offer fast, tracked global shipping to ensure developers everywhere can master LLM construction.\u003c\/p\u003e\n\u003cp data-path-to-node=\"22\"\u003e\u003cb data-path-to-node=\"22\" data-index-in-node=\"0\"\u003eDoes it cover the math behind self-attention?\u003c\/b\u003e Yes, but it does so through a \"code-first\" approach, explaining the linear algebra by implementing it in Python.\u003c\/p\u003e\n\u003cp data-path-to-node=\"23\"\u003e\u003cb data-path-to-node=\"23\" data-index-in-node=\"0\"\u003eWill I be able to build a model as big as GPT-4?\u003c\/b\u003e While the book teaches you the \u003ci data-path-to-node=\"23\" data-index-in-node=\"80\"\u003earchitecture\u003c\/i\u003e of such models, it focuses on building a smaller, manageable version (GPT-2 scale) that you can actually train and run yourself.\u003c\/p\u003e","brand":"NYBooksHub","offers":[{"title":"Default Title","offer_id":48225669513471,"sku":null,"price":39.24,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0794\/2809\/2159\/files\/build-a-large-language-model-from-scratch-sebastian-raschka.jpg?v=1774289337","url":"https:\/\/nybookshub.com\/products\/build-a-large-language-model-from-scratch-sebastian-raschka","provider":"rajvinder kaur","version":"1.0","type":"link"}