The Hundred-Page Language Models Book (True Positive )
Monday, 03 February 2025

This book by Andriy Burkov is the PyTorch follow-up to his bestselling The Hundred-Page Machine Learning Book. This book offers a concise yet thorough journey from language modeling fundamentals to the cutting edge of modern Large Language Models (LLMs). Written in a hands-on style with working Python code examples, this book progressively builds understanding from basic machine learning concepts to advanced language model architectures. All code examples run on Google Colab, making it accessible to anyone with a modern laptop.

 

Author: Andriy Burkov
Publisher: True Positive Inc.
Date: January 2025
Pages: 156
ISBN: 978-1778042720
Print: 1778042724
Kindle: ‎ B0DT413SKG
Audience: Developers interested in AI
Level: Introductory/Intermediate
Category: Artificial Intelligence

hundred

Topics include:

  • Master the mathematical foundations of modern machine learning and neural networks
  • Build and train three architectures of language models in Python
  • Understand and code a Transformer language model from scratch in PyTorch
  • Work with LLMs, including instruction finetuning and prompt engineering

For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.

For more Book Watch just click.

Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.

To have new titles included in Book Watch contact  BookWatch@i-programmer.info

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

 

 

Banner


SQL Server Query Tuning and Optimization (Packt)

Author: Benjamin Nevarez
Publisher: Packt Publishing Pages: 446
ISBN: 9781803242620
Print: 1803242620
Kindle: B0B42SVBFY
Audience: Intermediate to advanced DBAs and developers
Rating: 4.7
Reviewer: Ian Stirk 

This book aims to give you the tools and knowledge to get peak performance from your que [ ... ]



Understanding Software Dynamics (Addison-Wesley)

Author: Richard L. Sites
Publisher: Addison-Wesley
Pages: 464
ISBN: 978-0137589739
Print: 0137589735
Kindle: B09H5JB5HC
Audience: Every developers
Rating: 5
Reviewer: Kay Ewbank

This book looks at the different reasons why software runs too slowly, and what developers can do about it, starting by looki [ ... ]


More Reviews