Web Scraping with Python 2nd Ed (O'Reilly)
Thursday, 24 May 2018

By writing a simple automated program, you can query web servers, request data, and parse it to extract the information you need. In the expanded edition of this practical book, author Ryan Mitchell not only introduces you to web scraping, but also provides a comprehensive guide to scraping almost every type of data from the modern web.

 

Author: Ryan Mitchell 
Publisher: O'Reilly
Date: April 2018
Pages: 308
ISBN: 978-1491985571
Print: 1491985577
Kindle: B07BMGBYSK
Audience: Students in sciences, engineering, and computer science
Level: Beginners
Category: Web design and development, Python

 

 

  • Parse complicated HTML pages
  • Develop crawlers with the Scrapy framework
  • Learn methods to store data you scrape
  • Read and extract data from documents
  • Clean and normalize badly formatted data
  • Read and write natural languages
  • Crawl through forms and logins
  • Scrape JavaScript and crawl through APIs
  • Use and write image-to-text software
  • Avoid scraping traps and bot blockers
  • Use scrapers to test your website

For recommendations of Python books see Books for Pythonistas and Python Books For Beginners in our Programmer's Bookshelf section.


Book Watch is I Programmer's listing of new books and is compiled using publishers' publicity material. It is not to be read as a review where we provide an independent assessment. Some, but by no means all, of the books in Book Watch are eventually reviewed.

To have new titles included in Book Watch contact BookWatch@i-programmer.info

Follow @bookwatchiprog on Twitter or subscribe to I Programmer's Books RSS feed for each day's new addition to Book Watch and for new reviews.

 

 

 


Learn Enough JavaScript to Be Dangerous

Author: Michael Hartl
Publisher: Addison-Wesley
Date: June 2022
Pages: 304
ISBN: 978-0137843749
Print: 0137843747
Kindle: B09RDSVV7N
Audience: Would-be JavaScript developers
Rating: 2
Reviewer: Mike James
To be dangerous? Is this a good ambition?



Balancing Coupling in Software Design

Author: Vlad Khononov
Publisher: Addison-Wesley
Date: October 2024
Pages: 320
ISBN: 978-0137353484
Print: 0137353480
Kindle: B09RV3Z3TP
Audience: General
Rating: 4.5
Reviewer: Kay Ewbank

This book looks in detail at coupling, the degree of interdependence between software modules, and how to use coupling  [ ... ]

