This repository contains a web scraper designed to extract property listings from Zillow. The scraper retrieves data such as property details, prices, and locations, allowing users to analyze real estate trends and opportunities.
The details tutorial can be found in Rayobyte Community Web Scraping Project page.
This Zillow Scraper consists of TWO parts
The first part consists of scraping these values:
HOUSE URL
, PHOTO URLs
, PRICE
, FULL ADDRESS
, STREET
, CITY
, STATE
, ZIP CODE
, NUMBER OF BEDROOMS
, NUMBER OF BATHROOMS
, HOUSE SIZE
, LOT SIZE
and HOUSE TYPE
The output is stored in OUTPUT_1
folder
The second parts consist of scraping the following values:
YEAR BUILT
, DESCRIPTION
, LISTING DATE
, DAYS ON ZILLOW
, TOTAL VIEWS
, TOTAL SAVED
, REALTOR NAME
, REALTOR CONTACT NO
, AGENCY
, CO-REALTOR NAME
, CO-REALTOR CONTACT NO
and CO-REALTOR AGENCY
The output is stored in OUTPUT_2
folder
To get started with the Zillow Properties for Sale Scraper, follow these steps:
-
Clone the Repository:
git clone https://github.com/ainacodes/zillow_properties_for_sale_scraper.git
-
Navigate to the Directory:
cd zillow_properties_for_sale_scraper
zillow_draft.ipynb
: Consist of step-by-step elements scrape.
first_page.py
: Consist of complete code to scrape the first page that apears from the search resultsall_pages.py
: Consist of complete code to scrape all pages or until which page you want to scrape.
add_other_info.py
: Consist of the complete code with additional data that scrape inside the properties page.add_other_info_proxy_rotate.py
: Consist of the complete code with additional data that scrape inside the properties page with implementations of proxy rotation.
Contributions are welcome! If you have suggestions for improvements or find bugs, please open an issue or submit a pull request.
For any inquiries, please email me at [email protected]