Skip to content

john-wd/real-estate-scrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real estate ad scrapper and notifier

This quick and simple scrapy application crawls some well known real estate websites to fetch new listings and notify me through a Telegram bot so I can track fresh deals that may arise.

It is supposed to be scheduled in a cronjob to run with some recurrence.

As of now, it has the following spiders implemented:

  • OLX: general product listing; since it works by given a query ID, you can track any kind of products listed in the website
  • Vivareal
  • Zapimoveis

The last two are big real estate aggregators that contains listings for may different agencies throughout Brazil.

External dependencies

The crawler works with selenium using the Firefox headless driver, so please install both firefox and geckodriver to your distro.

Installation

First clone this repo and set up a new virtual environment

$ python3 -m venv venv
$ source venv/bin/activate

Install dependencies

(venv) $ pip install -r requirements.txt

Create a custom telegram bot and configure it in a .env file in the root of this repo.

Run the application

(venv) $ ./crawl.sh

Configuring cronjob

crawl.sh is really an utility script that enters the virtualenv and runs the spiders configured in it, so you can just configure your crontab to invoke it.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published