Real estate ad scraper and notifier

This quick and simple Scrapy application crawls some well-known real estate websites for new listings and notifies me through a Telegram bot, so I can track fresh deals as they appear.

It is meant to be scheduled as a cron job so that it runs periodically.

As of now, it has the following spiders implemented:

OLX: general product listings; since the spider works from a given query ID, you can track any kind of product listed on the website
Vivareal
Zapimoveis

The last two are big real estate aggregators that contain listings from many different agencies throughout Brazil.

External dependencies

The crawler uses Selenium with the headless Firefox driver, so please install both Firefox and geckodriver on your distro.
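
Before running the spiders, it can help to confirm that both binaries are reachable on the PATH. The following is only a sketch: the helper name `missing_deps` is made up for illustration, and it assumes the binaries are called `firefox` and `geckodriver` on your system.

```shell
# Hypothetical helper: print each required binary that is not found on the PATH.
missing_deps() {
    for bin in "$@"; do
        command -v "$bin" >/dev/null 2>&1 || echo "$bin"
    done
}

# Check the two external dependencies named above.
missing=$(missing_deps firefox geckodriver)
if [ -n "$missing" ]; then
    echo "Missing dependencies: $missing" >&2
else
    echo "All external dependencies found"
fi
```

If something is reported missing, install it with your distro's package manager and run the check again.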

Installation

First, clone this repo and set up a new virtual environment:

$ python3 -m venv venv
$ source venv/bin/activate

Install dependencies

(venv) $ pip install -r requirements.txt

Create a custom Telegram bot and configure it in a .env file at the root of this repo, using .env.example as a template.
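
As a rough sketch of this step, the snippet below copies the `.env.example` file from the repo (assuming it is meant as a template) and then checks a bot token against the Telegram Bot API `getMe` method. The helper `telegram_api_url` and the variable name `BOT_TOKEN` are placeholders for illustration; use whatever names `.env.example` actually defines.

```shell
# Assumption: .env.example is a template for .env; copy it only if .env is absent.
if [ ! -f .env ] && [ -f .env.example ]; then
    cp .env.example .env
fi

# Hypothetical helper: build the Bot API URL for a given method.
telegram_api_url() {
    # $1 = bot token, $2 = Bot API method name (for example getMe)
    printf 'https://api.telegram.org/bot%s/%s\n' "$1" "$2"
}

# BOT_TOKEN is a placeholder name; the check only runs when it is set.
if [ -n "${BOT_TOKEN:-}" ]; then
    curl -s "$(telegram_api_url "$BOT_TOKEN" getMe)"
fi
```

With a valid token, the response should be a JSON object whose `ok` field is `true`; an invalid token returns an error object instead.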

Run the application

(venv) $ ./crawl.sh

Configuring cronjob

crawl.sh is simply a utility script that enters the virtualenv and runs the spiders configured in it, so you can just point your crontab at it.
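
As an illustration, the sketch below prints a crontab line that runs the script at a fixed interval and appends its output to a log file. The helper name `cron_line`, the 30-minute interval, and both paths are assumptions; adjust them to wherever you cloned the repo.

```shell
# Hypothetical helper: print a crontab line that runs a script every N minutes
# and appends its output (stdout and stderr) to a log file.
cron_line() {
    # $1 = interval in minutes, $2 = absolute path to the script, $3 = log file
    printf '*/%s * * * * %s >> %s 2>&1\n' "$1" "$2" "$3"
}

# Placeholder paths; cron needs absolute paths here.
cron_line 30 "$HOME/real-estate-scrapper/crawl.sh" "$HOME/crawl.log"
```

Paste the printed line into the editor opened by `crontab -e`. Redirecting the output to a log file makes it easier to see why a scheduled run failed.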
