Zakiirim/WebScraping_Project


Beautiful Soup Version

You can run the script from the command line with:

python IndeedBS.py [optional criteria] -O [output]


Possible arguments:

  • -job: job title or company name
  • -city: city name
  • -contract: type of contract
  • -maxold: maximum age of the job posting
  • -max: how many pages to scrape (default: min(maximum number of pages, 100))
  • -O: output file name, e.g. outputname.xlsx
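
The options above could be parsed roughly as follows. This is only a minimal sketch using the standard `argparse` module; it is not taken from `IndeedBS.py`, and the names `build_parser`, `resolve_page_limit`, `max_pages` and the default output name are assumptions made for illustration.

```python
import argparse
from typing import Optional


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: the real IndeedBS.py may read its options differently.
    parser = argparse.ArgumentParser(prog="IndeedBS.py")
    parser.add_argument("-job", default=None, help="job title or company name")
    parser.add_argument("-city", default=None, help="city name")
    parser.add_argument("-contract", default=None, help="type of contract")
    parser.add_argument("-maxold", default=None, help="maximum age of the job posting")
    parser.add_argument("-max", dest="max_pages", type=int, default=None,
                        help="how many pages to scrape")
    # The default file name here is an assumption, not documented in the README.
    parser.add_argument("-O", dest="output", default="job_offers.xlsx",
                        help="output file name (.xlsx)")
    return parser


def resolve_page_limit(requested: Optional[int], available: int, cap: int = 100) -> int:
    # README default when -max is omitted: min(maximum number of pages, 100).
    if requested is None:
        return min(available, cap)
    # Assumption: an explicit -max value is used as given.
    return requested


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

With this sketch, `python IndeedBS.py -job "Data Scientist" -city Warsaw -O offers.xlsx` would leave `max_pages` unset, so the page limit falls back to the documented default.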


Selenium Version

You can run the script from the command line with:

python IndeedSelenium.py [optional criteria] -O [output]


Possible arguments:

  • -job: job title or company name
  • -city: city name
  • -contract: type of contract
  • -maxold: maximum age of the job posting
  • -max: how many pages to scrape (default: min(maximum number of pages, 100))
  • -O: output file name, e.g. outputname.xlsx
  • -h (or -headless): run the browser in headless mode
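
Because `-h` is normally reserved for help in `argparse`, a script that uses it for headless mode has to disable the built-in help option. The sketch below shows one way this could be done; it is an assumption about the approach, not the code in `IndeedSelenium.py`, and `build_selenium_parser` and the `--help` replacement are hypothetical.

```python
import argparse


def build_selenium_parser() -> argparse.ArgumentParser:
    # add_help=False frees "-h" so it can mean "headless" instead of "help".
    parser = argparse.ArgumentParser(prog="IndeedSelenium.py", add_help=False)
    parser.add_argument("-h", "-headless", dest="headless", action="store_true",
                        help="run the browser without a visible window")
    # Hypothetical replacement for the help option that was disabled above.
    parser.add_argument("--help", action="help", help="show this message and exit")
    return parser


if __name__ == "__main__":
    args = build_selenium_parser().parse_args()
    print("headless:", args.headless)
```

The resulting boolean could then decide whether a headless option is passed to the browser driver.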

Scrapy Version

You can run the spider from the command line with:

scrapy crawl job_offers [optional criteria] -o [output]


Possible arguments:

  • -job: job title or company name
  • -city: city name
  • -contract: type of contract
  • -maxold: maximum age of the job posting
  • -o: output file name, e.g. outputname.csv

Example Filters:


  • -job: 'Data Scientist'
  • -city: 'Warsaw'
  • -contract: 'fulltime', 'permanent', 'subcontract', 'internship', 'parttime'
  • -maxold: '1' (24 hours), '3' (3 days), '7' (7 days), '14' (14 days)
  • -o: 'job_offers.csv'
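
The example values listed above can be checked before a scrape starts. The following is a minimal sketch that treats the listed `-contract` and `-maxold` values as the only accepted ones; whether the scripts themselves reject other values is not stated here, so that restriction and the `validate_filters` helper are assumptions.

```python
from typing import List, Optional

# Values taken from the example filters; treating them as exhaustive is an assumption.
ALLOWED_CONTRACTS = {"fulltime", "permanent", "subcontract", "internship", "parttime"}
ALLOWED_MAXOLD = {"1", "3", "7", "14"}  # days; "1" corresponds to 24 hours


def validate_filters(contract: Optional[str] = None,
                     maxold: Optional[str] = None) -> List[str]:
    """Return a list of human-readable problems; an empty list means the filters look fine."""
    problems = []
    if contract is not None and contract not in ALLOWED_CONTRACTS:
        problems.append(f"unknown contract type: {contract!r}")
    if maxold is not None and str(maxold) not in ALLOWED_MAXOLD:
        problems.append(f"unsupported maxold value: {maxold!r}")
    return problems


if __name__ == "__main__":
    for problem in validate_filters(contract="fulltime", maxold="5"):
        print(problem)
```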
