A parser that collects public information about Python developer vacancies from Djinni.co, saves it to a CSV file, analyzes the data, and generates plots.
![diagram](data%20analysis/plots/technology_occurrence.png)
- Asynchronous Parsing with asyncio: Pages are parsed asynchronously with the asyncio library, so the scraper does not sit idle while waiting on network responses.
- Optimized Processing: The number of requests is kept low to reduce resource usage and keep data processing efficient.
- Thoughtful Data Analysis: The collected data is analyzed with carefully chosen strategies to produce comprehensive and meaningful insights.
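As a rough illustration of the asynchronous approach, here is a minimal sketch that fetches several pages concurrently while capping the number of requests in flight. It is not the project's actual code: `fetch_page`, `fetch_all`, `scrape`, and `MAX_CONCURRENT` are hypothetical names, and the network call is simulated with `asyncio.sleep` so the example runs without any third-party HTTP client.

```python
import asyncio

# Hypothetical concurrency limit; the real project may use a different value or none.
MAX_CONCURRENT = 5


async def fetch_page(page: int) -> str:
    """Stand-in for an HTTP request; a real scraper would call an async HTTP client here."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"<html>page {page}</html>"


async def fetch_all(pages: range, limit: int = MAX_CONCURRENT) -> list[str]:
    """Fetch all pages concurrently, with at most `limit` requests in flight."""
    semaphore = asyncio.Semaphore(limit)

    async def bounded_fetch(page: int) -> str:
        async with semaphore:
            return await fetch_page(page)

    # gather returns results in the same order as the awaitables passed in
    return await asyncio.gather(*(bounded_fetch(p) for p in pages))


def scrape(num_pages: int) -> list[str]:
    """Synchronous entry point: run the event loop until every page is fetched."""
    return asyncio.run(fetch_all(range(1, num_pages + 1)))
```

Calling `scrape(3)` returns one HTML string per page, in page order. The semaphore is one simple way to stay polite to the target site while still overlapping the waiting time of individual requests.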
To set up and run the project locally, follow these steps:
- Clone the Repository: `git clone https://github.com/AnyoneClown/djinni-scraping.git`
- Install Dependencies: `pip install -r requirements.txt`
- Open the file `main.py` in the `scraping` folder and run the project.
- The file `vacancies.csv` will be generated with data in the `scraping` folder.
- To display the graphs, open the file `analisys.ipynb` in the `data analysis` folder and click Run All.
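To give a feel for the kind of analysis behind a technology-occurrence plot, here is a minimal sketch that counts how many vacancies mention each technology. The column name `description`, the `TECHNOLOGIES` tuple, the `count_technologies` helper, and the sample rows are all assumptions made for illustration; the real layout of `vacancies.csv` and the notebook's logic may differ.

```python
import csv
import io
from collections import Counter

# Hypothetical list of technologies to look for; not taken from the project.
TECHNOLOGIES = ("python", "django", "sql", "docker")

# Made-up sample data standing in for vacancies.csv (column names are assumed).
SAMPLE = (
    "title,description\n"
    "Junior Python Developer,Python and Django with SQL\n"
    "Backend Engineer,Python plus Docker\n"
)


def count_technologies(csv_text, column="description", technologies=TECHNOLOGIES):
    """Count the number of rows whose `column` mentions each technology."""
    counts = Counter()
    for row in csv.DictReader(io.StringIO(csv_text)):
        text = row[column].lower()
        for tech in technologies:
            # Plain substring match: simple, but "sql" also matches "postgresql".
            if tech in text:
                counts[tech] += 1
    return counts


counts = count_technologies(SAMPLE)
```

With the sample rows above, `counts["python"]` is 2 and each of the other three technologies is counted once. To run the same function on a real file, read it first, for example with `open(path, newline="", encoding="utf-8").read()`.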
I welcome contributions to make this project even better. If you're interested in contributing, here's a step-by-step guide:
- Fork the Repository: Click the "Fork" button on the top right of the repository's page to create your own fork.
- Create a New Branch: Create a new branch for your feature or bug fix. Be descriptive with your branch name.
- Implement Your Changes: Make your changes to the codebase. Ensure your code adheres to the project's coding standards.
- Test Thoroughly: Test your changes rigorously to ensure they work as expected.
- Submit a Pull Request: When ready, submit a pull request with details about your changes. Provide a clear and concise explanation of the problem and solution.