This repository contains structured datasets in various categories

Stefen-Taime/open-source-data

Open Source Data

Description

This repository contains structured datasets in categories such as "bank", "beer", "coffee", "commerce", "company", "computer", "credit_card", "dessert", "device", "food", "keywords", "movies", "ratings", "restaurant", "stripe", "subscription", and "user". Each category provides its data in three formats (CSV, JSON, and XML), current as of January 16, 2024. The data is organized so that it is easy to access and use for analysis and development.

Repository Structure

The repository is organized as follows:

.
├── bank
│   ├── csv
│   │   ├── csv_bank_20240116_1.csv
│   │   ├── csv_bank_20240116_2.csv
│   │   ├── csv_bank_20240116_3.csv
│   │   ├── csv_bank_20240116_4.csv
│   │   └── csv_bank_20240116_5.csv
│   ├── json
│   │   ├── json_bank_20240116_1.json
│   │   ├── json_bank_20240116_2.json
│   │   ├── json_bank_20240116_3.json
│   │   ├── json_bank_20240116_4.json
│   │   └── json_bank_20240116_5.json
│   └── xml
│       ├── xml_bank_20240116_1.xml
│       ├── xml_bank_20240116_2.xml
│       ├── xml_bank_20240116_3.xml
│       ├── xml_bank_20240116_4.xml
│       └── xml_bank_20240116_5.xml
├── bank.py
├── [Other Categories]
└── [Corresponding Files]
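
The naming pattern visible in the tree above (`<format>_<category>_<date>_<index>.<format>` inside `<category>/<format>/`) can be turned into a small path helper. The following is a minimal sketch, not part of the repository: the function names `dataset_path` and `list_dataset_files` are hypothetical, and it assumes the other categories follow the same layout as `bank`.

```python
from pathlib import Path

# Formats shown in the repository tree.
FORMATS = ("csv", "json", "xml")


def dataset_path(root, category, fmt, index, date="20240116"):
    """Build the path of one data file from the naming pattern in the tree.

    Assumption: every category uses the same layout as "bank".
    """
    if fmt not in FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    return Path(root) / category / fmt / f"{fmt}_{category}_{date}_{index}.{fmt}"


def list_dataset_files(root, category, fmt):
    """Return the files present on disk for one category and format, sorted by name."""
    return sorted((Path(root) / category / fmt).glob(f"{fmt}_{category}_*.{fmt}"))
```

For example, `dataset_path("data", "bank", "csv", 1)` points at `data/bank/csv/csv_bank_20240116_1.csv`, where `data` stands for wherever the repository was cloned.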

Usage

Each category comes with a Python script (e.g., bank.py, beer.py) for working with its data. These scripts are designed to import and process the data in CSV, JSON, and XML formats, and can serve as a starting point for building applications or performing data analysis.
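
For readers who want to read the files directly, here is a minimal sketch of a loader that uses only the Python standard library. It is not the code in bank.py or the other category scripts, whose contents are not shown here. The function name `load_records` is hypothetical, and the sketch assumes that a JSON file holds either a list of objects or a single object, and that an XML file has one child element per record under the root, with one sub-element per field.

```python
import csv
import json
import xml.etree.ElementTree as ET
from pathlib import Path


def load_records(path):
    """Load one data file into a list of dicts, choosing the parser by file extension."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".csv":
        # The first row is treated as the header; all values come back as strings.
        with path.open(newline="", encoding="utf-8") as f:
            return list(csv.DictReader(f))
    if suffix == ".json":
        with path.open(encoding="utf-8") as f:
            data = json.load(f)
        # Assumption: a list of records, or a single record object.
        return data if isinstance(data, list) else [data]
    if suffix == ".xml":
        # Assumption: <root><record><field>value</field>...</record>...</root>
        root = ET.parse(path).getroot()
        return [{field.tag: field.text for field in record} for record in root]
    raise ValueError(f"unsupported file type: {path.suffix!r}")
```

Because CSV and XML values are returned as strings while JSON keeps its native types, any comparison across formats should normalize the types first.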

Reference Key

Data in all categories uses a common user_id as the primary reference key, so records from different categories can be integrated and compared consistently.
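
As an illustration of how the shared user_id key might be used, the sketch below joins two lists of records (for example, rows from the "user" and "bank" categories) on that key. The function names `index_by_user` and `join_on_user` are hypothetical and not part of the repository. The sketch assumes each record is a dict with a `user_id` entry, and it compares keys as strings so that values read from CSV (strings) and JSON (possibly numbers) still match.

```python
from collections import defaultdict


def index_by_user(records, key="user_id"):
    """Group records by their user_id, normalized to a string."""
    index = defaultdict(list)
    for record in records:
        value = record.get(key)
        if value is not None:
            index[str(value)].append(record)
    return index


def join_on_user(left, right, key="user_id"):
    """Inner join: pair every left record with each right record sharing its user_id."""
    right_index = index_by_user(right, key)
    joined = []
    for record in left:
        value = record.get(key)
        if value is None:
            continue  # records without a key cannot be matched
        for match in right_index.get(str(value), []):
            joined.append({"left": record, "right": match})
    return joined
```

Building the index once keeps the join linear in the size of the two inputs rather than comparing every pair of records.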

Contribution

Contributions to the repository are welcome. Please follow the contribution guidelines to submit your changes or additions.

Contact

For any questions or comments, feel free to contact Stefen Taime at [email protected].
