ohw24_proj_big_data_us

Folder Structure

  • contributor_folders Each contributor can make a folder here and push their work to it during the week. This lets everyone see each other's work while avoiding merge conflicts.
  • final_notebooks When the team develops shared final notebooks, they can be shared here. Make sure to communicate so that you limit merge conflicts.
  • scripts Shared scripts or functions can be added here.
  • data Shared datasets can be stored here. Note: do not put large datasets on GitHub. Speak to the organizers if you need to share large datasets. Each team member can keep a local copy of the dataset in the same folder to preserve relative paths, but the dataset does not need to be added to git/GitHub (you can use .gitignore).

You can start with a simple structure and as you progress you can refine it to contain more components. Here is an example of a more elaborate structure for a data science project.

Project Name

Big data parallel computing

One-line Description

This project uses parallel computing to spread a big-data computation across all available CPU cores.

Planning

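We use public Ocean Observatories Initiative (OOI) distributed acoustic sensing (DAS) data from the Oregon shore region, and we parallelize the processing with the Dask package.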
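As a rough illustration of the idea, the sketch below uses Dask to split an array into blocks and process them in parallel. It is not the project's actual workflow: the array is synthetic random data standing in for a DAS record (channels by time samples), and the function name `channel_rms`, the array shape, and the choice of a per-channel RMS are all assumptions made for the example.

```python
import os

import dask.array as da
import numpy as np


def channel_rms(strain, n_workers=None):
    """Per-channel RMS of a 2-D array (channels x samples), computed with Dask.

    Hypothetical helper for illustration only.
    """
    n_workers = n_workers or os.cpu_count() or 1
    n_channels, n_samples = strain.shape

    # Split along the channel axis so each block can be processed independently.
    chunk_rows = max(1, n_channels // n_workers)
    strain_dask = da.from_array(strain, chunks=(chunk_rows, n_samples))

    # Build a lazy task graph; nothing is computed yet.
    rms_lazy = da.sqrt((strain_dask ** 2).mean(axis=1))

    # Run the graph on a local thread pool with one worker per requested core.
    return rms_lazy.compute(scheduler="threads", num_workers=n_workers)


# Synthetic stand-in for a DAS record (assumed shape: 64 channels, 10,000 samples).
rng = np.random.default_rng(0)
strain = rng.standard_normal((64, 10_000))
rms = channel_rms(strain)
```

The local threaded scheduler is used here only to keep the example short. For larger jobs, a `dask.distributed` cluster or the process-based scheduler may be a better fit, depending on the workload.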

Collaborators

Name | Location | Role
Erfan B. Horeh | Bigelow | Project Facilitator

Planning

  • Initial idea: "Parallel computing using all cores in the system"
  • Ideation jam board: Add link
  • Ideation Presentation: Add link
  • Slack channel: ohw24_proj_name
  • Project google drive: Add link
  • Final presentation: Add link

Background

Goals

Datasets

Workflow/Roadmap

Results/Findings

Lessons Learned

References