Commit

data cleanup
nofurtherinformation committed Mar 29, 2024
1 parent 187809e commit 67cb682
Showing 5 changed files with 1 addition and 0 deletions.
Binary file modified public/data/_concentration_metrics_wide_ds.parquet
Binary file not shown.
Binary file modified public/data/concentration_metrics_wide_ds.parquet
Binary file not shown.
Binary file modified public/data/gravity_dollar_pivoted.parquet
Binary file not shown.
Binary file modified public/data/gravity_no_dollar_pivoted.parquet
Binary file not shown.
1 change: 1 addition & 0 deletions scripts/csv_to_parquet.py
@@ -10,6 +10,7 @@
compression = "gzip"
for csv in csvs:
    df = pd.read_csv(csv)
    df['GEOID'] = df['GEOID'].astype(str).str.zfill(11)
    parquet = path.splitext(csv)[0] + ".parquet"
    df.to_parquet(parquet, compression=compression)
# %%
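
The `zfill(11)` line in the hunk above restores leading zeros that are lost when `GEOID` is parsed as a number (an 11-character GEOID is presumably a census tract identifier). One case it would not cover: if the column is parsed as float, for example because some rows are empty, `astype(str)` yields values such as `1001020100.0`, which are already longer than 11 characters and so are left unpadded. The sketch below is a hypothetical per-value helper, not part of this commit, that illustrates one way to handle ints, floats, strings, and missing values consistently. The name `normalize_geoid` and the `width` parameter are assumptions made for illustration.

```python
import math


def normalize_geoid(value, width=11):
    """Return a zero-padded GEOID string, or None for missing values.

    Hypothetical helper for illustration; not part of the commit.
    """
    if value is None:
        return None
    if isinstance(value, float):
        if math.isnan(value):
            return None
        # A float such as 1001020100.0 would otherwise stringify with ".0"
        value = int(value)
    # Strip stray whitespace, then left-pad with zeros to the target width
    return str(value).strip().zfill(width)


print(normalize_geoid(1001020100))     # int that lost its leading zero
print(normalize_geoid(1001020100.0))   # float, e.g. from a column with gaps
print(normalize_geoid("01001020100"))  # already well-formed string
```

All three calls print `01001020100`. In the script, something like `df['GEOID'].map(normalize_geoid)` could replace the `astype(str).str.zfill(11)` chain. An alternative that avoids the numeric round trip entirely would be passing `dtype={'GEOID': str}` to `pd.read_csv`, which keeps any leading zeros already present in the CSV.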
