Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merge missing file fix from clinic fork #79

Merged
merged 10 commits into from
Mar 28, 2024
9 changes: 9 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -65,3 +65,12 @@ Student Email: [email protected]

Student Name: Nayna Pashilkar
Student Email: [email protected]

Student Name: Yangge Xu
Student Email: [email protected]

Student Name: Bhavya Pandey
Student Email: [email protected]

Student Name: Kaya Lee
Student Email: [email protected]
10 changes: 6 additions & 4 deletions src/utils/linkage_and_network_pipeline.py
Original file line number Diff line number Diff line change
Expand Up @@ -150,10 +150,12 @@ def preprocess_transactions(transactions: pd.DataFrame) -> pd.DataFrame:

transactions["purpose"] = transactions["purpose"].str.upper()

deduped = pd.read_csv(BASE_FILEPATH / "output" / "deduplicated_UUIDs.csv")
transactions[["donor_id", "recipient_id"]] = transactions[
["donor_id", "recipient_id"]
].replace(deduped)
deduplicated_uuid_path = BASE_FILEPATH / "output" / "deduplicated_UUIDs.csv"
if deduplicated_uuid_path.exists():
deduped = pd.read_csv(BASE_FILEPATH / "output" / "deduplicated_UUIDs.csv")
transactions[["donor_id", "recipient_id"]] = transactions[
["donor_id", "recipient_id"]
].replace(deduped)

return transactions

Expand Down
Loading