Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix names #231

Merged
merged 3 commits into from
Nov 21, 2024
Merged
Show file tree
Hide file tree
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,10 @@

* Add: new repos to track contribs (@lwasser)

### Fixed

* Fix: Updated documentation throughout with a focus on how a user's name is accessed and updated (@lwasser)

## [v0.3.7] - 2024-08-27

* Add: after-date to api call and return clearer 401 message when token needs a refresh. (@lwasser)
Expand Down
6 changes: 6 additions & 0 deletions src/pyosmeta/cli/process_reviews.py
Original file line number Diff line number Diff line change
Expand Up @@ -3,6 +3,12 @@
website. It also grabs some of the package metadata such as stars,
last commit, etc.

This script also checks:
* That each packages documentation is both https compliant and resolves
* If documentation link is broken it removes it.
* NOTE: we may want to have the website note if docs are not available for a
package and have a process to followup

Output: packages.yml file containing a list of:
1. all packages with accepted reviews
2. information related to the review including reviewers, editors
Expand Down
20 changes: 17 additions & 3 deletions src/pyosmeta/cli/update_contributors.py
Original file line number Diff line number Diff line change
@@ -1,3 +1,14 @@
"""
A CLI script that parses through our existing contributor list stored
in pyopensci.github.io repo in the _data/contributors.yml file.

This script should respect if a persons's name is in the contributor file,
then it should be able to retain that name as their desired name (they can update
that file if they wish). It should NOT overwrite that name

It can also use that name when searching for maintainers
"""

import argparse
import pickle
from datetime import datetime
Expand Down Expand Up @@ -95,9 +106,12 @@ def main():
if key == "mastodon":
# Mastodon isn't available in the GH api yet
continue
# Don't replace the value if there is a noupdate flag
# TODO: This approach doesn't work, ruemal-yaml doesn't
# preserve inline comments
# Don't replace the Name value from GitHub if there is a name
# already listed in the contributors.yml file.
# This allows for users to manually update their names and it
# won't be overwritten. This also means an update on GitHub will
# not be updated here which for now is ok. Not everyone has their
# name listed on GH.
if key == "name" and existing[key]:
continue
else:
Expand Down
88 changes: 75 additions & 13 deletions src/pyosmeta/cli/update_review_teams.py
Original file line number Diff line number Diff line change
@@ -1,10 +1,11 @@
"""
This script parses through our reviews and contributors and:
This script parses through our packages.yml and contributors.yml files
and:

1. Updates reviewer, editor and maintainer data in the contributor.yml file to
ensure all packages they supported are listed there.
1b: And that they have a listing as peer-review under contributor type
2. Updates the packages metadata with the participants names if it's missing
2. TODO: not working as expected? Updates the `packages.yml` file with review contributor's names if it's missing
lwasser marked this conversation as resolved.
Show resolved Hide resolved
3. Finally it looks to see if we are missing review participants from
the review issues in the contributor file and updates that file.

Expand All @@ -15,7 +16,6 @@
# TODO - FEATURE we have some packages that were NOT approved but we had
# editors and reviewers who participated. We need to acknowledge these people
# Make sure each issue is tagged "on hold" and then parse contributions
# TODO: package-wide feature: create no update flag for entries
# TODO: make sure the script recognizes a 3rd or 4th reviewer - crowsetta has
# this as will biocypher

Expand All @@ -40,16 +40,73 @@ def process_user(
contribs: dict[str, PersonModel],
processor: ProcessContributors,
) -> tuple[ReviewUser, dict[str, PersonModel]]:
"""
- Add a new contributor to `contribs` (mutating it)
- Add user to any reviews/etc. that they're on (i don't rly understand that part,
someone else write these docs plz (mutating `contribs`)
- get their human name from the github name, mutating the `user` object.
"""Updates a ReviewUser which represents software review participant

The contributor entry looks something like this:
https://github.com/pyOpenSci/pyopensci.github.io/blob/main/_data/contributors.yml
- name: First Last
github_username: ghid
github_image_id: 7649194
contributor_type:
- editor
- package-guide
- package-maintainer
- peer-review
- submitting-author
packages_editor:
- errdapy
- nbless
- pandera
- pygmt
- pyrolite
packages_submitted:
- earthpy
packages_reviewed:

This function updates contributor's contribution types entries and the
package reviews that they's contributed to in the `contribs` object which
gets dumped into our contributors.yml file in the website repo.

It also updates the contributors human name (if only the GitHub username is
available in packages.yml) in
the packages object IF that name is available in the contributors.yml file.

Finally, if the contributor is new and not yet in our contributors.yml file,
it adds them to the contribs object

Parameters
----------
user : ReviewUser
The user object representing a person involved in the peer review process.
Must have a GitHub username to process their data. Name is optional and
is looked up from the contribs dict
role : str
The role of the user in the review process (e.g., "reviewer", "editor").
Used to update their contributions in the contribs dict
pkg_name : str
The name of the package the user is contributing to or reviewing.
Used to update their list of packages that they've supported through
the review process
contribs : dict[str, PersonModel]
A dictionary mapping GitHub usernames to `PersonModel` objects. This
contains information about all contributors and is updated in
this function.
processor : ProcessContributors
A utility object for handling contributor processing logic, including
role mapping and fetching contributor details.

Returns
-------
tuple[ReviewUser, dict[str, PersonModel]]
- Updated `user` object, potentially with additional details (e.g., name).
- Updated `contribs` dictionary, including any new contributor or or
updated data for existing contributors.
"""
gh_user = get_clean_user(user.github_username)

if gh_user not in contribs.keys():
# If they aren't already in contribs, add them
# If they aren't in the existing contribs.yml data, add them by using
# their github username and hitting the github api
print("Found a new contributor!", gh_user)
new_contrib = processor.return_user_info(gh_user)
new_contrib["date_added"] = datetime.now().strftime("%Y-%m-%d")
Expand All @@ -58,15 +115,18 @@ def process_user(
except ValidationError as ve:
print(ve)

# Update user package contributions (if it's unique)
# Update user the list of contribution types if there are new types to add
# for instance a new reviewer would have a "Reviewer" contributor type
# added to their contributors.yml entry. But if reviewer is already in the
# list we don't need to have it twice!
review_key = processor.contrib_types[role][0]
contribs[gh_user].add_unique_value(review_key, pkg_name.lower())

# Update user contrib list (if it's unique)
# Update users contribution type list (if the role is not already there)
review_roles = processor.contrib_types[role][1]
contribs[gh_user].add_unique_value("contributor_type", review_roles)

# If users's name is missing in issue, populate from contribs
# If users's name is missing in issue, populate from contribs dict.
if not user.name:
user.name = getattr(contribs[gh_user], "name")

Expand All @@ -85,11 +145,13 @@ def main():
contrib_types = process_contribs.contrib_types

for pkg_name, review in packages.items():
if pkg_name == "automata":
print("Let's check this data out")
print("Processing review team for:", pkg_name)
for role in contrib_types.keys():
user: list[ReviewUser] | ReviewUser = getattr(review, role)

# handle lists or singleton users separately
# Handle lists or single users separately
if isinstance(user, list):
for i, a_user in enumerate(user):
a_user, contribs = process_user(
Expand Down