Data Quality Monitor #194

Open
vanderburgt opened this issue Feb 6, 2025 · 1 comment
Labels
user story Describes a new feature or requirement

Comments

@vanderburgt
Collaborator

User story

As a journalist using Bron,
I want to access a Data Quality Monitor,
so that I can assess the completeness of data from various government organizations over time and identify potential gaps.

Description
Bron aggregates documents from multiple sources such as ORI, PoliFLW, and open.overheid.nl, but data completeness is inconsistent across organizations and time periods. For example, certain municipalities may be missing data for specific years or have no documents at all. To be aware of these gaps before starting my research, I need a tool that provides a clear, interactive overview of the data quality.

Acceptance criteria

1. Overview by organization:

  • The monitor should display a list of all government organizations (e.g., municipalities, water boards, ministries) with document coverage information.
  • Organizations missing from the data entirely should be flagged.

2. Time-based analysis:

  • The monitor should present data coverage broken down by week, month, and year for each organization.
  • Time periods with missing or sparse documents should be visually highlighted (e.g., using color codes or patterns).

3. Filters:

  • Users can filter the overview by organization type (e.g., municipalities only) or by source (ORI, PoliFLW, etc.).
  • Users can filter by a specific time range to focus on relevant periods.
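
To make criteria 1 to 3 more concrete, here is a rough sketch of how coverage per organization and month could be computed, including the two filters. Everything in it is an assumption for discussion, not Bron's actual data model: the `Document` fields, the `expected_orgs` mapping (organization name to organization type), the `sparse_below` threshold, and the function names are all hypothetical. It only handles months; weeks and years would follow the same pattern with a different period key.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Document:
    # Hypothetical minimal shape; the real schema may differ.
    organization: str  # e.g. "Gemeente A"
    source: str        # e.g. "ORI" or "PoliFLW"
    published: date


def month_keys(start: date, end: date) -> list[str]:
    """Return every 'YYYY-MM' key from start's month through end's month."""
    keys = []
    year, month = start.year, start.month
    while (year, month) <= (end.year, end.month):
        keys.append(f"{year:04d}-{month:02d}")
        month += 1
        if month > 12:
            year, month = year + 1, 1
    return keys


def coverage_report(documents, expected_orgs, start, end,
                    sparse_below=1, org_type=None, source=None):
    """Count documents per organization per month and flag gaps.

    expected_orgs maps organization name -> organization type and is the
    reference list, so organizations without any documents still show up.
    """
    # Filters (criterion 3): time range, source, organization type.
    selected = [
        d for d in documents
        if start <= d.published <= end
        and (source is None or d.source == source)
    ]
    orgs = [
        name for name, kind in expected_orgs.items()
        if org_type is None or kind == org_type
    ]

    counts = Counter(
        (d.organization, f"{d.published.year:04d}-{d.published.month:02d}")
        for d in selected
    )
    months = month_keys(start, end)

    report = {}
    for org in orgs:
        per_month = {m: counts[(org, m)] for m in months}
        report[org] = {
            "per_month": per_month,
            # Criterion 2: periods to highlight in the UI.
            "sparse_months": [m for m, n in per_month.items() if n < sparse_below],
            # Criterion 1: organization has no documents in the selection.
            "missing_entirely": sum(per_month.values()) == 0,
        }
    return report
```

Driving the report from a reference list of organizations, rather than from the documents themselves, is what lets the monitor flag organizations that are missing entirely.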

Maybe:

5. Interactive visualization:

  • Users should be able to interact with charts or tables to zoom in on specific organizations or time periods.
  • Hovering or clicking on a gap should provide additional metadata (e.g., exact missing years, potential source issues).

6. Data quality score:

  • Each search or feed should show a data quality score that indicates how complete users can expect its data to be.
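
One possible starting point for the score, purely as an assumption to discuss: take the share of (organization, period) cells that contain at least a minimum number of documents. The function name `quality_score`, the input shape, and the `min_docs` threshold are hypothetical; other definitions (e.g. weighting by organization size) may fit better.

```python
def quality_score(per_org_counts: dict[str, dict[str, int]], min_docs: int = 1) -> float:
    """Fraction of (organization, period) cells with at least min_docs documents.

    per_org_counts maps organization -> {period key -> document count}.
    Returns a value between 0.0 (nothing covered) and 1.0 (no gaps).
    """
    cells = [n for periods in per_org_counts.values() for n in periods.values()]
    if not cells:
        # No organizations or periods selected: treat as no coverage.
        return 0.0
    covered = sum(1 for n in cells if n >= min_docs)
    return covered / len(cells)
```

A feed or search would pass in the counts for only the organizations and periods it covers, so the score reflects what the user can expect from that specific selection.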
@vanderburgt vanderburgt added the user story Describes a new feature or requirement label Feb 6, 2025
@vanderburgt
Collaborator Author

@Coretteket let's discuss this story and generate some ideas for the implementation.

@Coretteket Coretteket marked this as a duplicate of #112 Feb 10, 2025
@Coretteket Coretteket moved this to Upcoming ⏳ in Bron Roadmap 🚀✨ Feb 10, 2025