You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Web of Science periodically fixes data problems, for example, typos and problems in identifiers, such as DOIs. Since we never update our source data once harvested, those typos and problems with DOIs will remain forever. This results, for example, in broken DOI links (of which we have ~1400 as of April 2023).
Since we used previously harvested publication data if possible when adding the same publication to a new author Profile, it will still have the old cached data even if the a new author harvests that previously harvested publication.
This task would involve periodically pulling updated WoS data for all our publications and re-updating our cached data (source records and pub-hash) or at least some portion of it (such as just the identifiers).
Note, this could have side-effects, as we would be changing data for publications already harvested, which could
change how they appear on user's profiles, even after approved
cause larger than expected nightly change updates when the Profiles API updates publications via our API
Note that some data problems are likely never fixed in Web of Science source records, and this work would thus have no impact on records with persistent bad metadata.
The text was updated successfully, but these errors were encountered:
See also #87 and #179 which are related
Web of Science periodically fixes data problems, for example, typos and problems in identifiers, such as DOIs. Since we never update our source data once harvested, those typos and problems with DOIs will remain forever. This results, for example, in broken DOI links (of which we have ~1400 as of April 2023).
Since we used previously harvested publication data if possible when adding the same publication to a new author Profile, it will still have the old cached data even if the a new author harvests that previously harvested publication.
This task would involve periodically pulling updated WoS data for all our publications and re-updating our cached data (source records and pub-hash) or at least some portion of it (such as just the identifiers).
Note, this could have side-effects, as we would be changing data for publications already harvested, which could
Note that some data problems are likely never fixed in Web of Science source records, and this work would thus have no impact on records with persistent bad metadata.
The text was updated successfully, but these errors were encountered: