Detect and merge duplicate items #865

Open
qwertea opened this issue Feb 15, 2025 · 1 comment
Labels
local Local Account-specific issues

Comments


qwertea commented Feb 15, 2025

(Another non-bug, I think; feel free to close if this is out of scope.)

BBC Science and other BBC feeds seem to randomly duplicate items [1], and it's annoying to see a duplicate of an item you've already read marked as new. It would be nice if duplicates were automatically detected and pruned or merged.

feed: https://feeds.bbci.co.uk/news/science_and_environment/rss.xml
archived feed in case the duplicate is removed later: https://files.catbox.moe/p76r5v.rss


1. At least, that's what I think is going on here.
Owner

jocmp commented Feb 15, 2025

Thanks for the info, @qwertea. I have noticed this with a few feeds. In this case the feed contains the same link twice, for example: <link>https://www.bbc.com/news/articles/cvgeydkz08go</link>. While the link is the same, the guid differs slightly between the two items. Note the different fragment after the hash at the end.

- https://www.bbc.com/news/articles/cvgeydkz08go#0
+ https://www.bbc.com/news/articles/cvgeydkz08go#3

I think in this case I can skip duplicates when multiple items share the same link.
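
For illustration only, here is a rough Python sketch of that idea: keep the first item seen for each link and skip later items with the same link, even when their guids differ. This is not Capy Reader's actual code; `FeedItem` and `skip_duplicate_links` are hypothetical names, and keeping the first occurrence (rather than the last) is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeedItem:
    """Hypothetical, minimal stand-in for a parsed feed item."""
    guid: str
    link: str


def skip_duplicate_links(items):
    """Return items in feed order, keeping only the first item for each link."""
    seen_links = set()
    unique = []
    for item in items:
        # Items without a link cannot be compared this way, so keep them all.
        if item.link:
            if item.link in seen_links:
                continue  # Same link as an earlier item: treat as a duplicate.
            seen_links.add(item.link)
        unique.append(item)
    return unique


if __name__ == "__main__":
    link = "https://www.bbc.com/news/articles/cvgeydkz08go"
    items = [
        FeedItem(guid=link + "#0", link=link),
        FeedItem(guid=link + "#3", link=link),  # Same link, different guid.
    ]
    for item in skip_duplicate_links(items):
        print(item.guid)
```

With the two BBC items above, only the `#0` item would be kept. A real implementation would probably also need to compare against items already stored from earlier refreshes, which this sketch does not cover.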

jocmp self-assigned this on Feb 15, 2025
jocmp changed the title from "Enhancement: detect & merge duplicate items." to "Detect & merge duplicate items." on Feb 15, 2025
jocmp changed the title from "Detect & merge duplicate items." to "Detect a merge duplicate items" on Feb 15, 2025
jocmp changed the title from "Detect a merge duplicate items" to "Detect and merge duplicate items" on Feb 15, 2025
jocmp added the "local" (Local Account-specific issues) label on Feb 15, 2025
jocmp moved this to On Deck in Capy Reader on Feb 15, 2025
jocmp moved this from On Deck to In Progress in Capy Reader on Feb 23, 2025
Projects
Status: In Progress
Development

No branches or pull requests

2 participants