Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable creation of wget scripts with no duplicates #1307

Open
LucaCinquini opened this issue Aug 3, 2016 · 1 comment
Open

Enable creation of wget scripts with no duplicates #1307

LucaCinquini opened this issue Aug 3, 2016 · 1 comment

Comments

@LucaCinquini
Copy link
Contributor

Who: Karl Taylor

t's not obvious to me how I should create a wget script that can retrieve all available output (but not duplicates) for a search satisfying:

CMIP5
1pctCO2
mon
atmos
Amon
r1i1p1

If I click on "show all replicas" I will get results for 33 models and many, many duplicates.
If I click on "search local node only (including all replicas), I miss 11 models and I still get a few duplicates.
If I add "Datanode = "aims3.llnl.gov" to my search, I miss the same 11 models, and the datasets for two models are split into two (each containing only some of the dataset variables).

I would like to get 33 models, but not any duplicates. Can this be done?

@LucaCinquini LucaCinquini added this to the Release 3.8 milestone Aug 3, 2016
@LucaCinquini LucaCinquini self-assigned this Aug 3, 2016
@LucaCinquini
Copy link
Contributor Author

Possible strategies:

o Return no duplicates when executing the Solr search - can this be done over a distributed search ?
o Remove duplicate when the wget script is created (might not be possible because they are created for a single index node)

o Have an option "Remove duplicates" in the data cart
Need to create a dictionary of the form (datase_id) : ( node1, node2, ...) and expose that information to the user to select which node to remove duplicates from.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants