-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Get publication counts per gene per year #39
Comments
You may want to take a look at the |
Hi I have gotten this far. For an example gene id (816394) in taxon Arabidopsis thaliana (txid3702) I can get the count of all the pubmed articles related to this gene
After this the next step is to download in xml or docsum format the articles and filter the articles by date [PDAT] of publication. This is the strategy I am using. I used this next command but the error was "Too many requests"
I don't know how to get around this. Thanks for the help |
Are you using the eUtils API keys? |
Hi vkkodali The initial part of the error looks like below
In the latter part I get a truncated output. The query should yield me 154 articles. I get 54. Thanks for the help. |
You need to create an API key as mentioned in the 'How do I get a key?' section here. After that, you need to either run the following command before executing
|
It worked!!! Thanks a bunch vkkodali. One unrelated comment. I download specific EST, cDNA datasets from NCBI every quarter. I use a combination of epost and efetch. There too I face this issue sometimes, and I rerun after a gap of 250 seconds. exporting the API_KEY should take care of this too right? Thanks for your help |
Hi
I would like to write a edirect query to extract number of publications per gene per year. The group I am interested in is Viridiplantae. So for all species under this group, given a date range, I would like to get the publication count for each gene in that species. The final output that I am looking for is something like
YEAR Genus_Species Gene_Symbol Publication_Count
1970 Arabidopsis thaliana PHYA 3
1971 Arabidopsis thaliana PHYA 2
I can get [PDAT] to work for -db pubmed but not [GENE] or [ORGN]. Need Help. Thanks
The text was updated successfully, but these errors were encountered: