Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Wrong cluster reference sequence in otu file #27

Open
ikostadi opened this issue Dec 18, 2015 · 3 comments
Open

Wrong cluster reference sequence in otu file #27

ikostadi opened this issue Dec 18, 2015 · 3 comments

Comments

@ikostadi
Copy link
Member

Dear all,

unfortunately, a bug in the SILVAngs system has affected one of the files in the results directory of all projects analysed before September 22nd 2015 (that means all currently available OSD results calculated with SILVAngs). The affected file is the 'prjname---ssu_or_lsu---otus.csv' file in the 'exports' subdirectory of your results.

The sequence that is listed for each cluster / OTU in this file, is not the sequence of the reference of the cluster / OTU but is randomly selected from all sequences of that cluster. All other data in this file is correct.

All other sequence exports are not affected and contain the correct sequence for each cluster. The unaffected files include the ARB and FASTA files in the main results directory, as well as the FASTA exports in the 'exports/otu_references' subdirectory.

The bug has been fixed in the meantime and all results starting from September 22nd 2015 are okay.

@gipsilim
Copy link

gipsilim commented Feb 5, 2016

Hello Ivo,

Just saw this post, I have used 2 files dating from before Sept 22. One of them is clearly affected:
osd2014_18s_lgc_otu_by_sample.csv, but not sure about the other one: osd2014_EMG-SINA-SILVA-119.1_otu_by_sample.tsv
Could you please tell me?
best,
Gipsi

@ikostadi
Copy link
Member Author

ikostadi commented Feb 8, 2016

Hi Gipsi,

the files you mentioned are not the type of files referred to in the original post (please compare naming scheme). The affected files contain a comma (or tab) separated information about all per-sample OTUs (in SILVA speak "local clusters"), whereas the file names you mention:

  1. do not contain any sequences in them, only (global) cluster (i.e. OTU) counts
  2. are generated in a way that is independent of the error in the *-otus.csv files

Therefore, they should not be affected in any way by the pipeline bug. If you have any doubt, please feel free to double-check and don't hesitate to ask again or notify is should you find anything suspicious.

Best,
Ivo

@gipsilim
Copy link

gipsilim commented Feb 8, 2016

Hi Ivo,

Thanks,

I thought one of them could be affected, the 18S: osd2014_18s_lgc_otu_by_sample.csv, true that it does not have same naming schema as prjname---ssu_or_lsu—otus.csv

anyway, better like this :)

cheers,

Gipsi

On 08 Feb 2016, at 17:11, Ivo <[email protected]mailto:[email protected]> wrote:

Hi Gipsi,

the files you mentioned are not the type of files referred to in the original post (please compare naming scheme). The affected files contain a comma (or tab) separated information about all per-sample OTUs (in SILVA speak "local clusters"), whereas the file names you mention:

  1. do not contain any sequences in them, only (global) cluster (i.e. OTU) counts
  2. are generated in a way that is independent of the error in the *-otus.csv files

Therefore, they should not be affected in any way by the pipeline bug. If you have any doubt, please feel free to double-check and don't hesitate to ask again or notify is should you find anything suspicious.

Best,
Ivo


Reply to this email directly or view it on GitHubhttps://github.com//issues/27#issuecomment-181422478.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants