Format conversion recommendation? #17

Open
hepcat72 opened this issue Jan 27, 2020 · 2 comments

Comments

@hepcat72
Contributor

This is my first foray into single-cell analysis. I'm working in Galaxy and would like to run Seurat on data from your paper to develop a pipeline, but the Galaxy wrapper appears to take a TSV file rather than an MTX file. I've written a quick conversion in Perl for testing purposes, but is there an established tool for doing the conversion?

@yjzhang
Owner

yjzhang commented Jan 28, 2020

We do not have an "official" tool for doing the mtx to tsv conversion; you can use whatever you'd like.
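For illustration only, here is a minimal sketch of what such a conversion might look like in Python, using only the standard library. It is not a tool from this project. The function name `mtx_to_tsv`, the empty top-left header cell, and the `transpose` flag are all assumptions; whether rows are genes or cells in a given MTX file has to be checked against that file, and the layout the Galaxy wrapper expects may differ.

```python
import csv
import io


def mtx_to_tsv(mtx_lines, row_names, col_names, out, transpose=False):
    """Write a MatrixMarket coordinate matrix as a dense tab-separated table.

    mtx_lines: iterable of text lines from the .mtx file
    row_names / col_names: labels for the matrix rows and columns
    out: a writable text stream
    transpose: set to True if the matrix orientation must be flipped (assumption)
    """
    lines = iter(mtx_lines)
    header = next(lines)
    if not header.startswith("%%MatrixMarket matrix coordinate"):
        raise ValueError("expected a MatrixMarket coordinate file")

    # Skip comment lines; the first non-comment line holds the dimensions.
    for line in lines:
        if line.strip() and not line.startswith("%"):
            break
    n_rows, n_cols, _nnz = (int(x) for x in line.split())
    if len(row_names) != n_rows or len(col_names) != n_cols:
        raise ValueError("name lists do not match the matrix dimensions")

    # Fill a dense table; entries absent from the file are zero.
    dense = [["0"] * n_cols for _ in range(n_rows)]
    for line in lines:
        if not line.strip():
            continue
        i, j, value = line.split()
        dense[int(i) - 1][int(j) - 1] = value  # MatrixMarket indices are 1-based

    if transpose:
        dense = [list(col) for col in zip(*dense)]
        row_names, col_names = col_names, row_names

    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    writer.writerow([""] + list(col_names))  # empty corner cell is an assumption
    for name, row in zip(row_names, dense):
        writer.writerow([name] + row)


# Tiny usage example with made-up data.
example = [
    "%%MatrixMarket matrix coordinate integer general",
    "2 3 3",
    "1 1 5",
    "2 3 7",
    "1 2 1",
]
buffer = io.StringIO()
mtx_to_tsv(example, ["g1", "g2"], ["c1", "c2", "c3"], buffer)
print(buffer.getvalue())
```

Note that a dense table can be very large for real single-cell data, since most entries of a sparse count matrix are zero.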

@hepcat72
Contributor Author

I've learned a few things while working on the conversion:

  1. The underscores at the end of the cell barcodes cause cells with the same 16 nt sequence to be grouped together by Seurat's CellsByIdentities method.
  2. Gene/row names cannot contain underscores, so including the species/chromosome from genes.csv is problematic.
  3. Joining multiple values from genes.csv with commas to form row names is problematic, because some steps in Seurat do not allow commas.
  4. For various QC steps, row names must at least indicate when a gene is on the mitochondrial chromosome.
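The points above could be handled with a small name-cleaning step before the table is written. The following is only a sketch under stated assumptions: the helper names `clean_cell_name` and `clean_gene_name` are hypothetical, the set of mitochondrial chromosome labels is a guess about how genes.csv labels them, and the `MT-` prefix simply follows the common convention of matching mitochondrial genes with a pattern such as `^MT-`.

```python
import re

# Assumed spellings of the mitochondrial chromosome; adjust to the actual data.
MITO_CHROMOSOMES = {"MT", "M", "CHRM", "CHRMT"}


def clean_cell_name(barcode):
    """Replace underscores so the text before '_' is not read as a shared identity."""
    return barcode.replace("_", "-")


def clean_gene_name(symbol, chromosome=""):
    """Return a row name without underscores, commas, or whitespace.

    Genes on a mitochondrial chromosome get an 'MT-' prefix (unless already
    present) so that QC steps can find them by pattern.
    """
    name = re.sub(r"[_,\s]+", "-", symbol.strip()).strip("-")
    is_mito = chromosome.strip().upper() in MITO_CHROMOSOMES
    if is_mito and not name.upper().startswith("MT-"):
        name = "MT-" + name
    return name


print(clean_cell_name("ACGT_12"))
print(clean_gene_name("ND1", "chrM"))
print(clean_gene_name("ACTB", "chr7"))
```

Cleaning can make two previously distinct names identical, so it is worth checking the cleaned row and column names for duplicates before loading the table.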

I'm also not entirely certain that using gene symbols in the TSV is appropriate, or that including genes (and cells) from multiple species is supported or correct when using it in Seurat.

So again, some guidance on conversion would be helpful.
