-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to check which genes are in the database? #24
Comments
You can use: # cd create_cisTarget_Databases
import feather_v1_or_v2
all_columns_in_ctx_db = get_all_column_names_from_feather(feather_file="hg38__refseq-r80__10kb_up_and_down_tss.mc9nr.feather')
all_columns_in_ctx_db Gene names for hg38 are HGNC symbols as linked to RefSeq r80. |
Hi @ghuls , Thank you very much for your help. I'm still having some trouble getting what I want.
I checked the website 'https://resources.aertslab.org/cistarget/' and found out a tf_lists/allTFs_hg38.txt file。 I'm wondering 1) whether this 'allTFs_hg38.txt' file contains all the genes in the database? Or 2) what should I do to make the 'get_all_column_names_from_feather' function works? Thank you for your help. |
You don't need to compile Cluster-Buster to be able to check the feather databases. You can even just load the whole feather database with pandas in the worst case: import pandas as pd
df = pd.read_feather("hg38__refseq-r80__10kb_up_and_down_tss.mc9nr.feather')
df.columns |
Hello, I would like to ask how to obtain the gene_ID of the. feature file on a Linux terminal? I would greatly appreciate it if you could provide some suggestions. |
Hi,
I'm using pySCENIC to analyze human iPSC data. We are interested in some genes and have the following questions:
The database we used are 'hg38__refseq-r80__10kb_up_and_down_tss.mc9nr.feather', 'hg38__refseq-r80__500bp_up_and_100bp_down_tss.mc9nr.feather', 'motifs-v9-nr.hgnc-m0.001-o0.0.tbl'.
Thank you for your help.
Best
The text was updated successfully, but these errors were encountered: