-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
comparing classic umap and topometry #14
Comments
Hi @Marwansha . Not naive at all; this is indeed quite complex. Let's go over your questions one by one. I took the liberty of rewriting them a bit (you seem to be in a hurry) so that this is also useful to others with similar questions.
Yes, the eigencomponents are the 'latent space' (a.k.a. the dimensionality reduced spaced), similar to the latent space learned by autoencoders like scVI or the principal components learned by PCA. TopOMetry comes with three main flavours: multiscale Diffusion Maps ( I would recommend going with Keep in mind that TopOMetry assumes the data has been scaled (i.e.
Yes, the TopOGraph object will automatically use only the number of components detected with the eigengap. I think your code might be somewhat different, as 150 were plotted (so 150 were computed).
Yes, that would make Remember that PCA and spectral methods are not equivalent - how much covariance is explained by 120 PCs in your data?
Now, this is a tough question. As of now, no one knows for sure. In our paper, using images of handwritten digits, we saw that digits with high variability had the highest i.d. estimates, and digits with low variability had the lowest i.d. estimates. Two other works observed a similar phenomenon: Bastien Rick's work on TARDIS, and In Uzu Lim's work HADES, both focusing on singularity detection and i.d. estimation. Overall, this phenomenon appears to be related to higher intra-cluster variability. Whether that's because of noise, low-quality cells, or actual biological signals is still unclear. I'm not aware of much work done on this. I hope this was informative - let me know if you have more questions :) |
sorry if my question is naive but its for better understanding
1-in a classical workflow, if i use Topometry, is the eigncomponents here considered the dimensionality reduction method?
2- for my adata.x object I observed an eigengap around 120, so does the projection used in the model use this number of eigenvectors to do the projection?
3-also for comparison of results shouldn't i use (120 pca) equivalent to 120 EV and use those to compute neighbors and plot a umap to compare there results with
tg.ProjectionDict['MAP of bw_adaptive from msDM with bw_adaptive']
4- a cell type having a higher i.d. estimates than other, how to interept this and could this be just an effect of the cell proportion being small ( low number of cells of thos celltype should lead to high I.d right? AND shold mean they doesnt cluster very well together?)
finally
i did this comparision using same number of pca to generate knn and then the umap with scanpy and another time with topoMAP
here is both of projection for my dataset of pbmc that contain different individuals with different condition (non stimualted and stimualted with covid) how could u interpret the different visualization specifically in the b cell cluster in pink
The text was updated successfully, but these errors were encountered: