Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cedar SFU supercomputer #8

Open
tmozgach opened this issue Feb 23, 2018 · 2 comments
Open

Cedar SFU supercomputer #8

tmozgach opened this issue Feb 23, 2018 · 2 comments

Comments

@tmozgach
Copy link
Owner

tmozgach commented Feb 23, 2018

Cedar is a heterogeneous cluster suitable for a variety of workloads;
Registration link:
http://www.rcg.sfu.ca/hpc/cedar/register-cc.
Registration Tutorial:
https://www.youtube.com/watch?v=EtFUvBYaaZE&list=PLlIsA3IgWjnweFiHlOjBmsF2NTeJ6f5rT

Log in on Mac or Linux:

How to deal with software:

module load python/3.6.3

https://docs.computecanada.ca/wiki/Available_software

Transfer files from laptop to Cedar sever:

scp TPostRawShort.csv [email protected]:/home/tmozgach/scratch/TM

Transfer files from Cedar to laptop

scp -r TM [email protected]:/home/tmozgach/scratch /home/tatyana/nlp
scp [email protected]:/home/tmozgach/scratch/TM/TM_lda25.model* .

OR

Go to http://globus.computecanada.ca. Your "existing organizational login" is your CCDB account. Ensure that "Compute Canada" is selected in the drop-down, then click Continue. Supply your CCDB username and password on the Compute Canada MyProxy page which appears. This takes you to the web portal for Globus. 

How to set up Globus:
https://docs.computecanada.ca/wiki/Globus

Full webinars video:
https://docs.computecanada.ca/wiki/Getting_Started_with_the_new_National_Systems

Install Python module inside Cedar:
https://docs.computecanada.ca/wiki/Python

Main set up:
Once:

mkdir ~/virtualenvironment
virtualenv ~/virtualenvironment
source ~/virtualenvironment/bin/activate
(virtualenvironment) [tmozgach@cedar5 TM]$ pip install nltk gensim pandas pyLDAvis

Every time when you log in:

source ~/virtualenvironment/bin/activate

Install Python module:

python2 -m pip install --user NAME_OF_MODULE

You MUST install the following python packages in Cedar in YOUR virtualenvironment:

nltk
gensim 
pyLDAvis
textblob
pandas
@tmozgach
Copy link
Owner Author

@tmozgach
Copy link
Owner Author

Running on Cedar (First trial, in /home/tmozgach/scratch/TM):
TM_job.sh

#!/bin/bash
#SBATCH --time=04:00:00
#SBATCH --account=def-emodata
#SBATCH --mem=2000
python ./TopicModeling.py
echo 'I finished'

by command

sbatch TM_job.sh

for TPostRaw.csv (only title and post, NO comments)

Repository owner deleted a comment from neowangkkk Feb 24, 2018
Repository owner deleted a comment from neowangkkk Feb 24, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant