Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Errata and updates for Chapter 7 #18

Open
haruosuz opened this issue Nov 8, 2020 · 0 comments
Open

Errata and updates for Chapter 7 #18

haruosuz opened this issue Nov 8, 2020 · 0 comments

Comments

@haruosuz
Copy link

haruosuz commented Nov 8, 2020


(Page 131)

i() { (head -n 2; tail -n 2) < "$1" | column -t}

should be

i() { (head -n 2; tail -n 2) < "$1" | column -t; }


Page 150

$ tail -n +6 Mus_musculus.GRCm38.75_chr1.gtf | awk -F "\t" '{print NF; exit}'
16

should be

$ tail -n +6 Mus_musculus.GRCm38.75_chr1.gtf | awk -F "\t" '{print NF; exit}'
9

(Page 152)

(e.g., 50% with -S 50%).

should be

(e.g., 50% with -S 50%).


(Page 154)

uniq -d test.bed | wc -l had no output. uniq -d Mus_musculus.GRCm38.75_chr1.bed | wc -l printed 22925. Thus,

A file with duplicates, like the test.bed file, has multiple lines returned:

uniq -d test.bed | wc -l
22925

should be

A file with duplicates, like the Mus_musculus.GRCm38.75_chr1.bed file, has multiple lines returned:

uniq -d Mus_musculus.GRCm38.75_chr1.bed | wc -l
22925

(Page 156)

join -1 1 -2 1 -a 1 example_sorted.bed example_lengths_alt.txt # GNU join only

worked on Mac OS X 10.9.5


(Page 163)

brew tap homebrew/science; brew install bioawk

should be

brew tap brewsci/bio; brew install bioawk

https://github.com/Homebrew/homebrew-science/issues/6617


(Page 163)

(just as regular Awk sets the columns of a tabular text file to $1, $1, $2, etc.).

should be

(just as regular Awk sets the columns of a tabular text file to $1, $2, $3, etc.).


(Page 163)

let’s read in example.bed

should be

let’s read in Mus_musculus.GRCm38.75_chr1.gtf


(Page 164)

$ bioawk -c fastx '{print $name,length($seq)}' \
    Mus_musculus.GRCm38.75.dna_rm.toplevel.fa.gz > mm_genome.txt
$ head -n 4 mm_genome.txt

The file "Mus_musculus.GRCm38.75.dna_rm.toplevel.fa.gz" is located in not "chapter-07-unix-data-tools" but in "chapter-09-working-with-range-data" in the GitHub repository.

$ bioawk -c fastx '{print $name,length($seq)}' \
    ../chapter-09-working-with-range-data/Mus_musculus.GRCm38.75.dna_rm.toplevel_chr1.fa.gz > mm_genome.txt
$ head -n 4 mm_genome.txt
1	195471971
$cat mm_genome.txt 
1	195471971

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant