User:Lindenb/Notebook/UMR915/20101013: Difference between revisions
From OpenWetWare
Line 15: | Line 15: | ||
sort -t ' ' -k 1,1 jeter2.txt | uniq > jeter02.tsv | sort -t ' ' -k 1,1 jeter2.txt | uniq > jeter02.tsv | ||
join -t ' ' -1 2 -2 1 jeter01.tsv jeter02.tsv | join -t ' ' -1 2 -2 1 jeter01.tsv jeter02.tsv | ||
==Biostart/Blast== | |||
http://biostar.stackexchange.com/questions/2892 | |||
formatdb some seqs | |||
formatdb -n testDB -p F -i test.fa | |||
xjc -httpproxy "xxxxxxxxxxxxxxx:3128" -d src -p blast -dtd "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd" |
Revision as of 04:27, 13 October 2010
Belgium
modified bomcat for handling more than one directory for the 454HC*.txt.gz stuff and added the new file for RR.
RR ask for joining result with original CSV data:
gunzip -c XXXXunpaired_trimmed_mapping_SNP_DIP.csv.gz | tr ";" " " | egrep '(chrXXXXX|position)'| egrep '(SNP|position)' | cut -d ' ' -f 2,8,9,10-17 | tr -d "." | tr -d '"' > ~/jeter2.txt wc result.txt 352 13253 82717 result.txt
sort -t ' ' -k 2,2 result.txt > jeter01.tsv sort -t ' ' -k 1,1 jeter2.txt | uniq > jeter02.tsv join -t ' ' -1 2 -2 1 jeter01.tsv jeter02.tsv
Biostart/Blast
http://biostar.stackexchange.com/questions/2892
formatdb some seqs
formatdb -n testDB -p F -i test.fa
xjc -httpproxy "xxxxxxxxxxxxxxx:3128" -d src -p blast -dtd "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd"