Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Introduce Ns at random positions #84

Closed
arnikz opened this issue Dec 2, 2020 · 1 comment
Closed

Introduce Ns at random positions #84

arnikz opened this issue Dec 2, 2020 · 1 comment
Assignees
Milestone

Comments

@arnikz
Copy link
Contributor

arnikz commented Dec 2, 2020

Currently, the data/test.fasta does not include Ns so data/reference_N_regions_bed remains empty after

sv-channels/run.sh

Lines 84 to 89 in 923b9ce

# extract N's from sequence into BED
p=seqkit
cmd="$p locate -i -P -r -p "N+" --bed \"$FASTA\" -o \"$REF_REG\""
JOB_ID=$(submit "$cmd" "$p")
JOBS+=($JOB_ID)

Related to #76, #80.

@arnikz arnikz self-assigned this Dec 2, 2020
@arnikz arnikz added this to the 0.1.0 milestone Dec 2, 2020
@arnikz
Copy link
Contributor Author

arnikz commented Dec 2, 2020

Use the following code to introduce point mutations ({A,T,G,C} -> N) into sequences:

$ for p in $(seq 2000000 | shuf -n 10); do
  seqkit mutate -p "$p:N" test.fasta -o test-$p.fasta
  mv test-$p.fasta test.fasta
done

arnikz pushed a commit that referenced this issue Dec 2, 2020
@arnikz arnikz closed this as completed Dec 2, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant