You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
mELM is a tool for masking or assigning Eukaryotic Linear Motifs to protein sequences. Both TSV/GFF3 output or FASTA is possible.
7
7
Essentially the tool is a CLI to the ELM.eu.org online resource with additional tools for dealing with short motif assignment within disordered regions.
@@ -32,7 +32,7 @@ If you have used mELM with ANCHOR predictions please cite the following:
if ANCHOR is installed use that to filter ELM output based on ANCHOR's IUPred disorder prediction, only include ELMs that fall within disordered regions
72
72
-d <>, --anchor-datapath=<>
73
73
provide the location of the anchor data path, mELM will otherwise assume it's in the same directory as your anchor binary
74
+
-g <>, --go-filter=<>
75
+
turn on GO filtering, only show results for ELM classes that have been associated with the GO ID specified
76
+
-o <>, --organism-filter=<>
77
+
turn on organism filtering, only show results for ELM instances that have been observed in a given organism
74
78
75
79
Example Use Cases
76
80
=================
@@ -91,6 +95,11 @@ Get a GFF3 file for a whole genomes worth of protein annotations
Copy file name to clipboardExpand all lines: bin/melm
+31-8
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,5 @@
1
1
#!/usr/bin/env perl
2
+
use utf8;
2
3
=head1NAME
3
4
4
5
I<melm> - mELM masking of assigned ELM motifs
@@ -20,7 +21,7 @@ I<melm> - mELM masking of assigned ELM motifs
20
21
get TSV output of all the ELM instances held in the melm cache
21
22
-a, --assign
22
23
do not mask sequences, instead output a TSV format of all the ELM assignments made per sequence
23
-
-G, -GFF3
24
+
-G, --GFF3
24
25
produce GFF3 output when using --assign, useful if you wish to add this assignment to a genome browser or similar
25
26
-E <>, --max-class-expect=<>
26
27
filter out ELM classes based on their annotated expectation, bigger E means allow for more common motifs
@@ -44,6 +45,10 @@ I<melm> - mELM masking of assigned ELM motifs
44
45
if ANCHOR is installed use that to filter ELM output based on ANCHOR's IUPred disorder prediction, only include ELMs that fall within disordered regions
45
46
-d <>, --anchor-datapath=<>
46
47
provide the location of the anchor data path, mELM will otherwise assume it's in the same directory as your anchor binary
48
+
-g <>, --go-filter=<>
49
+
turn on GO filtering, only show results for ELM classes that have been associated with the GO ID specified
50
+
-o <>, --organism-filter=<>
51
+
turn on organism filtering, only show results for ELM instances that have been observed in a given organism
47
52
48
53
=head1DESCRIPTION
49
54
@@ -88,9 +93,9 @@ Get a GFF3 file for a whole genome's worth of protein annotations
* Added GO data support and bumped the version for release
134
+
127
135
2019-05-17
128
136
* Updated timeout so that downloads from ELM.eu.org are more likely to succeed.
129
-
* Cut a new maintenance release 1.4.2
137
+
* Cut a new maintenance release 1.4.3
130
138
131
139
2016-07-05 - Matt Oates
132
140
* Majority of refactor complete
@@ -181,7 +189,7 @@ use ELM;
181
189
use ELM::Utils 'get_www';
182
190
183
191
#Current version of the script
184
-
our$VERSION = "v1.4.2";
192
+
our$VERSION = "v1.4.3";
185
193
186
194
#User options
187
195
my$help;
@@ -205,8 +213,10 @@ my $disorder_filter;
205
213
my$type;
206
214
my$gff;
207
215
my$anchor_datapath;
216
+
my$go_filter;
217
+
my$organism_filter;
208
218
209
-
#Flags used h,v,u,U,c,i,a,G,E,X,m,n,C,P,l,t,M,D,d
219
+
#Flags used h,v,u,U,c,i,a,G,E,X,m,n,C,P,l,t,M,D,d,g
210
220
GetOptions(
211
221
"help|h!"=> \$help,
212
222
"verbose|v!"=> \$verbose,
@@ -227,14 +237,16 @@ GetOptions(
227
237
"morf-filter|M!"=> \$morf_filter,
228
238
"disorder-filter|D!"=> \$disorder_filter,
229
239
"anchor-datapath|d=s"=> \$anchor_datapath,
240
+
"go-filter|g=s"=> \$go_filter,
241
+
"organism-filter|g=s"=> \$organism_filter,
230
242
) ordie"Fatal Error: Problem parsing command-line ".$!;
231
243
232
244
my@fasta_files = @ARGV;
233
245
234
246
#Print out some help if it was asked for or if no arguments were given.
235
247
pod2usage(-exitstatus=> 0, -verbose=> 2) if$help;
236
248
237
-
pod2usage(-exitstatus=> 0, -verbose=> 1, -msg=>"mELM version $VERSION by Matt Oates (C) 2014-2019. Please provide some sequence files to mask or assign ELM motifs to.")
249
+
pod2usage(-exitstatus=> 0, -verbose=> 1, -msg=>"mELM version $VERSION by Matt Oates (C) 2014-2022. Please provide some sequence files to mask or assign ELM motifs to.")
0 commit comments