Reads filtration change seq length + RF model update #5

TsabarM · 2020-09-30T09:16:17Z

No description provided.

shacharmo · 2020-09-30T23:39:20Z

IgOmeProfiling_pipeline.py

@@ -42,7 +42,7 @@ def run_pipeline(fastq_path, barcode2samplename_path, samplename2biologicalcondi

        module_parameters = [fastq_path, first_phase_output_path, first_phase_logs_path,
                             barcode2samplename_path, left_construct, right_construct,
-                             max_mismatches_allowed, min_sequencing_quality, first_phase_done_path,
+                             max_mismatches_allowed, min_sequencing_quality, minimal_length_required,first_phase_done_path,


This does not match the definition in read_filtration/module_wrapper.py.
There it is defined as a named parameter (one that starts with --), but here it is passed as a positional parameter.
The order is also incorrect: the minimal length ends up being passed as the done path.

To summarize, this change is wrong and doesn't work.
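
To illustrate the mismatch described above, here is a minimal sketch using argparse. The parser is a hypothetical stand-in, not the actual parser in read_filtration/module_wrapper.py: the argument names `fastq_path` and `done_path`, the default of 3, and the sample values are assumptions made only for this example.

```python
import argparse


def build_parser():
    # Hypothetical stand-in for the module's parser; the real one has more
    # arguments and may name or order them differently.
    parser = argparse.ArgumentParser()
    parser.add_argument('fastq_path')
    parser.add_argument('done_path')
    parser.add_argument('--minimal_length_required', type=int, default=3)
    return parser


parser = build_parser()

# Wrong: the named option's value is passed positionally, before the done path.
# argparse assigns it to done_path and the real done path is left over.
wrong, leftovers = parser.parse_known_args(['reads.fastq', '12', 'first_phase.done'])

# Right: pass the value behind its flag, after the positional arguments.
right = parser.parse_args(
    ['reads.fastq', 'first_phase.done', '--minimal_length_required', '12'])

print(wrong.done_path, leftovers)
print(right.done_path, right.minimal_length_required)
```

With a strict `parse_args` call the wrong invocation would exit with an "unrecognized arguments" error; `parse_known_args` is used here only to make the shifted values visible.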

shacharmo · 2020-09-30T23:44:38Z

model_fitting/random_forest.py

+
+def get_hyperparameters_grid(seed):
+    # Number of trees in random forest
+    n_estimators = [int(x) for x in np.linspace(start=100, stop=2000, num=20)]


Set these parameters through command-line arguments; they shouldn't be hardcoded.
This remark applies to the entire file, not just this line.
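
One possible way to do this, sketched under assumptions: the flag names `--n_estimators_start`, `--n_estimators_stop`, and `--n_estimators_num` and the helper `build_n_estimators_grid` are hypothetical, and the defaults simply mirror the values hardcoded in the hunk above. The helper is a pure-Python stand-in for the `np.linspace` call so that the sketch has no dependencies.

```python
import argparse


def build_n_estimators_grid(start, stop, num):
    # Evenly spaced tree counts with both endpoints included
    # (pure-Python stand-in for np.linspace followed by int()).
    if num == 1:
        return {'n_estimators': [start]}
    step = (stop - start) / (num - 1)
    return {'n_estimators': [int(start + i * step) for i in range(num)]}


def parse_args(argv=None):
    # Hypothetical flags; defaults match the values hardcoded in the diff.
    parser = argparse.ArgumentParser()
    parser.add_argument('--n_estimators_start', type=int, default=100)
    parser.add_argument('--n_estimators_stop', type=int, default=2000)
    parser.add_argument('--n_estimators_num', type=int, default=20)
    return parser.parse_args(argv)


# An empty list means "use the defaults"; pass None to read sys.argv instead.
args = parse_args([])
grid = build_n_estimators_grid(
    args.n_estimators_start, args.n_estimators_stop, args.n_estimators_num)
print(grid['n_estimators'])
```

The same pattern would extend to the other hyperparameters in the file, each with a default equal to its current hardcoded value so existing runs keep their behavior.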

shacharmo · 2020-09-30T23:46:11Z

model_fitting/random_forest.py

+    for i in range(num_of_configurations_to_sample):
+        configuration = {}
+        for key in hyperparameters_grid:
+            configuration[key] = np.random.choice(hyperparameters_grid[key], size=1)[0]


Is this seeded? Would we get the same results on every experiment run?
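
For reference, a minimal sketch of reproducible sampling. Whether the PR seeds `np.random` elsewhere is not visible in this hunk. The sketch swaps in the standard library's `random.Random(seed)` to stay dependency-free; with NumPy, calling `np.random.seed(seed)` or drawing from `np.random.default_rng(seed)` would serve the same purpose. The function name `sample_configurations` and the grid contents are assumptions for illustration.

```python
import random


def sample_configurations(hyperparameters_grid, num_of_configurations_to_sample, seed):
    # A private generator seeded once: the same seed yields the same
    # sequence of draws, and global random state is left untouched.
    rng = random.Random(seed)
    configurations = []
    for _ in range(num_of_configurations_to_sample):
        configuration = {key: rng.choice(values)
                         for key, values in hyperparameters_grid.items()}
        configurations.append(configuration)
    return configurations


# Hypothetical grid, for illustration only.
example_grid = {'n_estimators': [100, 200, 300], 'max_depth': [10, 20, None]}
first_run = sample_configurations(example_grid, 5, seed=42)
second_run = sample_configurations(example_grid, 5, seed=42)
print(first_run == second_run)
```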

shacharmo · 2020-09-30T23:55:10Z

model_fitting/random_forest.py

+    data.drop(['sample_name', 'label'], axis=1, inplace=True)
+    # a matrix of the actual feature values
+    X_train = data[train_rows_mask].values
+    X_test = data[test_rows_mask].values


Neither X_test nor Y_test is used anywhere in the (modified) code.

TsabarM changed the base branch from master to controlled_shuffles September 30, 2020 09:42

shacharmo requested changes Sep 30, 2020

yael1994 and others added 28 commits May 19, 2021 14:37

change the list of colors

883f75a

last changes

96faaa2

change the default values

f63417a

names and spaces fix

9b99cab

swap values

06ed787

fix comments

106a3ae

Added log scale to unite heatmap tool

83eb49c

Merge pull request #25 from Webiks/tool_all_data_RF

4ea4fd6

Tool all data rf

Fixed typeos

63112a8

add json file for read filteration phase

37ad77b

changes in the cmd

3609553

check the scema of the json file without load twice

9b1476f

Merge branch 'controlled_shuffles_tools' into validation_files

6db177b

Merge branch 'reads_filtration_change_seq_length' into RF_parallel

9525e37

Merge pull request #20 from Webiks/RF_parallel

f29afc4

Rf parallel

call the json valid scema from load table

220f138

fix conflict

ce5e848

Fixed file validations including imports

074abf4

Merge pull request #14 from Webiks/validation_files

36b2073

Validation files

part 1 cross exp

8816ab8

change the place of file AWS stop machines from tools to auxiliaries

16107f0

Merge branch 'controlled_shuffles_tools' into change_place_stop_machines

c45e662

support mapitope

b5a2f3b

remove import src_dir

40cc915

Merge branch 'change_place_stop_machines' of https://github.com/Webik…

f7ef3d9

…s/IgomeProfiling into change_place_stop_machines

Merge pull request #27 from Webiks/change_place_stop_machines

1a6a549

change the place of file AWS stop machines from tools to auxiliaries

phase 1 and 2

c38eabf

commit

2a3cc96

yael1994 and others added 30 commits May 3, 2022 12:19

sorted the cluster to combine by number of samples, unique peptides, rpm

ffa9a27

add spaces

03b9271

add flag of sample2group

09e5a4f

script for join samples

18b6ffd

add flag for type sort, more readability, user floor instead of round

432c46a

change name of parameters and there explanation

3a10996

fix the script by the new requests

465df5a

remove the letter probability end

f5e7c38

change the path of the wsl tutorial

723bedd

Merge pull request #63 from Webiks/input_change_script_meme

ad8f414

fix the script by the new requests

Merge pull request #64 from Webiks/path_install_wsl_readme

2cf7db3

change the path of the wsl tutorial

Merge pull request #62 from Webiks/order_of_sort_motifs_BC

83f34ca

Order of sort motifs bc

Merge pull request #61 from Webiks/connect_positive_motifs_to_pipeline

d5fdf38

Connect positive motifs to pipeline

group in the file barcode 2 sample

53f57e1

remove not relevent function

57b0f05

fix spaces

433dbca

add new script for summary reads in one csv file

bbb9106

add flag that keep cluster of BC that build from minimun number of sa…

02f23ec

…mples

fix typo

d632f0f

Merge pull request #67 from Webiks/flag_num_sample_build_cluster

8c98b84

Flag num sample build cluster

Merge pull request #65 from Webiks/summary_log_csv

16fff30

add new script for summary reads in one csv file

Merge pull request #66 from Webiks/join_samples_to_groups

b87fb41

Join samples to groups

changed the order of the samples

5a595b5

The motif samples were by sort_by_num_samples, sort_by_unique_memebers, sort_by_cluster_size now its sort_by_num_samples, sort_by_cluster_size , sort_by_unique_memebers when unique members goes from low to high

Merge pull request #68 from Webiks/changeOrderMotifs

c8e19e7

changed the order of the samples

Update unite_motifs_of_biological_condition.py

f8e6295

Update worker_entrypoint.sh

4a24542

updated entrypoint

082648c

fixed bug of biological condition type value

Update unite_motifs_of_biological_condition.py

aba7152

Update unite_motifs_of_biological_condition.py

5f6b79e

Merge pull request #69 from Webiks/fixUniteMotifs

7eed72c

Fix unite motifs
