The minimal polymer length of 2, set in the min_lenght paremeter of the nextflow.config file, is for the homopol_summary.tsv file only (except the two last columns of this file).
Randomisation of each sequence was performed 10,000 times without any constrain. Then means were computed for each homopolymer length category.
Each dot is a value obtained for one sequence.
See the plot_raw_values.tsv file for the plot raw values.
Figure 1: Proportions of homopolymer lengths. See the scatterplot_stat.tsv file for values
Figure 2: Proportions of homopolymer lengths (log10)
Each dot is a value obtained for one sequence.
Figure 3: Proportions of homopolymer lengths: diamond, mean; whiskers, 1.5 x Inter Quartile Range; horizontal bars, quartiles; number at the top, mean. See the boxplot_stat.tsv file for values
Figure 4: Proportions of homopolymer lengths (log10)
See also the t_test.tsv file.
| length | obs_mean | theo_mean | obs_sd | theo_sd | df | t | p.value | BH.adj.p.value |
|---|---|---|---|---|---|---|---|---|
| 01 | 0.229166666666667 | 0.802950956661501 | 0.157747454050008 | 0.00828434936470266 | 3.0165477915132 | -7.2647090476186 | 0.00528339310285068 | 0.0211335724114027 |
| 02 | 0.645833333333333 | 0.187575628908654 | 0.307129783159666 | 0.0124818212833651 | 3.00990973678908 | 2.98166939316191 | 0.0582762434926234 | 0.116552486985247 |
| 03 | 0.0625 | 0.00837948666658332 | 0.125 | 0.0167589733331666 | 3.10781662731246 | 0.85824895163558 | 0.451856906885527 | 0.451856906885527 |
| 04 | 0.0625 | 0.00109392776326153 | 0.125 | 0.00218785552652306 | 3.00183809716059 | 0.982346696573283 | 0.39832626425463 | 0.451856906885527 |
See the reports folder for all the details of the analysis, including the parameters used in the .config file
Full .nextflow.log is in: /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer
The one in the reports folder is not complete (miss the end)
| Variable | Value |
|---|---|
| Project (empty means no .git folder where the homopolymer.nf file is present) |
origin git@gitlab.pasteur.fr:gmillot/homopolymer.git (fetch) |
| Git info (empty means no .git folder where the homopolymer.nf file is present) |
v5.0-dirty |
| Cmd line | nextflow run main.nf -resume |
| execution mode | local |
| Manifest’s pipeline version | null |
| result path | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer/results/homopolymer_2_1708366715 |
| nextflow version | 23.04.4 |
| Name | Description | Value |
|---|---|---|
| launchDir | Directory where the workflow is run | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer |
| nprojectDir | Directory where the homopolymer.nf script is located | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer |
| workDir | Directory where tasks temporary files are created | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer/work |
| Name | Description | Value |
|---|---|---|
| out_path | output folder path | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer/results/homopolymer_2_1708366715 |
| in_path | input folder path | /mnt/c/Users/gmillot/Documents/Git_projects/homopolymer/dataset/test.fasta |
See the nf_dag.png file