Issue 207 #8

lucventurini · 2019-08-15T13:26:24Z

General improvements in speed, especially for multiprocessing; also, now Mikado pick will check at runtime whether there are transcripts that have too big introns and remove them.

…t level of approximation kicks in *before* loading transcript data: that should speed things up as well.

…reation and checking to the subprocesses. This should speed it up a lot in the presence of complex, long loci.

…aximum length. This should prevent stray transcripts from botched prepare runs to take up too much time.

codecov-io · 2019-08-15T13:33:50Z

Codecov Report

Merging #8 into master will increase coverage by 0.02%.
The diff coverage is 77.7%.

@@            Coverage Diff             @@
##           master       #8      +/-   ##
==========================================
+ Coverage   79.54%   79.57%   +0.02%     
==========================================
  Files          70       70              
  Lines       15589    15628      +39     
==========================================
+ Hits        12401    12436      +35     
- Misses       3188     3192       +4

This PR deals with the fact that Mikado pick was not leveraging correctly the multiple processors. This was due to the fact that the main process was taking up the job of checking transcripts and creating loci - expensive operations that acted as bottlenecks. Now the main process will only collate transcripts as GTF rows, do a minimal check on the fact that they do not have introns longer that the maximum size, and then and only then dispatch them. Moreover, the trigger to user reduction methods in a locus has been lowered (from 10,000 to 5,000) and the first method (removal of redundant, completely contained intron chains) will be triggered before loading transcript data from the database.

lucventurini added 8 commits August 13, 2019 13:22

Reduced the amount for starting pruning the graph. Also, now the firs…

983f086

…t level of approximation kicks in *before* loading transcript data: that should speed things up as well.

Fixing up small bugs introduced in previous commit

a966e68

EI-CoreBioinformatics#207: now Mikado pick will delegate transcript c…

adff915

…reation and checking to the subprocesses. This should speed it up a lot in the presence of complex, long loci.

Solved a bug in the submission of loci to the subprocesses

81056b2

Now Mikado pick will exclude any transcript with an intron over the m…

d2a6717

…aximum length. This should prevent stray transcripts from botched prepare runs to take up too much time.

Solved a couple of bugs in picker

57f8035

Bugfix for 57f8035

f21dcf7

Bugfix for f21dcf7 and 57f8035

538eeec

lucventurini merged commit 648af46 into master Aug 15, 2019

lucventurini deleted the issue-207 branch August 15, 2019 13:43

lucventurini added a commit that referenced this pull request Feb 11, 2021

Fix and test EI-CoreBioinformatics#197 (#8)

931d8fd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue 207 #8

Issue 207 #8

lucventurini commented Aug 15, 2019

codecov-io commented Aug 15, 2019

Issue 207 #8

Issue 207 #8

Conversation

lucventurini commented Aug 15, 2019

codecov-io commented Aug 15, 2019

Codecov Report