Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adjust also target ranges and make valid outputs #6

Merged
merged 9 commits into from
Apr 24, 2024

Conversation

AndreaGuarracino
Copy link
Member

@AndreaGuarracino AndreaGuarracino commented Apr 21, 2024

With this PR, now we:

  • also adjust target ranges (not only query ranges)
  • emit valid BEDPE and PAF outputs
  • fix a few bugs.
  • have a -c/--check-intervals flag to check if projected intervals are valid

@AndreaGuarracino
Copy link
Member Author

AndreaGuarracino commented Apr 21, 2024

Added -c/--check-intervals flag to check that the projected intervals are valid with respect to the CIGAR string:

-c, --check-intervals              Check the projected intervals, reporting the wrong ones (slow, useful for debugging)

Example (with fake errors):

impg -p scerevisiae7.s5k.p90.n1.c20k.aln.paf  -b scerevisiae.busco-genes.single.bed -c
  Query length mismatch: expected 14716 from the query range [343848-358564), got 14715 from the CIGAR string; SK1#1#chrXII	1035507	343848	358564	+	SK1#1#chrVI	298678	334149	348864	14715=
  Query length mismatch: expected 14716 from the query range [336487-351203), got 14715 from the CIGAR string; UWOPS034614#1#chrXII	1035507	336487	351203	+	SK1#1#chrVI	298678	334149	348864	72=1X569=1X41=1X86=1...
  Query length mismatch: expected 14716 from the query range [344179-358895), got 14715 from the CIGAR string; YPS128#1#chrXII	1035507	344179	358895	+	SK1#1#chrVI	298678	334149	348864	642=1X128=1X1=1X116=...
  Query length mismatch: expected 14716 from the query range [332043-346759), got 14715 from the CIGAR string; DBVPG6765#1#chrXII	1035507	332043	346759	+	SK1#1#chrVI	298678	334149	348864	129=1X446=1X194=1X1=...
  Query length mismatch: expected 14716 from the query range [349047-363763), got 14715 from the CIGAR string; S288C#1#chrXII	1035507	349047	363763	+	SK1#1#chrVI	298678	334149	348864	129=1X446=1X194=1X1=...
  Query length mismatch: expected 14695 from the query range [348060-362755), got 14694 from the CIGAR string; Y12#1#chrXII	1035507	348060	362755	+	SK1#1#chrVI	298678	334149	348864	642=1X128=1X1=1X116=...

@ekg ekg merged commit f24b697 into pangenome:main Apr 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants