Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

start modularizing intervals + BP graph, and generate AA-esque summary file #24

Open
wants to merge 21 commits into
base: main
Choose a base branch
from

Conversation

suhas-r
Copy link
Collaborator

@suhas-r suhas-r commented Dec 14, 2024

Currently, we generate a container class LongReadBAMToBreakpointMetadata that stores information such as Chimeric Alignments + Copy-Number segments from pre-processing our aligned BAM file and .cns seeds from CNVkit segmentation. We iteratively mutate components of this class to contain different transformed versions of these inputs in confusing + brittle ways. We also rely on metadata provided by the container class when attempting to generate the ILP model corresponding to a single BreakpointGraph, which shouldn't be necessary. After we do the initial amplified interval search and refinement, we should be able to generate self-contained BreakpointGraphs that can be processed independently.

In terms of the final outputted _graph.txt file, this PR adds PathConstraint + AmplifiedInterval information for each BreakpointGraph. This allows us to use just a *_graph.txt file with the cycle entrypoint to generate/solve an ILP model (no additional usage of a BAM or CNV seed file).

@suhas-r suhas-r force-pushed the modularize-breakpoints branch from 67da804 to dd50e4e Compare December 24, 2024 14:02
@suhas-r suhas-r force-pushed the modularize-breakpoints branch from a70bfc8 to ee6af40 Compare January 5, 2025 11:17
Copy link
Collaborator

@kyzhu kyzhu Jan 17, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi @suhas-r In the "cycle2bed" mode, can you remove the requirement of input --num-cycles and by default to convert all cycles to bed format?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes this has been done on the main branch in this commit.

@suhas-r suhas-r force-pushed the modularize-breakpoints branch from b44912a to 098f96c Compare January 24, 2025 11:17
@suhas-r suhas-r force-pushed the modularize-breakpoints branch from a83ebe6 to 1520f98 Compare January 27, 2025 09:20
@suhas-r suhas-r changed the title start modularizing intervals and generating AA-esque summary file start modularizing intervals + BP graph, and generate AA-esque summary file Jan 27, 2025
@suhas-r suhas-r self-assigned this Jan 27, 2025
@suhas-r suhas-r force-pushed the modularize-breakpoints branch from 46ddc58 to 111e7ea Compare January 30, 2025 00:22
@suhas-r suhas-r linked an issue Feb 6, 2025 that may be closed by this pull request
@suhas-r suhas-r force-pushed the modularize-breakpoints branch from 941b2fa to 90b5e37 Compare February 6, 2025 09:40
@suhas-r suhas-r force-pushed the modularize-breakpoints branch from 9f67b70 to 239aaee Compare February 6, 2025 11:15
@suhas-r suhas-r requested a review from kyzhu February 6, 2025 21:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

IndexError: list index out of range in coral seed
2 participants