-
Notifications
You must be signed in to change notification settings - Fork 727
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can a branched ligand chain be encoded for a single glycan location in a JSON file? #288
Comments
Hi! This is documented in https://github.com/google-deepmind/alphafold3/blob/main/docs/input.md#defining-glycans. For your example, you will need to do something like this: "sequences": [
# Your protein sequence(s) would go here, let's assume it has ID "P".
{
"ligand": {
"id": ["A", "B"], "ccdCodes": ["NAG"]
}
},
{
"ligand": {
"id": ["C", "D", "E", "F", "G", "H"],
"ccdCodes": ["MAN"]
}
},
"bondedAtomPairs": [
# The protein - NAG bond.
[["P", <res num>, <atom name>], ["A", 1, <atom name>]],
# The NAG - NAG bond.
[["A", 1, <atom name>], ["B", 1, <atom name>]],
# The NAG - MAN bond.
[["B", 1, <atom name>], ["C", 1, <atom name>]],
# MAN - MAN bonds.
[["C", 1, <atom name>], ["D", 1, <atom name>]],
[["D", 1, <atom name>], ["E", 1, <atom name>]],
[["D", 1, <atom name>], ["F", 1, <atom name>]],
[["F", 1, <atom name>], ["G", 1, <atom name>]],
[["F", 1, <atom name>], ["H", 1, <atom name>]],
],
|
Thanks a lot for your suggestions! However, I didn’t get the expected glycan predictions using the following JSON file:
Would you please review the JSON file above and check for any errors? |
The problem is that you are reusing the I.e. you will need 4 * 2 = 8 The current definition basically bonds all of the things together creating a glycan shared by the 4 protein residues... I will improve the bond checking code though to help catch such cases (e.g. fail on duplicate bond definitions). |
I've added a check for bond uniqueness in a3cf058. |
Many thanks for your guidance! If I need to define the |
Use double-letter IDs, e.g.
Yes, AlphaFold Server uses a different data pipeline that allows this. This feature is alas not available in the standalone AlphaFold 3. |
Thanks again! Using double-letter IDs for ligands to multiple glycans is a good idea, but it becomes problematic if there are more than 100 glycans in a multi-chain protein. Can I use three-letter or even four-letter IDs if double-letter IDs are insufficient? |
Yes, you can as long as all are upper-case letters |
Thanks a lot!! |
Hi,
Many thanks for releasing the AF3 source!
The 7BBV in #54, which includes several single glycan residues in the JSON file, is working correctly. My question is how to specify multiple glycan residues per glycan location.
For example, how can I encode the following branched ligand chain for a single glycan location to run AF3 on a local server?
NAG(NAG(MAN(MAN(MAN)(MAN(MAN)(MAN))))): branched ligand chain
I appreciate your guidance on this.
The text was updated successfully, but these errors were encountered: