downloadDownloads

databaseAll Data

You can download data for the currently viewed protein, or for specific positions. Access is also available through our REST API.


eco Basic
biotech Genome
layers Complexity
coronavirus SNP & Disease
architecture Structure
waves Disorder
forest Conservation
labs Mutations (TCGA/COSMIC)
schema Motifs & PTMs
description UniProt Regions
handshake Binding Partners
water_drop Phase Separation
star Significant Mutations

science Protein-Specific Data


Protein-specific information can be downloaded by clicking on the button in the top-left corner of a protein page. The icon looks like:

explore Positional Data


Users can access detailed information for each residue by clicking on the amino acid sequence. For instance, TCGA data shows all mutations with their associated sample IDs.

These data can be downloaded in the following formats:

  • .txt
  • .json

api REST API

DisCanVis provides automated data access through a RESTful API, available for protein-level and region-specific queries.

biotech 1. Full Annotation or Fasta for a Protein

Users can access the sequence in fasta format and the whole annotation for the protein. Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis and ::response_type:: is json or txt or fasta

Examples:

https://discanvis.elte.hu/rest/WNK3-201.fasta
https://discanvis.elte.hu/rest/WNK3-201.json
https://discanvis.elte.hu/rest/WNK3-201.txt


label 2. Specific Annotation
<

For specific annotation from the current protein users can access with the following syntax:

https://discanvis.elte.hu/rest/::identifier::/::annotation::

Examples:

https://discanvis.elte.hu/rest/WNK3-201/protein
https://discanvis.elte.hu/rest/WNK3-201/elm
https://discanvis.elte.hu/rest/WNK3-201/tcgam

Available Annotations:

protein exon phastcons complexity-seg complexity-dust complexity-trf polymorphism omim clinvar pdb pfam anchor iupred mobidb alphafold tcgam tcgaf tcgai cosmicm cosmicf cosmici roisig elm elmswitches ptm roi binding dibs mfib binding_domain phasepro conservation

aspect_ratio 3. Region-Based Annotation

The user can access all data with a given protein in specific region with a following syntax:

https://discanvis.elte.hu/visual/::identifier::/::positionstart::-::positionend::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::positionstart:: and ::positionend:: is a positive integer within range of the length of the sequence of protein coded by ::identifier:: . The positions should be in a valid range (start should be lower than end). The ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537-550.txt


pin_drop 4. Position-Based Annotation

Requests should be input following the syntax:

https://discanvis.elte.hu/visual/::identifier::/::position::.::response_type::

Where

::identifier:: is any valid identifier from DisCanVis ::position:: is a positive integer within range of the length of the sequence of protein coded by ::identifier::. ::response_type:: is either json or txt

Examples:

https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json
https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt


swap_horiz 5. Annotation by Alternate ID

Requests should be input following the syntax:

https://discanvis.elte.hu/rest/::identifier::/::target_arguments::?search_type=::identifier_search_type::

Where

::identifier_search_type:: currently can be:

  • uniprot
  • transcript

Examples:


import requests
import json
from io import StringIO
"""
Position specific request
"""

"""
TXT read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.txt')
txt = response.text[5:-7]
data = StringIO(txt)
dct = {}
annotation = None
annotation_txt = ''
for line in data:
    if line.startswith('#'):
        if annotation:
            dct[annotation] = annotation_txt
            annotation_txt = ''
        annotation = line.strip('#\n')
    else:
        annotation_txt += line
print(dct['protein'])

                        

Output:

	Protein ID	UniProt Accession	Gene Name	Name	Transcript ID	Chromosome	Cancer Driver
0	WNK3_ENST00000375159	Q9BYP7	WNK3	Serine/threonine-protein kinase WNK3	ENST00000375159.6	chrX	Not Cancer Driver	
"""
Json read example
"""
response = requests.get('https://discanvis.elte.hu/rest/WNK3_ENST00000375159/537.json')
dct = json.loads(response.text)
print(dct['elm'])

Output:

[
    {
        "ELMIdentifier": "DEG_Kelch_KLHL3_1",
        "ELMType": "DEG",
        "ELM_Accession": "ELMI002836",
        "End": 546,
        "InstanceLogic": "true positive",
        "Methods": "coimmunoprecipitation; fluorescence polarization spectroscopy",
        "Organism": "Homo sapiens",
        "PDB": "nan",
        "References": "23838290",
        "Start": 537,
        "id": 1109,
        "protein_id": "WNK3_ENST00000375159"
    }
]