Marine_Sequences_Protein_Annotations
Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT
- Sequence_accession_number
- Unique identifier for the genetic sequence as specified in INSDC databases, linking to the Marine Sequences table (Primary Key). Searchable
- f_header
- Header of the most similar protein entry in the reference dataset
- sseqid
- Sequence ID of the similar protein entry
- stitle
- Title of the similar protein entry
- pident
- Percentage of identical matches
- evalue
- E-value of the alignment
- qcovs
- Query coverage of the alignment
- annotation_source
- Source of the annotation (sp for Swiss-Prot; tr for TrEMBL)
1 row where Sequence_accession_number = "JC226457"
This data as json, CSV (advanced)
| Sequence_accession_number ▼ | f_header | sseqid | stitle | pident | evalue | qcovs | annotation_source | 
|---|---|---|---|---|---|---|---|
| JC226457 | 7987701_vibrio | sp|A5F449|ILVC_VIBC3 | sp|A5F449|ILVC_VIBC3 Ketol-acid reductoisomerase (NADP(+)) OS=Vibrio cholerae serotype O1 (strain ATCC 39541 / Classical Ogawa 395 / O395) OX=345073 GN=ilvC PE=3 SV=1 | 100.0 | 0.0 | 99.8 | sp | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE Marine_Sequences_Protein_Annotations (
        Sequence_accession_number TEXT PRIMARY KEY,
        f_header TEXT,
        sseqid TEXT,
        stitle TEXT,
        pident TEXT,
        evalue TEXT,
        qcovs TEXT,
        annotation_source TEXT,
        FOREIGN KEY (Sequence_accession_number) REFERENCES Marine_Sequences (Sequence_accession_number)
    );