Marine_Sequences_Protein_Annotations
Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT
- Sequence_accession_number
 - Unique identifier for the genetic sequence as specified in INSDC databases, linking to the Marine Sequences table (Primary Key). Searchable
 - f_header
 - Header of the most similar protein entry in the reference dataset
 - sseqid
 - Sequence ID of the similar protein entry
 - stitle
 - Title of the similar protein entry
 - pident
 - Percentage of identical matches
 - evalue
 - E-value of the alignment
 - qcovs
 - Query coverage of the alignment
 - annotation_source
 - Source of the annotation (sp for Swiss-Prot; tr for TrEMBL)
 
1 row where Sequence_accession_number = "GM958372"
This data as json, CSV (advanced)
| Sequence_accession_number ▼ | f_header | sseqid | stitle | pident | evalue | qcovs | annotation_source | 
|---|---|---|---|---|---|---|---|
| GM958372 | 44190782_vibrio | sp|A5F0N6|TDH_VIBC3 | sp|A5F0N6|TDH_VIBC3 L-threonine 3-dehydrogenase OS=Vibrio cholerae serotype O1 (strain ATCC 39541 / Classical Ogawa 395 / O395) OX=345073 GN=tdh PE=3 SV=1 | 100.0 | 7.03e-259 | 99.7 | sp | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE Marine_Sequences_Protein_Annotations (
        Sequence_accession_number TEXT PRIMARY KEY,
        f_header TEXT,
        sseqid TEXT,
        stitle TEXT,
        pident TEXT,
        evalue TEXT,
        qcovs TEXT,
        annotation_source TEXT,
        FOREIGN KEY (Sequence_accession_number) REFERENCES Marine_Sequences (Sequence_accession_number)
    );