Marine_Sequences_Protein_Annotations
Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT
- Sequence_accession_number
 - Unique identifier for the genetic sequence as specified in INSDC databases, linking to the Marine Sequences table (Primary Key). Searchable
 - f_header
 - Header of the most similar protein entry in the reference dataset
 - sseqid
 - Sequence ID of the similar protein entry
 - stitle
 - Title of the similar protein entry
 - pident
 - Percentage of identical matches
 - evalue
 - E-value of the alignment
 - qcovs
 - Query coverage of the alignment
 - annotation_source
 - Source of the annotation (sp for Swiss-Prot; tr for TrEMBL)
 
1 row where Sequence_accession_number = "HB900428"
This data as json, CSV (advanced)
| Sequence_accession_number ▼ | f_header | sseqid | stitle | pident | evalue | qcovs | annotation_source | 
|---|---|---|---|---|---|---|---|
| HB900428 | 51225259_oceanobacillus | tr|Q8EL24|Q8EL24_OCEIH | tr|Q8EL24|Q8EL24_OCEIH Transcriptional regulator (LysR family) OS=Oceanobacillus iheyensis (strain DSM 14371 / CIP 107618 / JCM 11309 / KCTC 3954 / HTE831) OX=221109 GN=OB3407 PE=3 SV=1 | 100.0 | 6.69e-206 | 99.7 | tr | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE Marine_Sequences_Protein_Annotations (
        Sequence_accession_number TEXT PRIMARY KEY,
        f_header TEXT,
        sseqid TEXT,
        stitle TEXT,
        pident TEXT,
        evalue TEXT,
        qcovs TEXT,
        annotation_source TEXT,
        FOREIGN KEY (Sequence_accession_number) REFERENCES Marine_Sequences (Sequence_accession_number)
    );