Marine_Sequences_Protein_Annotations
Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT
- Sequence_accession_number
 - Unique identifier for the genetic sequence as specified in INSDC databases, linking to the Marine Sequences table (Primary Key). Searchable
 - f_header
 - Header of the most similar protein entry in the reference dataset
 - sseqid
 - Sequence ID of the similar protein entry
 - stitle
 - Title of the similar protein entry
 - pident
 - Percentage of identical matches
 - evalue
 - E-value of the alignment
 - qcovs
 - Query coverage of the alignment
 - annotation_source
 - Source of the annotation (sp for Swiss-Prot; tr for TrEMBL)
 
1 row where Sequence_accession_number = "AR591201"
This data as json, CSV (advanced)
| Sequence_accession_number ▼ | f_header | sseqid | stitle | pident | evalue | qcovs | annotation_source | 
|---|---|---|---|---|---|---|---|
| AR591201 | 21749811_unknown. | tr|A0A2Y9DW95|A0A2Y9DW95_TRIMA | tr|A0A2Y9DW95|A0A2Y9DW95_TRIMA Muscarinic acetylcholine receptor OS=Trichechus manatus latirostris OX=127582 GN=LOC101349646 PE=3 SV=1 | 95.7 | 5.46000000020792e-314 | 99.8 | tr | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE Marine_Sequences_Protein_Annotations (
        Sequence_accession_number TEXT PRIMARY KEY,
        f_header TEXT,
        sseqid TEXT,
        stitle TEXT,
        pident TEXT,
        evalue TEXT,
        qcovs TEXT,
        annotation_source TEXT,
        FOREIGN KEY (Sequence_accession_number) REFERENCES Marine_Sequences (Sequence_accession_number)
    );