home / MABPAT_dataset

Marine_Sequences_Protein_Annotations

Contains annotations for protein-coding sequences, providing insights into the function and characteristics of the proteins encoded by the sequences.

Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT

Sequence_accession_number
Unique identifier for the genetic sequence as specified in INSDC databases, linking to the Marine Sequences table (Primary Key). Searchable
f_header
Header of the most similar protein entry in the reference dataset
sseqid
Sequence ID of the similar protein entry
stitle
Title of the similar protein entry
pident
Percentage of identical matches
evalue
E-value of the alignment
qcovs
Query coverage of the alignment
annotation_source
Source of the annotation (sp for Swiss-Prot; tr for TrEMBL)

1 row where Sequence_accession_number = "CS793805"

✎ View and edit SQL

This data as json, CSV (advanced)

Sequence_accession_number ▼ f_header sseqid stitle pident evalue qcovs annotation_source
CS793805 9668557_vibrio tr|A0A2I3CMT6|A0A2I3CMT6_VIBAX tr|A0A2I3CMT6|A0A2I3CMT6_VIBAX Transcriptional regulator, MarR family protein OS=Vibrio alginolyticus (strain ATCC 17749 / DSM 2171 / NBRC 15630 / NCIMB 1903 / NCTC 12160 / XII-53) OX=1219076 GN=N646_3410 PE=4 SV=1 100.0 9.86e-103 99.3 tr

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE Marine_Sequences_Protein_Annotations (
        Sequence_accession_number TEXT PRIMARY KEY,
        f_header TEXT,
        sseqid TEXT,
        stitle TEXT,
        pident TEXT,
        evalue TEXT,
        qcovs TEXT,
        annotation_source TEXT,
        FOREIGN KEY (Sequence_accession_number) REFERENCES Marine_Sequences (Sequence_accession_number)
    );
Powered by Datasette · Queries took 2485.131ms · Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT