home / MABPAT_dataset

Marine_Sequences

Provides detailed information about each marine genetic sequence, including its GC content, sequence length, and whether it contains protein-coding information.

Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT

Sequence_accession_number
Unique identifier for the genetic sequence as specified in INSDC databases (Primary Key). Searchable
Species_name
Species associated with the sequence, linking to the Marine Species table. Searchable
GC_content
GC content of the sequence
Sequence_length
Length of the sequence
Sequence_status
Status of the sequence (e.g., confirmed, predicted)
Is_protein_coding_sequence
Indicates if the sequence is protein-coding (1 for yes, 0 for no)
Is_annotated
Indicates if the sequence has been annotated (1 for yes, 0 for no)

10 rows where Is_protein_coding_sequence = 1 and Species_name = "Ecteinascidia turbinata"

✎ View and edit SQL

This data as json, CSV (advanced)

Species_name 1

  • Ecteinascidia turbinata · 10 ✖

Sequence_status 1

  • observed 10

Is_protein_coding_sequence 1

  • 1 · 10 ✖

Is_annotated 1

  • 0 10
Sequence_accession_number ▼ Species_name GC_content Sequence_length Sequence_status Is_protein_coding_sequence Is_annotated
JA637491 Ecteinascidia turbinata 22.8373702422145 4335.0 observed 1 0
JA637492 Ecteinascidia turbinata 20.3238414293691 5373.0 observed 1 0
JA637495 Ecteinascidia turbinata 19.8717948717949 624.0 observed 1 0
JA637500 Ecteinascidia turbinata 24.1252302025783 1086.0 observed 1 0
JA637506 Ecteinascidia turbinata 24.6138515196811 2007.0 observed 1 0
JA637507 Ecteinascidia turbinata 20.5828779599271 1098.0 observed 1 0
JA637508 Ecteinascidia turbinata 18.7112763320942 807.0 observed 1 0
JA637512 Ecteinascidia turbinata 26.0027662517289 1446.0 observed 1 0
JA637513 Ecteinascidia turbinata 29.0018832391714 531.0 observed 1 0
JA637514 Ecteinascidia turbinata 21.9373219373219 351.0 observed 1 0

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE Marine_Sequences (
        Sequence_accession_number TEXT PRIMARY KEY,
        Species_name TEXT,
        GC_content TEXT,
        Sequence_length TEXT,
        Sequence_status TEXT,
        Is_protein_coding_sequence INTEGER,
        Is_annotated INTEGER,
        FOREIGN KEY (Species_name) REFERENCES Marine_Species (Species_name),
        FOREIGN KEY (Sequence_accession_number) REFERENCES Sequences (Sequence_accession_number)
    );
Powered by Datasette · Queries took 384.51ms · Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT