Marine_Sequences

Provides detailed information about each marine genetic sequence, including its GC content, sequence length, and whether it contains protein-coding information.

Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT

Sequence_accession_number: Unique identifier for the genetic sequence as specified in INSDC databases (primary key). Searchable.
Species_name: Species associated with the sequence; links to the Marine_Species table. Searchable.
GC_content: GC content of the sequence, as a percentage.
Sequence_length: Length of the sequence.
Sequence_status: Status of the sequence (e.g., confirmed, predicted).
Is_protein_coding_sequence: Indicates whether the sequence is protein-coding (1 for yes, 0 for no).
Is_annotated: Indicates whether the sequence has been annotated (1 for yes, 0 for no).
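
The GC_content column is described only briefly, so here is a minimal sketch of how such a value is commonly computed. It assumes the column holds the percentage of G and C bases in the sequence. The function name `gc_content` is illustrative and not part of the dataset, and the dataset's own pipeline may treat ambiguous bases differently.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Assumption: every character counts toward the length, including
    ambiguous bases such as N. The dataset may use a different rule.
    """
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc_bases = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_bases / len(seq)


# Synthetic example: 2 of the 4 bases are G or C.
print(gc_content("ATGC"))
```

As a consistency check only: 363 G or C bases in a 723-base sequence would give 50.2074688..., which matches the value listed for GN042284. The sequences themselves are not part of this table, so the base counts cannot be confirmed from this page.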

16 rows where Sequence_status = "observed" and Species_name = "Gammaproteobacteria"
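
The filter above can be sketched as a parameterized query. This is an illustration against an in-memory SQLite copy of the table, not necessarily the query the page itself runs. The helper name `filter_sequences` is hypothetical, and the third row is made up purely to show a non-matching record.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Same columns as the published schema; foreign keys are omitted here
# because the referenced tables are not part of this sketch.
conn.execute(
    """
    CREATE TABLE Marine_Sequences (
        Sequence_accession_number TEXT PRIMARY KEY,
        Species_name TEXT,
        GC_content TEXT,
        Sequence_length TEXT,
        Sequence_status TEXT,
        Is_protein_coding_sequence INTEGER,
        Is_annotated INTEGER
    )
    """
)
rows = [
    ("GN042284", "Gammaproteobacteria", "50.207468879668", "723.0", "observed", 0, 0),
    ("GN042286", "Gammaproteobacteria", "52.1794871794872", "780.0", "observed", 1, 0),
    # Hypothetical row, NOT from the dataset: included only so the filter
    # has something to exclude.
    ("EXAMPLE_0001", "Hypothetical species", "50.0", "100.0", "predicted", 1, 0),
]
conn.executemany("INSERT INTO Marine_Sequences VALUES (?, ?, ?, ?, ?, ?, ?)", rows)


def filter_sequences(connection, status, species):
    """Return accession numbers matching both filter values."""
    query = """
        SELECT Sequence_accession_number
        FROM Marine_Sequences
        WHERE Sequence_status = ? AND Species_name = ?
        ORDER BY Sequence_accession_number
    """
    return [row[0] for row in connection.execute(query, (status, species))]


matches = filter_sequences(conn, "observed", "Gammaproteobacteria")
print(matches)
```

Passing the filter values as parameters rather than formatting them into the SQL string avoids quoting problems with species names that contain apostrophes.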

Is_protein_coding_sequence (2 values)

  • 1: 14 rows
  • 0: 2 rows

Is_annotated (2 values)

  • 0: 15 rows
  • 1: 1 row
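
The facet counts above can be reproduced from the 16 listed rows with a `GROUP BY` query. The sketch below loads only the accession numbers and the two flag columns into a scratch table. The table name `flags` and the helper `facet_counts` are hypothetical names chosen for this example.

```python
import sqlite3

# (accession, Is_protein_coding_sequence, Is_annotated) for the 16 listed rows.
flags = [
    ("GN042284", 0, 0), ("GN042286", 1, 0), ("GN042290", 0, 0),
    ("HC508354", 1, 0), ("HC508356", 1, 0), ("JE571183", 1, 0),
    ("LP124750", 1, 0), ("LP124793", 1, 0), ("LP125886", 1, 0),
    ("LP125929", 1, 0), ("LP126958", 1, 0), ("LP127003", 1, 0),
    ("LP128272", 1, 0), ("LP128312", 1, 1), ("LP129155", 1, 0),
    ("LP129196", 1, 0),
]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE flags ("
    "accession TEXT PRIMARY KEY, "
    "Is_protein_coding_sequence INTEGER, "
    "Is_annotated INTEGER)"
)
conn.executemany("INSERT INTO flags VALUES (?, ?, ?)", flags)

ALLOWED_COLUMNS = {"Is_protein_coding_sequence", "Is_annotated"}


def facet_counts(connection, column):
    """Return (value, row_count) pairs for one column, most frequent first."""
    # Column names cannot be bound as parameters, so check against a whitelist
    # before interpolating into the SQL text.
    if column not in ALLOWED_COLUMNS:
        raise ValueError(f"unexpected column: {column}")
    query = f"SELECT {column}, COUNT(*) AS n FROM flags GROUP BY {column} ORDER BY n DESC"
    return connection.execute(query).fetchall()


protein_coding_counts = facet_counts(conn, "Is_protein_coding_sequence")
annotated_counts = facet_counts(conn, "Is_annotated")
print(protein_coding_counts)
print(annotated_counts)
```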

Sequence_accession_number  Species_name         GC_content        Sequence_length  Sequence_status  Is_protein_coding_sequence  Is_annotated
GN042284                   Gammaproteobacteria  50.207468879668   723.0            observed         0                           0
GN042286                   Gammaproteobacteria  52.1794871794872  780.0            observed         1                           0
GN042290                   Gammaproteobacteria  51.2396694214876  726.0            observed         0                           0
HC508354                   Gammaproteobacteria  57.7698695136418  1686.0           observed         1                           0
HC508356                   Gammaproteobacteria  60.7354685646501  1686.0           observed         1                           0
JE571183                   Gammaproteobacteria  57.8231292517007  882.0            observed         1                           0
LP124750                   Gammaproteobacteria  50.3534068419565  3537.0           observed         1                           0
LP124793                   Gammaproteobacteria  53.2896532896533  3663.0           observed         1                           0
LP125886                   Gammaproteobacteria  52.1176470588235  2550.0           observed         1                           0
LP125929                   Gammaproteobacteria  52.8187665531593  2643.0           observed         1                           0
LP126958                   Gammaproteobacteria  49.7126436781609  3480.0           observed         1                           0
LP127003                   Gammaproteobacteria  45.8949096880131  3654.0           observed         1                           0
LP128272                   Gammaproteobacteria  50.0              2064.0           observed         1                           0
LP128312                   Gammaproteobacteria  52.843137254902   2040.0           observed         1                           1
LP129155                   Gammaproteobacteria  49.0683229813665  483.0            observed         1                           0
LP129196                   Gammaproteobacteria  56.0975609756098  615.0            observed         1                           0

CREATE TABLE Marine_Sequences (
    Sequence_accession_number TEXT PRIMARY KEY,
    Species_name TEXT,
    GC_content TEXT,
    Sequence_length TEXT,
    Sequence_status TEXT,
    Is_protein_coding_sequence INTEGER,
    Is_annotated INTEGER,
    FOREIGN KEY (Species_name) REFERENCES Marine_Species (Species_name),
    FOREIGN KEY (Sequence_accession_number) REFERENCES Sequences (Sequence_accession_number)
);
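
The schema declares GC_content and Sequence_length as TEXT, so sorting or comparing them directly may behave lexicographically rather than numerically. The sketch below illustrates the difference in SQLite using three rows from the table, assuming the values are stored as strings exactly as displayed. Casting with `CAST(... AS REAL)` is one possible workaround, not something the dataset prescribes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the table: only the columns needed for this illustration.
conn.execute(
    "CREATE TABLE Marine_Sequences ("
    "Sequence_accession_number TEXT PRIMARY KEY, "
    "Sequence_length TEXT)"
)
conn.executemany(
    "INSERT INTO Marine_Sequences VALUES (?, ?)",
    [("GN042284", "723.0"), ("HC508354", "1686.0"), ("LP124750", "3537.0")],
)

# Text ordering compares character by character, so "723.0" sorts last.
text_order = [
    row[0]
    for row in conn.execute(
        "SELECT Sequence_accession_number FROM Marine_Sequences "
        "ORDER BY Sequence_length"
    )
]

# Casting to REAL restores numeric ordering by sequence length.
numeric_order = [
    row[0]
    for row in conn.execute(
        "SELECT Sequence_accession_number FROM Marine_Sequences "
        "ORDER BY CAST(Sequence_length AS REAL)"
    )
]

print(text_order)
print(numeric_order)
```

The same cast applies to range filters, for example `WHERE CAST(Sequence_length AS REAL) > 1000`.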