
Marine_Sequences

Provides detailed information about each marine genetic sequence, including its GC content, sequence length, and whether it contains protein-coding information.

Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT

Sequence_accession_number
Unique identifier for the genetic sequence as specified in INSDC databases (Primary Key). Searchable
Species_name
Species associated with the sequence, linking to the Marine Species table. Searchable
GC_content
GC content of the sequence, expressed as a percentage
Sequence_length
Length of the sequence
Sequence_status
Status of the sequence (e.g., confirmed, predicted)
Is_protein_coding_sequence
Indicates if the sequence is protein-coding (1 for yes, 0 for no)
Is_annotated
Indicates if the sequence has been annotated (1 for yes, 0 for no)
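
To make the relationship between GC_content and Sequence_length concrete, here is a minimal sketch of how a GC percentage can be computed from a nucleotide string. The function name `gc_content` and the toy sequence are hypothetical, and the dataset does not document how its own values were derived. The values on this page are, however, consistent with this formula: 151 G or C bases out of 384 gives roughly 39.32, matching the figure listed for CS417596.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = sequence.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Toy sequence (hypothetical, not taken from the dataset):
# 4 of its 8 bases are G or C.
example = "ATGCGCTA"
print(gc_content(example))  # 50.0
```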

18 rows where Is_annotated = 1, Sequence_length = "384.0" and Sequence_status = "observed"


Species_name 16

  • Geobacillus kaustophilus 2
  • Prosthecochloris aestuarii dsm 271 2
  • Bermanella marisrubri 1
  • Colwellia psychrerythraea 34h 1
  • Neptuniibacter caesariensis 1
  • Oceanobacillus iheyensis 1
  • Oncorhynchus mykiss 1
  • Photobacterium profundum 1
  • Piscirickettsia salmonis 1
  • Psammechinus miliaris 1
  • Pseudoalteromonas tunicata d2 1
  • Rhodopirellula baltica 1
  • Shewanella sp. mr-4 1
  • Vibrio cholerae o1 biovar el tor str. n16961 1
  • Vibrio parahaemolyticus rimd 2210633 1
  • Vibrio vulnificus cmcp6 1

Sequence_status 1

  • observed 18

Is_protein_coding_sequence 1

  • 1 18

Is_annotated 1

  • 1 18

Sequence_accession_number | Species_name | GC_content | Sequence_length | Sequence_status | Is_protein_coding_sequence | Is_annotated
--- | --- | --- | --- | --- | --- | ---
CS417596 | Piscirickettsia salmonis | 39.3229166666667 | 384.0 | observed | 1 | 1
DD287600 | Oncorhynchus mykiss | 47.1354166666667 | 384.0 | observed | 1 | 1
FB334630 | Psammechinus miliaris | 49.4791666666667 | 384.0 | observed | 1 | 1
GN111576 | Vibrio vulnificus cmcp6 | 45.0520833333333 | 384.0 | observed | 1 | 1
GN111582 | Vibrio parahaemolyticus rimd 2210633 | 44.0104166666667 | 384.0 | observed | 1 | 1
GN111638 | Colwellia psychrerythraea 34h | 38.0208333333333 | 384.0 | observed | 1 | 1
GN111688 | Shewanella sp. mr-4 | 48.9583333333333 | 384.0 | observed | 1 | 1
GN111698 | Pseudoalteromonas tunicata d2 | 39.5833333333333 | 384.0 | observed | 1 | 1
GN111702 | Neptuniibacter caesariensis | 44.2708333333333 | 384.0 | observed | 1 | 1
GN124380 | Vibrio cholerae o1 biovar el tor str. n16961 | 50.5208333333333 | 384.0 | observed | 1 | 1
HB437431 | Prosthecochloris aestuarii dsm 271 | 48.4375 | 384.0 | observed | 1 | 1
HB437985 | Rhodopirellula baltica | 55.7291666666667 | 384.0 | observed | 1 | 1
HB911734 | Photobacterium profundum | 41.6666666666667 | 384.0 | observed | 1 | 1
HC743342 | Prosthecochloris aestuarii dsm 271 | 51.0416666666667 | 384.0 | observed | 1 | 1
HC743364 | Bermanella marisrubri | 46.6145833333333 | 384.0 | observed | 1 | 1
HC743526 | Geobacillus kaustophilus | 52.6041666666667 | 384.0 | observed | 1 | 1
HC745217 | Geobacillus kaustophilus | 51.3020833333333 | 384.0 | observed | 1 | 1
HC745271 | Oceanobacillus iheyensis | 36.1979166666667 | 384.0 | observed | 1 | 1

CREATE TABLE Marine_Sequences (
    Sequence_accession_number TEXT PRIMARY KEY,
    Species_name TEXT,
    GC_content TEXT,
    Sequence_length TEXT,
    Sequence_status TEXT,
    Is_protein_coding_sequence INTEGER,
    Is_annotated INTEGER,
    FOREIGN KEY (Species_name) REFERENCES Marine_Species (Species_name),
    FOREIGN KEY (Sequence_accession_number) REFERENCES Sequences (Sequence_accession_number)
);
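
The schema stores GC_content and Sequence_length as TEXT, which is why the filter on this page compares Sequence_length to the string "384.0" rather than to a number. The sketch below, using Python's standard `sqlite3` module, reproduces that filter and shows an explicit cast for numeric comparisons. It is an illustration under stated assumptions: the table is rebuilt in memory without its foreign keys, only three rows from the listing above are loaded, and the GC threshold of 45 is an arbitrary example value.

```python
import sqlite3

# In-memory copy of the table; foreign keys are omitted because the
# referenced tables (Marine_Species, Sequences) are not recreated here.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE Marine_Sequences (
        Sequence_accession_number TEXT PRIMARY KEY,
        Species_name TEXT,
        GC_content TEXT,
        Sequence_length TEXT,
        Sequence_status TEXT,
        Is_protein_coding_sequence INTEGER,
        Is_annotated INTEGER
    )
    """
)

# Three rows copied from the listing on this page.
rows = [
    ("CS417596", "Piscirickettsia salmonis", "39.3229166666667", "384.0", "observed", 1, 1),
    ("DD287600", "Oncorhynchus mykiss", "47.1354166666667", "384.0", "observed", 1, 1),
    ("HB437985", "Rhodopirellula baltica", "55.7291666666667", "384.0", "observed", 1, 1),
]
conn.executemany("INSERT INTO Marine_Sequences VALUES (?, ?, ?, ?, ?, ?, ?)", rows)

# The page's filter: string equality against the TEXT columns.
filtered = conn.execute(
    """
    SELECT Sequence_accession_number
    FROM Marine_Sequences
    WHERE Is_annotated = 1
      AND Sequence_length = '384.0'
      AND Sequence_status = 'observed'
    ORDER BY Sequence_accession_number
    """
).fetchall()

# Numeric comparison: cast GC_content to REAL so values are compared
# as numbers rather than as strings.
high_gc = conn.execute(
    """
    SELECT Sequence_accession_number, CAST(GC_content AS REAL) AS gc
    FROM Marine_Sequences
    WHERE CAST(GC_content AS REAL) > 45
    ORDER BY gc DESC
    """
).fetchall()

print(filtered)
print(high_gc)
```

Casting in the query leaves the stored data untouched; anyone working with a local copy may prefer to convert these columns to numeric types once on import instead.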