home / MABPAT_dataset

Marine_Sequences

Provides detailed information about each marine genetic sequence, including its GC content, sequence length, and whether it contains protein-coding information.

Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT

Sequence_accession_number
Unique identifier for the genetic sequence as specified in INSDC databases (Primary Key). Searchable
Species_name
Species associated with the sequence, linking to the Marine Species table. Searchable
GC_content
GC content of the sequence
Sequence_length
Length of the sequence
Sequence_status
Status of the sequence (e.g., confirmed, predicted)
Is_protein_coding_sequence
Indicates if the sequence is protein-coding (1 for yes, 0 for no)
Is_annotated
Indicates if the sequence has been annotated (1 for yes, 0 for no)

27 rows where Sequence_length = "828.0" and Sequence_status = "observed"

✎ View and edit SQL

This data as json, CSV (advanced)

Suggested facets: GC_content

Species_name 18

  • Anguilla japonica 3
  • Geobacillus kaustophilus 3
  • Uncultured marine bacterium 66a03 3
  • Vibrio parahaemolyticus rimd 2210633 3
  • Shewanella oneidensis 2
  • Hyphomonas neptunium 1
  • Lactococcus garvieae 1
  • Moritella sp. pe36 1
  • Neptuniibacter caesariensis 1
  • Oncorhynchus mykiss 1
  • Ostreococcus tauri 1
  • Prochlorococcus marinus subsp. marinus str. ccmp1375 1
  • Raoultella ornithinolytica 1
  • Sulfitobacter sp. ee-36 1
  • Thermococcus kodakarensis 1
  • Trichodesmium erythraeum ims101 1
  • Vibrio cholerae 1
  • Zobellia galactanivorans 1

Is_protein_coding_sequence 2

  • 1 23
  • 0 4

Is_annotated 2

  • 1 16
  • 0 11

Sequence_status 1

  • observed · 27 ✖
Sequence_accession_number ▼ Species_name GC_content Sequence_length Sequence_status Is_protein_coding_sequence Is_annotated
AX008612 Zobellia galactanivorans 43.3574879227053 828.0 observed 1 1
AX109366 Raoultella ornithinolytica 54.5893719806763 828.0 observed 1 1
AX109942 Vibrio cholerae 49.6376811594203 828.0 observed 0 0
AX111128 Lactococcus garvieae 43.8405797101449 828.0 observed 0 0
CS469322 Thermococcus kodakarensis 53.6231884057971 828.0 observed 1 1
DI342078 Uncultured marine bacterium 66a03 34.7826086956522 828.0 observed 1 0
DI488592 Uncultured marine bacterium 66a03 50.6038647342995 828.0 observed 1 0
DM012307 Anguilla japonica 58.0917874396135 828.0 observed 1 1
DM012466 Anguilla japonica 59.9033816425121 828.0 observed 0 0
DM012591 Anguilla japonica 63.5265700483092 828.0 observed 0 0
DM171005 Geobacillus kaustophilus 53.8647342995169 828.0 observed 1 0
FB335534 Shewanella oneidensis 46.9806763285024 828.0 observed 1 1
GN060310 Geobacillus kaustophilus 53.8647342995169 828.0 observed 1 0
GN122190 Ostreococcus tauri 61.3526570048309 828.0 observed 1 1
GN128146 Sulfitobacter sp. ee-36 62.43961352657 828.0 observed 1 1
GN128366 Hyphomonas neptunium 64.6135265700483 828.0 observed 1 1
HB951622 Vibrio parahaemolyticus rimd 2210633 49.5169082125604 828.0 observed 1 1
HC750569 Prochlorococcus marinus subsp. marinus str. ccmp1375 41.1835748792271 828.0 observed 1 1
HC750743 Trichodesmium erythraeum ims101 43.9613526570048 828.0 observed 1 1
HC750779 Neptuniibacter caesariensis 50.7246376811594 828.0 observed 1 1
HC750789 Moritella sp. pe36 44.6859903381643 828.0 observed 1 1
HC904917 Vibrio parahaemolyticus rimd 2210633 45.2898550724638 828.0 observed 1 1
HD009032 Vibrio parahaemolyticus rimd 2210633 49.5169082125604 828.0 observed 1 0
HH766873 Shewanella oneidensis 46.9806763285024 828.0 observed 1 0
HW502733 Geobacillus kaustophilus 53.8647342995169 828.0 observed 1 1
LG178656 Uncultured marine bacterium 66a03 50.6038647342995 828.0 observed 1 0
LZ221772 Oncorhynchus mykiss 52.7777777777778 828.0 observed 1 1

Advanced export

JSON shape: default, array, newline-delimited, object

CSV options:

CREATE TABLE Marine_Sequences (
        Sequence_accession_number TEXT PRIMARY KEY,
        Species_name TEXT,
        GC_content TEXT,
        Sequence_length TEXT,
        Sequence_status TEXT,
        Is_protein_coding_sequence INTEGER,
        Is_annotated INTEGER,
        FOREIGN KEY (Species_name) REFERENCES Marine_Species (Species_name),
        FOREIGN KEY (Sequence_accession_number) REFERENCES Sequences (Sequence_accession_number)
    );
Powered by Datasette · Queries took 548.065ms · Data license: CC BY 4.0 · Data source: Novel dataset reveals growing prominence of deep-sea life for marine bioprospecting · About: About MABPAT