💬

KBase Genomes

kbase_genomes

Primary

Philosophy

Provide foundational genome sequence data. Access raw sequences, structural annotations, and protein products for detailed analysis.

Data sources: KBase genome repository NCBI JGI

Citation & Attribution

Provider: KBase, DOE

Website: https://www.kbase.us/

Scale

16
tables

Schema Browser

protein 253,173,194 rows

Unique protein sequences.

Column Type Description
protein_id string **Primary Key**. CDM UUID
hash string Content hash (MD5 of sequence)
description string Protein description
evidence_for_existence string Evidence type
length string Protein length (amino acids)
sequence string **Full amino acid sequence**

Sample Queries

Get genome metadata

SELECT *
FROM kbase_genomes.genome
LIMIT 20

Related Collections

Projects Using This Collection

Start Exploring

Access the full KBase Genomes data through BERDL JupyterHub.

Open JupyterHub