KBase Genomes
kbase_genomes
Philosophy
Provide foundational genome sequence data. Access raw sequences, structural annotations, and protein products for detailed analysis.
Data sources:
KBase genome repository
NCBI
JGI
Citation & Attribution
Provider: KBase, DOE
Website: https://www.kbase.us/
Scale
16
tables
Schema Browser
Tables (6)
contig_collection 293,059 contig 34,582,336 feature 1,011,650,903 encoded_feature 1,011,650,903 protein 253,173,194 name 1,046,526,298
protein
253,173,194 rows
Unique protein sequences.
| Column | Type | Description |
|---|---|---|
protein_id |
string | **Primary Key**. CDM UUID |
hash |
string | Content hash (MD5 of sequence) |
description |
string | Protein description |
evidence_for_existence |
string | Evidence type |
length |
string | Protein length (amino acids) |
sequence |
string | **Full amino acid sequence** |
Sample Queries
Get genome metadata
SELECT *
FROM kbase_genomes.genome
LIMIT 20