KBase Genomes
kbase_genomes
Philosophy
Provide foundational genome sequence data. Access raw sequences, structural annotations, and protein products for detailed analysis.
Data sources:
KBase genome repository
NCBI
JGI
Citation & Attribution
Provider: KBase, DOE
Website: https://www.kbase.us/
Scale
16 tables
Schema Browser
Tables (6)
| Table | Rows |
|---|---|
| contig_collection | 293,059 |
| contig | 34,582,336 |
| feature | 1,011,650,903 |
| encoded_feature | 1,011,650,903 |
| protein | 253,173,194 |
| name | 1,046,526,298 |
feature
1,011,650,903 rows
Gene/feature records with genomic coordinates.
| Column | Type | Description |
|---|---|---|
| feature_id | string | **Primary Key**. CDM UUID |
| hash | string | Content hash |
| cds_phase | string | CDS reading frame phase |
| e_value | string | Prediction e-value |
| p_value | string | Prediction p-value |
| start | string | Start coordinate |
| end | string | End coordinate |
| strand | string | Strand (+/-) |
| source_database | string | Source annotation database |
| protocol_id | string | Annotation protocol |
| type | string | Feature type (CDS, rRNA, tRNA, etc.) |
Sample Queries
Get genome metadata
SELECT *
FROM kbase_genomes.genome
LIMIT 20
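Summarize feature lengths by type

Every column in the feature table is typed as string, including start and end, so arithmetic on coordinates likely needs an explicit cast. The sketch below is not run against KBase itself. It builds a small in-memory SQLite table as a hypothetical stand-in for kbase_genomes.feature, with made-up rows, and shows the shape of such a query. It assumes coordinates are 1-based and inclusive with start <= end on both strands, which the schema above does not confirm. Cast syntax and identifier quoting may differ on the actual query engine.

```python
import sqlite3

# Hypothetical stand-in for kbase_genomes.feature: an in-memory SQLite table
# holding a few of the documented columns, all stored as text as in the schema.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE feature ('
    'feature_id TEXT PRIMARY KEY, "start" TEXT, "end" TEXT, strand TEXT, type TEXT)'
)

# Made-up rows for illustration only; these are not real KBase records.
conn.executemany(
    "INSERT INTO feature VALUES (?, ?, ?, ?, ?)",
    [
        ("f1", "100", "400", "+", "CDS"),
        ("f2", "900", "1500", "-", "CDS"),
        ("f3", "2000", "2075", "+", "tRNA"),
    ],
)

# Cast the string coordinates to integers before subtracting.
# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
QUERY = """
SELECT type,
       COUNT(*) AS n_features,
       AVG(CAST("end" AS INTEGER) - CAST("start" AS INTEGER) + 1) AS mean_length
FROM feature
GROUP BY type
ORDER BY type
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)
```

With a billion-row table, a query like this would normally be restricted with a WHERE clause or a LIMIT on a subquery before aggregating.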