🔬

NMDC Multi-omics

nmdc_arkin

Domain

Philosophy

Enable integrated microbiome analysis across multiple omics layers. Combine metabolomics, proteomics, and metagenomics with standardized annotations and embeddings for comprehensive sample characterization.

Data sources: NMDC COG KEGG MetaCyc GO

Citation & Attribution

Provider: NMDC

Website: https://microbiomedata.org/

Scale

48
studies
3M+
metabolomics_records
1.4M+
lipidomics_records
60+
tables

Schema Browser

Tables (6)

annotation_terms_unified 67,353
metabolomics_gold 3,129,061
lipidomics_gold 1,395,867
embeddings_v1 5,316
trait_features
study_table 48

Key Tables

Table Description Rows
annotation_terms_unified Unified annotation terms (COG, EC, GO, KEGG, MetaCyc) 67,353
metabolomics_gold Metabolomics measurements 3,129,061
lipidomics_gold Lipidomics measurements 1,395,867
embeddings_v1 256-dimensional sample embeddings 5,316
trait_features Microbial trait profiles (90+ traits)
study_table Study definitions 48

Sample Queries

Get NMDC studies

SELECT *
FROM nmdc_arkin.study_table

Get metabolomics data

SELECT *
FROM nmdc_arkin.metabolomics_gold
LIMIT 20

Related Collections

Projects Using This Collection

Start Exploring

Access the full NMDC Multi-omics data through BERDL JupyterHub.

Open JupyterHub