Overview
scRNA-seq data from the Immunobiology of Aging cohort was generated on the 10x Genomics Flex Gene Expression platform.
Below, we provide labeled and annotated PBMC scRNA-seq data from this healthy adult cohorts. We provide the full dataset, as well as data for each major class of cell types.
All .h5ad files for this project contain sample and subject metadata, in addition to cell type labels and QC metrics. Click the header below for descriptions of these metadata:
Each file contains sample-level metadata, as well as cell-level cell type labels and QC metrics. The following values are stored in the .obs
section of these .h5ad files as descriptions of observations:
Sample Identifierscohort.cohortGuid:
A Globally Unique Identifier (GUID) of the Cohort the subject enrolled in for our study subject.subjectGuid:
A GUID for the Subjectsample.sampleKitGuid:
A GUID for the Sample Kit, representing all material collected at a visitspecimen.specimenGuid:
A GUID for the specific aliquot used for the experiment
Subject Metadatasubject.biologicalSex:
The biological sex of the Subjectsubject.ageAtFirstDraw:
The Age of the Subject at their first on-study sample collectionsubject.race:
The self-reported Race of the Subjectsubject.ethnicity:
The self-reported Ethnicity of the subjectsubject.cmv:
The CMV Status of the subject, as determined by an HCMV assay
Sample Metadatasample.visitName:
The name of the study visit (i.e. time point)sample.subjectAgeAtDraw:
The age of the Subject in years at the time of sample collection
Process Identifiersbatch_id:
A GUID for the batch of samples processed together (e.g. B039)pool_id:
A GUID for the pool of samples combined for Cell Hashing (e.g. B039-P1)chip_id:
A GUID for the 10x Genomics chip the cells were loaded into (e.g. B039-P1C2)well_id:
A GUID for the 10x Genomics well the cells were loaded into within the chip (e.g. B039-P1C2W4)*barcodes:
A GUID for the individual celloriginal_barcodes:
The original, sequence-based barcode generated by 10x Genomics Cell Ranger softwarecell_name:
A quasi-unique, memorable cell identifier generated using an adjective-adjective-animal structure
*used as the primary cell index in our .h5ad files
Cell QC Metricsn_reads:
Number of reads assigned to the cell barcoden_umis:
Number of Unique Molecular Identifiers (unique molecules) detectedn_genes:
Number of genes with at least 1 UMI detectedtotal_counts_mito:
Total number of reads that were assigned to mitochondrial genespct_counts_mito:
Percent of reads that were assigned to mitochondrial genesdoublet_score:
Doublet score assigned by Scrublet for doublet detection
Cell Labeling ResultsAIFI_L1:
Final broad class cell type label (9 types)predicted_AIFI_L1:
Predicted AIFI_L1 type assigned by CellTypistAIFI_L2:
Final mid resolution cell type label (29 types)predicted_AIFI_L2:
Predicted AIFI_L2 type assigned by CellTypistAIFI_L3:
Final high resolution cell type label (71 types)predicted_AIFI_L3:
Predicted AIFI_L3 type assigned by CellTypist
Cell population .h5ad files
Here, we group cells by major population category. These files contain cells from all samples.
We are providing our scRNA-seq data in AnnData (.h5ad) format. For more details about AnnData, see the AnnData Documentation Page.
These files contain both normalized high-variance genes and raw count data. Normalized data is the active layer by default. In Python, the raw counts can be accessed using:
adata = adata.raw.to_adata()
Each file provided below contains the full set of ~3.8 million cells, or a subset for a major cell population category. Each file contains cells from the full set of 234 samples. Cell counts, and approximate file sizes are below:
File Name | N Cells | File Size |
---|---|---|
imm-of-aging_all_cells.h5ad | 3,758,514 | 50 GB |
imm-of-aging_b-plasma_cells.h5ad | 455,893 | 14 GB |
imm-of-aging_cd4-memory-treg_cells.h5ad | 854,753 | 30 GB |
imm-of-aging_cd4-naive_cells.h5ad | 770,809 | 25 GB |
imm-of-aging_cd8-gdt-mait-dnt_cells.h5ad | 717,559 | 25 GB |
imm-of-aging_dc-monocyte_cells.h5ad | 389,187 | 16 GB |
imm-of-aging_nk-ilc_cells.h5ad | 560,959 | 19 GB |
imm-of-aging_other_cells.h5ad | 9,354 | 0.25 GB |