Manage biological registries¶

This guide shows how to manage metadata for basic biological entities based on plugin bionty.

# pip install 'lamindb[bionty]'
!lamin init --storage ./test-registries --modules bionty

import lamindb as ln
import bionty as bt

→ connected lamindb: testuser1/test-registries

Import records from public ontologies¶

Let’s first populate our CellType registry with the default public ontology (Cell Ontology).

# [optional] inspect the available public ontology versions
bt.Source.df()

Show code cell output

Hide code cell output

	uid	entity	organism	name	in_db	currently_used	description	url	md5	source_website	space_id	dataframe_artifact_id	version	run_id	created_at	created_by_id	_aux	branch_id
id
1	33TUF039	bionty.Organism	vertebrates	ensembl	False	True	Ensembl	https://ftp.ensembl.org/pub/release-112/specie...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
2	6bbVUTCS	bionty.Organism	bacteria	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/bacte...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
3	6s9nV6xh	bionty.Organism	fungi	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/fungi...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
4	2PmTrc8x	bionty.Organism	metazoa	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/metaz...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
5	7GPHh16S	bionty.Organism	plants	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/plant...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
6	4tsksCMX	bionty.Organism	all	ncbitaxon	False	True	NCBItaxon Ontology	http://purl.obolibrary.org/obo/ncbitaxon/2023-...	None	https://github.com/obophenotype/ncbitaxon	1	None	2023-06-20	None	2025-07-15 14:29:55.057000+00:00	1	None	1
7	4UGNz3fr	bionty.Gene	human	ensembl	False	True	Ensembl	s3://bionty-assets/df_human__ensembl__release-...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
8	4r4fvV0S	bionty.Gene	mouse	ensembl	False	True	Ensembl	s3://bionty-assets/df_mouse__ensembl__release-...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
9	4RPA3Re0	bionty.Gene	saccharomyces cerevisiae	ensembl	False	True	Ensembl	s3://bionty-assets/df_saccharomyces cerevisiae...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
10	3EYyGRYN	bionty.Protein	human	uniprot	False	True	Uniprot	s3://bionty-assets/df_human__uniprot__2024-03_...	None	https://www.uniprot.org	1	None	2024-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
11	01RWXN2V	bionty.Protein	mouse	uniprot	False	True	Uniprot	s3://bionty-assets/df_mouse__uniprot__2024-03_...	None	https://www.uniprot.org	1	None	2024-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
12	3kDh8qAX	bionty.CellMarker	human	cellmarker	False	True	CellMarker	s3://bionty-assets/human_cellmarker_2.0_CellMa...	None	http://bio-bigdata.hrbmu.edu.cn/CellMarker	1	None	2.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
13	7bV5uJo3	bionty.CellMarker	mouse	cellmarker	False	True	CellMarker	s3://bionty-assets/mouse_cellmarker_2.0_CellMa...	None	http://bio-bigdata.hrbmu.edu.cn/CellMarker	1	None	2.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
14	6LyRtvz8	bionty.CellLine	all	clo	False	True	Cell Line Ontology	s3://bionty-assets/df_all__clo__2022-03-21__Ce...	None	https://bioportal.bioontology.org/ontologies/CLO	1	None	2022-03-21	None	2025-07-15 14:29:55.057000+00:00	1	None	1
15	2zHOtEVP	bionty.CellLine	all	depmap	False	False	Dependency Map	s3://bionty-assets/df_all__depmap__2024-Q2__Ce...	None	https://depmap.org/portal/	1	None	2024-Q2	None	2025-07-15 14:29:55.057000+00:00	1	None	1
16	3Uw2Va7a	bionty.CellType	all	cl	False	True	Cell Ontology	http://purl.obolibrary.org/obo/cl/releases/202...	None	https://obophenotype.github.io/cell-ontology	1	None	2024-08-16	None	2025-07-15 14:29:55.057000+00:00	1	None	1
17	MUtAGdL4	bionty.Tissue	all	uberon	False	True	Uberon multi-species anatomy ontology	http://purl.obolibrary.org/obo/uberon/releases...	None	http://obophenotype.github.io/uberon	1	None	2024-08-07	None	2025-07-15 14:29:55.057000+00:00	1	None	1
18	IGIkseWQ	bionty.Disease	all	mondo	False	True	Mondo Disease Ontology	http://purl.obolibrary.org/obo/mondo/releases/...	None	https://mondo.monarchinitiative.org	1	None	2025-06-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
19	4kswnHVF	bionty.Disease	human	doid	False	True	Human Disease Ontology	http://purl.obolibrary.org/obo/doid/releases/2...	None	https://disease-ontology.org	1	None	2024-05-29	None	2025-07-15 14:29:55.057000+00:00	1	None	1
20	25rhq3yV	bionty.Disease	human	icd	False	False	International Classification of Diseases (ICD)	s3://bionty-assets/df_human__icd__icd-11-2023_...	None	https://www.who.int/standards/classifications/...	1	None	icd-11-2023	None	2025-07-15 14:29:55.057000+00:00	1	None	1
21	2a1HvjdB	bionty.ExperimentalFactor	all	efo	False	True	The Experimental Factor Ontology	http://www.ebi.ac.uk/efo/releases/v3.70.0/efo.owl	None	https://bioportal.bioontology.org/ontologies/EFO	1	None	3.70.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
22	6S4qkDx1	bionty.Phenotype	all	pato	False	True	Phenotype And Trait Ontology	http://purl.obolibrary.org/obo/pato/releases/2...	None	https://github.com/pato-ontology/pato	1	None	2024-03-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
23	48fBFLmn	bionty.Phenotype	human	hp	False	True	Human Phenotype Ontology	https://github.com/obophenotype/human-phenotyp...	None	https://hpo.jax.org	1	None	2024-04-26	None	2025-07-15 14:29:55.057000+00:00	1	None	1
24	15uFx5W4	bionty.Phenotype	human	phe	False	False	Phecodes ICD10 map	s3://bionty-assets/df_human__phe__1.2__Phenoty...	None	https://phewascatalog.org/phecodes_icd10	1	None	1.2	None	2025-07-15 14:29:55.057000+00:00	1	None	1
25	7Ent3V2y	bionty.Pathway	all	go	False	True	Gene Ontology	http://purl.obolibrary.org/obo/go/releases/202...	None	http://geneontology.org	1	None	2024-06-17	None	2025-07-15 14:29:55.057000+00:00	1	None	1
26	40JkiRMw	bionty.Pathway	all	pw	False	False	Pathway Ontology	http://purl.obolibrary.org/obo/pw/7.84/pw.owl	None	https://www.ebi.ac.uk/ols/ontologies/pw	1	None	7.84	None	2025-07-15 14:29:55.057000+00:00	1	None	1
27	3rm9aOzL	BFXPipeline	all	lamin	False	True	Bioinformatics Pipeline	s3://bionty-assets/df_all__lamin__1.0.0__BFXpi...	None	https://lamin.ai	1	None	1.0.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
28	ugaIoIlj	Drug	all	dron	False	True	Drug Ontology	http://purl.obolibrary.org/obo/dron/releases/2...	None	https://bioportal.bioontology.org/ontologies/DRON	1	None	2024-08-05	None	2025-07-15 14:29:55.057000+00:00	1	None	1
29	1atB0WnU	Drug	all	chebi	False	False	Chemical Entities of Biological Interest	s3://bionty-assets/df_all__chebi__2024-07-27__...	None	https://www.ebi.ac.uk/chebi/	1	None	2024-07-27	None	2025-07-15 14:29:55.057000+00:00	1	None	1
30	1GbFkOdz	bionty.DevelopmentalStage	human	hsapdv	False	True	Human Developmental Stages	https://github.com/obophenotype/developmental-...	None	https://github.com/obophenotype/developmental-...	1	None	2024-05-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
31	10va5JSt	bionty.DevelopmentalStage	mouse	mmusdv	False	True	Mouse Developmental Stages	https://github.com/obophenotype/developmental-...	None	https://github.com/obophenotype/developmental-...	1	None	2024-05-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
32	MJRqduf9	bionty.Ethnicity	human	hancestro	False	True	Human Ancestry Ontology	http://purl.obolibrary.org/obo/hancestro/relea...	None	https://github.com/EBISPOT/hancestro	1	None	3.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
33	5JnVODh4	BioSample	all	ncbi	False	True	NCBI BioSample attributes	s3://bionty-assets/df_all__ncbi__2023-09__BioS...	None	https://www.ncbi.nlm.nih.gov/biosample/docs/at...	1	None	2023-09	None	2025-07-15 14:29:55.057000+00:00	1	None	1

# [optional] inspect which version we're about to import
bt.Source.get(entity="bionty.CellType", currently_used=True)

# populate the database with the public ontology
bt.CellType.import_source()

This is now your in-house cell type registry in which you can add & modify records as you like.

# all public cell types are now available in LaminDB
bt.CellType.df()

Show code cell output

Hide code cell output

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
2912	29rAJeJ9	endovascular extravillous trophoblast cell	CL:4033063	None	None	A Trophoblast Cell That Invades The Maternal S...	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
2913	7SMaSp5h	uterine resident macrophage	CL:4033064	None	None	A Tissue-Resident Macrophage That Is Part Of T...	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
2914	6wH4tvBX	preplasmablast	CL:4033065	None	pre-plasmablast\|preplasmablastic cell	A Mature B Cell That Serves As An Intermediate...	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
2915	3Tw0A57U	pre-granulosa cell	CL:4033066	None	ovarian pregranulosa cell\|pregranulosa cell	A Supporting Cell That Is Part Of The Ovary An...	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
2916	73IWLjLT	mural granulosa cell	CL:4033067	None	None	A Follicular Cell Of Ovary That Differentiates...	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
...	...	...	...	...	...	...	...	...	...	...	...	...	...
2888	698Db1Al	lung resident memory CD8-positive, alpha-beta ...	CL:4033039	None	None	An Alpha-Beta Cd8 T Cell That Resides In The L...	1	16	None	2025-07-15 14:30:04.127000+00:00	1	None	1
2889	1LsoaHZ2	lung resident memory CD8-positive, CD103-posit...	CL:4033040	None	lung resident memory CD8-positive CD103-positi...	A Lung Resident Memory Cd8-Positive, Alpha-Bet...	1	16	None	2025-07-15 14:30:04.127000+00:00	1	None	1
2890	7aGSQUeY	CCL3-positive alveolar macrophage	CL:4033041	None	alveolar macrophage CCL3-positive	An Alveolar Macrophage That Expresses Ccl3.	1	16	None	2025-07-15 14:30:04.127000+00:00	1	None	1
2891	1p6UUgMR	metallothionein-positive alveolar macrophage	CL:4033042	None	alveolar macrophage metallothionein-positive\|a...	An Alveolar Macrophage That Expresses Metallot...	1	16	None	2025-07-15 14:30:04.127000+00:00	1	None	1
2892	2NmnUGf5	lung interstitial macrophage	CL:4033043	None	None	A Macrophage That Is Part Of The Lung Connecti...	1	16	None	2025-07-15 14:30:04.127000+00:00	1	None	1

100 rows × 13 columns

# let's also populate the Gene registry with human and mouse genes
bt.Gene.import_source(organism="human")
bt.Gene.import_source(organism="mouse")

! Starting bulk_create for 75829 Gene records in batches of 10000

! Starting bulk_create for 57510 Gene records in batches of 10000

Access records in in-house registries¶

Search key words:

bt.CellType.search("gamma-delta T").df().head(2)

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
780	1HuNn2EP	gamma-delta T cell	CL:0000798	None	gamma-delta T-cell\|gamma-delta T lymphocyte\|ga...	A T Cell That Expresses A Gamma-Delta T Cell R...	1	16	None	2025-07-15 14:30:03.681000+00:00	1	None	1
781	70lHcCNw	immature gamma-delta T cell	CL:0000799	None	immature gamma-delta T lymphocyte\|immature gam...	A Gamma-Delta T Cell That Has An Immature Phen...	1	16	None	2025-07-15 14:30:03.681000+00:00	1	None	1

Or look up with auto-complete:

cell_types = bt.CellType.lookup()
hsc_record = cell_types.hematopoietic_stem_cell
hsc_record

CellType(uid='2U8xapxu', name='hematopoietic stem cell', ontology_id='CL:0000037', synonyms='hemopoietic stem cell|blood forming stem cell', description='A Stem Cell From Which All Cells Of The Lymphoid And Myeloid Lineages Develop, Including Blood Cells And Cells Of The Immune System. Hematopoietic Stem Cells Lack Cell Markers Of Effector Cells (Lin-Negative). Lin-Negative Is Defined By Lacking One Or More Of The Following Cell Surface Markers: Cd2, Cd3 Epsilon, Cd4, Cd5 ,Cd8 Alpha Chain, Cd11B, Cd14, Cd19, Cd20, Cd56, Ly6G, Ter119.', branch_id=1, space_id=1, created_by_id=1, source_id=16, created_at=2025-07-15 14:30:03 UTC)

Filter by fields and relationships:

gdt_cell = bt.CellType.get(ontology_id="CL:0000798", created_by__handle="testuser1")
gdt_cell

CellType(uid='1HuNn2EP', name='gamma-delta T cell', ontology_id='CL:0000798', synonyms='gamma-delta T-cell|gamma-delta T lymphocyte|gammadelta T cell|gamma-delta T-lymphocyte', description='A T Cell That Expresses A Gamma-Delta T Cell Receptor Complex.', branch_id=1, space_id=1, created_by_id=1, source_id=16, created_at=2025-07-15 14:30:03 UTC)

View the ontological hierarchy:

gdt_cell.view_parents()  # pass with_children=True to also view children

_images/263a9f9f74a548ea1b204bbe59b2b306f2a462ef89dc238e1ad3eaa22404a8f0.svg

Or access the parents and children directly:

gdt_cell.parents.df()

Show code cell output

Hide code cell output

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
83	22LvKd01	T cell	CL:0000084	None	T-cell\|T-lymphocyte\|T lymphocyte	A Type Of Lymphocyte Whose Defining Characteri...	1	16	None	2025-07-15 14:30:03.428000+00:00	1	None	1

gdt_cell.children.df()

Show code cell output

Hide code cell output

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
781	70lHcCNw	immature gamma-delta T cell	CL:0000799	None	immature gamma-delta T lymphocyte\|immature gam...	A Gamma-Delta T Cell That Has An Immature Phen...	1	16	None	2025-07-15 14:30:03.681000+00:00	1	None	1
782	3W6NKGpW	mature gamma-delta T cell	CL:0000800	None	mature gamma-delta T-lymphocyte\|mature gamma-d...	A Gamma-Delta T Cell That Has A Mature Phenoty...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
1465	26icgrTr	gamma-delta thymocyte	CL:0002405	None	gd thymocyte\|gammadelta thymocyte	A Post-Natal Thymocyte Expressing Components O...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
2921	5XXsI4tm	cycling gamma-delta T cell	CL:4033072	None	proliferating gamma-delta T cell	A(N) Gamma-Delta T Cell That Is Cycling.	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1

It is also possible to recursively query parents or children, getting direct parents (children), their parents, and so forth.

gdt_cell.query_parents().df()

Show code cell output

Hide code cell output

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
83	22LvKd01	T cell	CL:0000084	None	T-cell\|T-lymphocyte\|T lymphocyte	A Type Of Lymphocyte Whose Defining Characteri...	1	16	None	2025-07-15 14:30:03.428000+00:00	1	None	1
822	2Jgr5Xx4	mononuclear cell	CL:0000842	None	mononuclear leukocyte	A Leukocyte With A Single Non-Segmented Nucleu...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
214	2K93w3xO	motile cell	CL:0000219	None	None	A Cell That Moves By Its Own Activities.	1	16	None	2025-07-15 14:30:03.577000+00:00	1	None	1
221	2cXC7cgF	single nucleate cell	CL:0000226	None	None	A Cell With A Single Nucleus.	1	16	None	2025-07-15 14:30:03.577000+00:00	1	None	1
721	3VEAlFdi	leukocyte	CL:0000738	None	white blood cell\|leucocyte	An Achromatic Cell Of The Myeloid Or Lymphoid ...	1	16	None	2025-07-15 14:30:03.681000+00:00	1	None	1
967	4Ilrnj9U	hematopoietic cell	CL:0000988	None	haematopoietic cell\|hemopoietic cell\|haemopoie...	A Cell Of A Hematopoietic Lineage.	1	16	None	2025-07-15 14:30:03.725000+00:00	1	None	1
250	4WnpvUTH	eukaryotic cell	CL:0000255	None	None	Any Cell That Only Exists In Eukaryota.	1	16	None	2025-07-15 14:30:03.577000+00:00	1	None	1
1	4bKGljt0	cell	CL:0000000	None	None	A Material Entity Of Anatomical Origin (Part O...	1	16	None	2025-07-15 14:30:03.411000+00:00	1	None	1
529	X6c7osZ5	lymphocyte	CL:0000542	None	None	A Lymphocyte Is A Leukocyte Commonly Found In ...	1	16	None	2025-07-15 14:30:03.636000+00:00	1	None	1
1303	u3sr1Gdf	nucleate cell	CL:0002242	None	None	A Cell Containing At Least One Nucleus.	1	16	None	2025-07-15 14:30:03.799000+00:00	1	None	1

gdt_cell.query_children().df()

Show code cell output

Hide code cell output

	uid	name	ontology_id	abbr	synonyms	description	space_id	source_id	run_id	created_at	created_by_id	_aux	branch_id
id
1190	1DEERh4L	CD27-negative gamma-delta T cell	CL:0002125	None	gammadelta-17 cells	A Circulating Gamma-Delta T Cell That Expresse...	1	16	None	2025-07-15 14:30:03.769000+00:00	1	None	1
1572	1jlK4jJ9	Vgamma5-positive CD8alpha alpha positive gamma...	CL:0002513	None	tgd.vg5+.IEL	A Cd8Alpha Alpha Positive Gamma-Delta Intraepi...	1	16	None	2025-07-15 14:30:03.858000+00:00	1	None	1
785	1mNzVotO	CD4-negative CD8-negative gamma-delta intraepi...	CL:0000803	None	CD4-positive, gamma-delta intraepithelial T-ce...	A Gamma-Delta Intraepithelial T Cell That Has ...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
895	1tYOPZxH	dendritic epidermal T cell	CL:0000916	None	dendritic epidermal T lymphocyte\|DETC\|dendriti...	A Mature Gamma-Delta T Cell Located In The Epi...	1	16	None	2025-07-15 14:30:03.710000+00:00	1	None	1
1465	26icgrTr	gamma-delta thymocyte	CL:0002405	None	gd thymocyte\|gammadelta thymocyte	A Post-Natal Thymocyte Expressing Components O...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1475	2SYX59uO	immature Vgamma1.1-positive, Vdelta6.3-positiv...	CL:0002415	None	immature Vg1.1+Vd6.3+ T cell	A Vgamma1.1-Positive, Vdelta6.3-Positive Thymo...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
783	2xXcHDQq	gamma-delta intraepithelial T cell	CL:0000801	None	gamma-delta intraepithelial T-cell\|gamma-delta...	A Mature Gamma-Delta T Cell That Is Found In T...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
1468	3ABJ1l1O	immature Vgamma2-negative thymocyte	CL:0002408	None	None	A Double Negative Post-Natal Thymocyte That Ha...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
782	3W6NKGpW	mature gamma-delta T cell	CL:0000800	None	mature gamma-delta T-lymphocyte\|mature gamma-d...	A Gamma-Delta T Cell That Has A Mature Phenoty...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
1191	3efemme8	CD25-positive, CD27-positive immature gamma-de...	CL:0002126	None	None	A Cd25-Positive, Cd27-Positive Immature Gamma-...	1	16	None	2025-07-15 14:30:03.769000+00:00	1	None	1
1473	4cYNDr25	mature Vgamma1.1-positive, Vdelta6.3-negative ...	CL:0002413	None	None	A Vgamma1.1-Positive, Vdelta6.3-Negative Thymo...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1466	4hrSce5T	immature Vgamma2-positive thymocyte	CL:0002406	None	None	A Double Negative Post-Natal Thymocyte That Ha...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
2921	5XXsI4tm	cycling gamma-delta T cell	CL:4033072	None	proliferating gamma-delta T cell	A(N) Gamma-Delta T Cell That Is Cycling.	1	16	None	2025-07-15 14:30:04.138000+00:00	1	None	1
1474	5pDjyjfF	immature Vgamma1.1-positive, Vdelta6.3-negativ...	CL:0002414	None	None	A Vgamma1.1-Positive, Vdelta6.3-Negative Thymo...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1471	64PCjpkJ	Vgamma1.1-positive, Vdelta6.3-negative thymocyte	CL:0002411	None	Vg1.1-positive, Vd6.3-negative T cell	A Gamma-Delta Receptor That Expresses Vgamma1....	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1476	6JxpxGgM	mature Vgamma1.1-positive, Vdelta6.3-positive ...	CL:0002416	None	mature Vg1.1+Vd6.3+ T cell	A Vgamma1.1-Positive, Vdelta6.3-Positive Thymo...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1469	6RBJq86b	mature Vgamma2-negative thymocyte	CL:0002409	None	Vgamma2-negative	A Thymocyte That Has A T Cell Receptor Consist...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
784	6fdlvmJ3	CD8-alpha alpha positive, gamma-delta intraepi...	CL:0000802	None	CD8-positive, gamma-delta intraepithelial T-ly...	A Gamma-Delta Intraepithelial T Cell That Has ...	1	16	None	2025-07-15 14:30:03.695000+00:00	1	None	1
1472	6vYlL7zk	Vgamma1.1-positive, Vdelta6.3-positive thymocyte	CL:0002412	None	Vg1.1+Vd6.3+ T cell	A Gamma-Delta Receptor That Expresses Vgamma1....	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
781	70lHcCNw	immature gamma-delta T cell	CL:0000799	None	immature gamma-delta T lymphocyte\|immature gam...	A Gamma-Delta T Cell That Has An Immature Phen...	1	16	None	2025-07-15 14:30:03.681000+00:00	1	None	1
1467	76CEFg3A	mature Vgamma2-positive thymocyte	CL:0002407	None	Vgamma2-positive	A Thymocyte That Has A T Cell Receptor Consist...	1	16	None	2025-07-15 14:30:03.828000+00:00	1	None	1
1189	7MDv71IV	CD27-positive gamma-delta T cell	CL:0002124	None	gammadelta27-positive\|gd27-positive	A Circulating Gamma-Delta T Cell That Is Cd27-...	1	16	None	2025-07-15 14:30:03.769000+00:00	1	None	1
1573	E2koIf0l	Vgamma5-negative CD8alpha alpha positive gamma...	CL:0002514	None	tgd.vg5-.IEL	A Cd8Alpha Alpha Positive Gamma-Delta Intraepi...	1	16	None	2025-07-15 14:30:03.858000+00:00	1	None	1

You can construct custom hierarchies of records:

# register a new cell type
my_celltype = bt.CellType(name="my new T-cell subtype").save()
# specify "gamma-delta T cell" as a parent
my_celltype.parents.add(gdt_cell)

# visualize hierarchy
gdt_cell.view_parents(distance=2, with_children=True)

_images/dac450fae9ba35eae1fc937c952dcabc1eb65176ce3e3f1b4acd61f324f3bf9e.svg

Create records from values¶

When accessing datasets, one often encounters bulk references to entities that might be corrupted or standardized using different standardization schemes.

Let’s consider an example based on an AnnData object, in the cell_type annotations of this AnnData object, we find 4 references to cell types:

adata = ln.core.datasets.anndata_with_obs()
adata.obs.cell_type.value_counts()

We’d like to load the corresponding records in our in-house registry to annotate a dataset.

To this end, you’ll typically use from_values, which will both validate & retrieve records that match the values.

cell_types = bt.CellType.from_values(adata.obs.cell_type)
cell_types

Logging informed us that 3 cell types were validated. Since we loaded these records at the same time, we could readily use them to annotate a dataset.

Alternatively, we can retrieve records based on ontology ids:

adata.obs.cell_type_id.unique().tolist()

bt.CellType.from_values(adata.obs.cell_type_id, field=bt.CellType.ontology_id)

Validate & standardize¶

Simple validation of an iterable of values works like so:

bt.CellType.validate(["fat cell", "blood forming stem cell"])

Because these values don’t comply with the registry, they’re not validated!

You can easily convert these values to validated standardized names based on synonyms like so:

bt.CellType.standardize(["fat cell", "blood forming stem cell"])

Alternatively, you can use .from_values(), which will only ever return validated records and automatically standardize under-the-hood:

bt.CellType.from_values(["fat cell", "blood forming stem cell"])

If you are now sure what to do, use .inspect() to get instructions:

bt.CellType.inspect(["fat cell", "blood forming stem cell"]);

We can also add new synonyms to a record:

hsc_record.add_synonym("HSC")

And when we encounter this synonym as a value, it will now be standardized using synonyms-lookup, and mapped on the correct registry record:

bt.CellType.standardize(["HSC"])

A special synonym is .abbr (short for abbreviation), which has its own field and can be assigned via:

hsc_record.set_abbr("HSC")

You can create a lookup object from the .abbr field:

cell_types = bt.CellType.lookup("abbr")
cell_types.hsc

The same workflow works for all of bionty’s registries.

Manage registries across organisms¶

Several registries are organism-aware (has a .organism field), for instance, Gene.

In this case, API calls that interact with multi-organism registries require an organism argument when there’s ambiguity.

For instance, when validating gene symbols:

bt.Gene.validate(["TCF7", "ABC1"], organism="human")

In contrary, working with Ensembl Gene IDs doesn’t require passing organism, as there’s no ambiguity:

bt.Gene.validate(
    ["ENSG00000000419", "ENSMUSG00002076988"], field=bt.Gene.ensembl_gene_id
)

! 1 unique term (50.00%) is not validated for ensembl_gene_id: 'ENSMUSG00002076988'

array([ True, False])

When working with the same organism throughout your analysis/workflow, you can omit the organism argument by configuring it globally:

bt.settings.organism = "mouse"
bt.Gene.from_source(symbol="Ap5b1")

! using default organism = mouse

Gene(uid='3b8mHb0MRal4', symbol='Ap5b1', ensembl_gene_id='ENSMUSG00000049562', biotype='protein_coding', synonyms='Gm962', description='adaptor-related protein complex 5, beta 1 subunit ', branch_id=1, space_id=1, created_by_id=1, source_id=8, organism_id=2, created_at=2025-07-15 14:33:56 UTC)

Track underlying ontology source versions¶

Under-the-hood, source ontology versions are automatically tracked for each registry:

bt.Source.filter(currently_used=True).df()

Show code cell output

Hide code cell output

	uid	entity	organism	name	in_db	currently_used	description	url	md5	source_website	space_id	dataframe_artifact_id	version	run_id	created_at	created_by_id	_aux	branch_id
id
1	33TUF039	bionty.Organism	vertebrates	ensembl	False	True	Ensembl	https://ftp.ensembl.org/pub/release-112/specie...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
2	6bbVUTCS	bionty.Organism	bacteria	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/bacte...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
3	6s9nV6xh	bionty.Organism	fungi	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/fungi...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
4	2PmTrc8x	bionty.Organism	metazoa	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/metaz...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
5	7GPHh16S	bionty.Organism	plants	ensembl	False	True	Ensembl	https://ftp.ensemblgenomes.ebi.ac.uk/pub/plant...	None	https://www.ensembl.org	1	None	release-57	None	2025-07-15 14:29:55.057000+00:00	1	None	1
6	4tsksCMX	bionty.Organism	all	ncbitaxon	False	True	NCBItaxon Ontology	http://purl.obolibrary.org/obo/ncbitaxon/2023-...	None	https://github.com/obophenotype/ncbitaxon	1	None	2023-06-20	None	2025-07-15 14:29:55.057000+00:00	1	None	1
7	4UGNz3fr	bionty.Gene	human	ensembl	True	True	Ensembl	s3://bionty-assets/df_human__ensembl__release-...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
8	4r4fvV0S	bionty.Gene	mouse	ensembl	True	True	Ensembl	s3://bionty-assets/df_mouse__ensembl__release-...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
9	4RPA3Re0	bionty.Gene	saccharomyces cerevisiae	ensembl	False	True	Ensembl	s3://bionty-assets/df_saccharomyces cerevisiae...	None	https://www.ensembl.org	1	None	release-112	None	2025-07-15 14:29:55.057000+00:00	1	None	1
10	3EYyGRYN	bionty.Protein	human	uniprot	False	True	Uniprot	s3://bionty-assets/df_human__uniprot__2024-03_...	None	https://www.uniprot.org	1	None	2024-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
11	01RWXN2V	bionty.Protein	mouse	uniprot	False	True	Uniprot	s3://bionty-assets/df_mouse__uniprot__2024-03_...	None	https://www.uniprot.org	1	None	2024-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
12	3kDh8qAX	bionty.CellMarker	human	cellmarker	False	True	CellMarker	s3://bionty-assets/human_cellmarker_2.0_CellMa...	None	http://bio-bigdata.hrbmu.edu.cn/CellMarker	1	None	2.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
13	7bV5uJo3	bionty.CellMarker	mouse	cellmarker	False	True	CellMarker	s3://bionty-assets/mouse_cellmarker_2.0_CellMa...	None	http://bio-bigdata.hrbmu.edu.cn/CellMarker	1	None	2.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
14	6LyRtvz8	bionty.CellLine	all	clo	False	True	Cell Line Ontology	s3://bionty-assets/df_all__clo__2022-03-21__Ce...	None	https://bioportal.bioontology.org/ontologies/CLO	1	None	2022-03-21	None	2025-07-15 14:29:55.057000+00:00	1	None	1
16	3Uw2Va7a	bionty.CellType	all	cl	True	True	Cell Ontology	http://purl.obolibrary.org/obo/cl/releases/202...	None	https://obophenotype.github.io/cell-ontology	1	None	2024-08-16	None	2025-07-15 14:29:55.057000+00:00	1	None	1
17	MUtAGdL4	bionty.Tissue	all	uberon	False	True	Uberon multi-species anatomy ontology	http://purl.obolibrary.org/obo/uberon/releases...	None	http://obophenotype.github.io/uberon	1	None	2024-08-07	None	2025-07-15 14:29:55.057000+00:00	1	None	1
18	IGIkseWQ	bionty.Disease	all	mondo	False	True	Mondo Disease Ontology	http://purl.obolibrary.org/obo/mondo/releases/...	None	https://mondo.monarchinitiative.org	1	None	2025-06-03	None	2025-07-15 14:29:55.057000+00:00	1	None	1
19	4kswnHVF	bionty.Disease	human	doid	False	True	Human Disease Ontology	http://purl.obolibrary.org/obo/doid/releases/2...	None	https://disease-ontology.org	1	None	2024-05-29	None	2025-07-15 14:29:55.057000+00:00	1	None	1
21	2a1HvjdB	bionty.ExperimentalFactor	all	efo	False	True	The Experimental Factor Ontology	http://www.ebi.ac.uk/efo/releases/v3.70.0/efo.owl	None	https://bioportal.bioontology.org/ontologies/EFO	1	None	3.70.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
22	6S4qkDx1	bionty.Phenotype	all	pato	False	True	Phenotype And Trait Ontology	http://purl.obolibrary.org/obo/pato/releases/2...	None	https://github.com/pato-ontology/pato	1	None	2024-03-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
23	48fBFLmn	bionty.Phenotype	human	hp	False	True	Human Phenotype Ontology	https://github.com/obophenotype/human-phenotyp...	None	https://hpo.jax.org	1	None	2024-04-26	None	2025-07-15 14:29:55.057000+00:00	1	None	1
25	7Ent3V2y	bionty.Pathway	all	go	False	True	Gene Ontology	http://purl.obolibrary.org/obo/go/releases/202...	None	http://geneontology.org	1	None	2024-06-17	None	2025-07-15 14:29:55.057000+00:00	1	None	1
27	3rm9aOzL	BFXPipeline	all	lamin	False	True	Bioinformatics Pipeline	s3://bionty-assets/df_all__lamin__1.0.0__BFXpi...	None	https://lamin.ai	1	None	1.0.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
28	ugaIoIlj	Drug	all	dron	False	True	Drug Ontology	http://purl.obolibrary.org/obo/dron/releases/2...	None	https://bioportal.bioontology.org/ontologies/DRON	1	None	2024-08-05	None	2025-07-15 14:29:55.057000+00:00	1	None	1
30	1GbFkOdz	bionty.DevelopmentalStage	human	hsapdv	False	True	Human Developmental Stages	https://github.com/obophenotype/developmental-...	None	https://github.com/obophenotype/developmental-...	1	None	2024-05-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
31	10va5JSt	bionty.DevelopmentalStage	mouse	mmusdv	False	True	Mouse Developmental Stages	https://github.com/obophenotype/developmental-...	None	https://github.com/obophenotype/developmental-...	1	None	2024-05-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
32	MJRqduf9	bionty.Ethnicity	human	hancestro	False	True	Human Ancestry Ontology	http://purl.obolibrary.org/obo/hancestro/relea...	None	https://github.com/EBISPOT/hancestro	1	None	3.0	None	2025-07-15 14:29:55.057000+00:00	1	None	1
33	5JnVODh4	BioSample	all	ncbi	False	True	NCBI BioSample attributes	s3://bionty-assets/df_all__ncbi__2023-09__BioS...	None	https://www.ncbi.nlm.nih.gov/biosample/docs/at...	1	None	2023-09	None	2025-07-15 14:29:55.057000+00:00	1	None	1

Each record is linked to a versioned public source (if it was created from public):

hepatocyte = bt.CellType.get(name="hepatocyte")
hepatocyte.source

Create records from specific source¶

By default, new records are imported or created from the "currently_used" public sources which are configured during the instance initialization, e.g.:

bt.Source.filter(entity="bionty.Phenotype", currently_used=True).df()

Show code cell output

Hide code cell output

	uid	entity	organism	name	in_db	currently_used	description	url	md5	source_website	space_id	dataframe_artifact_id	version	run_id	created_at	created_by_id	_aux	branch_id
id
22	6S4qkDx1	bionty.Phenotype	all	pato	False	True	Phenotype And Trait Ontology	http://purl.obolibrary.org/obo/pato/releases/2...	None	https://github.com/pato-ontology/pato	1	None	2024-03-28	None	2025-07-15 14:29:55.057000+00:00	1	None	1
23	48fBFLmn	bionty.Phenotype	human	hp	False	True	Human Phenotype Ontology	https://github.com/obophenotype/human-phenotyp...	None	https://hpo.jax.org	1	None	2024-04-26	None	2025-07-15 14:29:55.057000+00:00	1	None	1

Sometimes, the default source doesn’t contain the ontology term you are looking for.

You can then specify to create a record from a non-default source. For instance, we can use the ncbitaxon ontology:

source = bt.Source.get(entity="bionty.Organism", name="ncbitaxon")
source

Source(uid='4tsksCMX', entity='bionty.Organism', organism='all', name='ncbitaxon', version='2023-06-20', in_db=False, currently_used=True, description='NCBItaxon Ontology', url='http://purl.obolibrary.org/obo/ncbitaxon/2023-06-20/ncbitaxon.owl', source_website='https://github.com/obophenotype/ncbitaxon', branch_id=1, space_id=1, created_by_id=1, created_at=2025-07-15 14:29:55 UTC)

# validate against the NCBI Taxonomy
bt.Organism.validate(
    ["iris setosa", "iris versicolor", "iris virginica"], source=source
)

# since we didn't seed the Organism registry with the NCBITaxon public ontology
# we need to save the records to the database
records = bt.Organism.from_values(
    ["iris setosa", "iris versicolor", "iris virginica"], source=source
).save()

# now we can query a iris organism and view its parents and children
bt.Organism.get(name="iris").view_parents(with_children=True)

Access any Ensembl genes¶

Genes from all Ensembl versions and organisms can be accessed, even though they are not yet present in the bt.Source registry.

For instance, if you want to use rabbit genes from Ensembl version release-103:

# pip install pymysql
import bionty as bt

# automatically download genes for a new organism
gene_ontology = bt.base.Gene(source="ensembl", organism="rabbit", version='release-103')

# register the new source in lamindb
gene_ontology.register_source_in_lamindb()

# now you can start using this source

# import all genes from this source to your Gene registry
source = bt.Source.get(entity="bionty.Gene", name="ensembl", organism="rabbit", version="release-103")
bt.Gene.import_source(source=source)