feat(zarr-adapter): recognize 'gene' as a gene-symbol column candidate by hweej · Pull Request #162 · cBioPortal/cell-explorer-py

hweej · 2026-05-22T13:58:14Z

Summary

Adds `gene` (the literal column name) to `GENE_SYMBOL_COLUMNS` so datasets like `egfr_all_cells.zarr` — which store symbols in a column named `gene` rather than the common `feature_name`/`gene_symbol` conventions — auto-resolve gene symbols correctly. Without this the chat agent's tools return raw Ensembl IDs and have to guess symbols from training data.

`gene` is the most ambiguous candidate (the name could in theory mean other things), so it sits at the end of the priority list. A dataset that has both `feature_name` and `gene` still picks `feature_name`.

Per-dataset overrides for truly unusual schemas are tracked separately.

Test plan

17 zarr_adapter tests pass locally, +2 new (gene resolution; feature_name-wins-over-gene priority)
CI green

egfr_all_cells.zarr stores symbols in a column literally named 'gene' rather than the common 'feature_name' / 'gene_symbol' conventions. Adds 'gene' at the END of the priority list so datasets with a canonical column still win. Per-dataset overrides for unusual schemas are tracked in a separate issue.

hweej self-assigned this May 22, 2026

hweej added the enhancement New feature or request label May 22, 2026

This was referenced May 22, 2026

feat(zarrstore): recognize 'gene' as a gene-symbol column candidate cBioPortal/cbioportal-cell-explorer#285

Merged

Per-dataset gene_label_column override #163

Open

hweej merged commit a5962a7 into main May 22, 2026
3 checks passed

hweej deleted the feat/gene-symbol-column-candidate-gene branch May 22, 2026 14:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(zarr-adapter): recognize 'gene' as a gene-symbol column candidate#162

feat(zarr-adapter): recognize 'gene' as a gene-symbol column candidate#162
hweej merged 1 commit into
mainfrom
feat/gene-symbol-column-candidate-gene

hweej commented May 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

hweej commented May 22, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant