From a4e348369974751eead9fd03c8057bee6639e34c Mon Sep 17 00:00:00 2001
From: Gaofei Zhao <zhaogaofeimail@gmail.com>
Date: Thu, 5 Mar 2026 14:45:41 -0500
Subject: [PATCH 1/4] initial plan

---
 plan.md | 147 ++++++++++++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 147 insertions(+)
 create mode 100644 plan.md

diff --git a/plan.md b/plan.md
new file mode 100644
index 00000000000..8fe542e5ab1
--- /dev/null
+++ b/plan.md
@@ -0,0 +1,147 @@
+# Resource Data Model Redesign Plan
+
+## Database Changes
+
+### `resource_definition` — unchanged
+
+Its `DISPLAY_NAME` serves as the top-level category label in the UI tree.
+
+### Drop old tables
+
+- `resource_sample`
+- `resource_patient`
+- `resource_study`
+
+### Add `resource_node`
+
+```sql
+CREATE TABLE `resource_node` (
+  `ID`                  bigint        NOT NULL AUTO_INCREMENT,
+  `RESOURCE_ID`         varchar(255)  NOT NULL,
+  `CANCER_STUDY_ID`     int(11)       NOT NULL,
+  `ENTITY_TYPE`         ENUM('STUDY','PATIENT','SAMPLE') NOT NULL,
+  `ENTITY_INTERNAL_ID`  int(11)       NOT NULL,
+  `PARENT_ID`           bigint        DEFAULT NULL,
+  `NODE_TYPE`           ENUM('GROUP','ITEM') NOT NULL,
+  `DISPLAY_NAME`        varchar(255)  NOT NULL,
+  `URL`                 varchar(512)  DEFAULT NULL,   -- ITEM nodes only
+  `TYPE`                varchar(64)   DEFAULT NULL,   -- ITEM nodes only; free-text
+  `METADATA`            JSON          DEFAULT NULL,
+  `PRIORITY`            int(11)       DEFAULT 0,
+  PRIMARY KEY (`ID`),
+  INDEX `idx_node_entity` (`RESOURCE_ID`, `CANCER_STUDY_ID`, `ENTITY_TYPE`, `ENTITY_INTERNAL_ID`),
+  INDEX `idx_node_parent` (`PARENT_ID`),
+  FOREIGN KEY (`RESOURCE_ID`, `CANCER_STUDY_ID`)
+      REFERENCES `resource_definition` (`RESOURCE_ID`, `CANCER_STUDY_ID`) ON DELETE CASCADE,
+  FOREIGN KEY (`PARENT_ID`)
+      REFERENCES `resource_node` (`ID`) ON DELETE CASCADE
+);
+```
+
+### Node semantics
+
+| Field | Description |
+|---|---|
+| `NODE_TYPE=GROUP` | Folder node; no URL; can have children |
+| `NODE_TYPE=ITEM` | Leaf node; has URL; no children |
+| `PARENT_ID=NULL` | Root node directly under a `resource_definition` |
+| `TYPE` | Free-text domain label (e.g. `H_AND_E`, `IHC`, `CT`, `BAM`, `PDF`); no schema change needed for new domains |
+
+### Migration
+
+Existing `resource_sample`, `resource_patient`, `resource_study` rows each become a root-level `ITEM` node with `PARENT_ID=NULL`.
+
+```sql
+-- Example: resource_patient → resource_node
+INSERT INTO resource_node
+  (RESOURCE_ID, CANCER_STUDY_ID, ENTITY_TYPE, ENTITY_INTERNAL_ID,
+   PARENT_ID, NODE_TYPE, DISPLAY_NAME, URL)
+SELECT
+  rd.RESOURCE_ID, rd.CANCER_STUDY_ID, 'PATIENT', rp.INTERNAL_ID,
+  NULL, 'ITEM', rd.DISPLAY_NAME, rp.URL
+FROM resource_patient rp
+JOIN resource_definition rd
+  ON rp.RESOURCE_ID = rd.RESOURCE_ID;
+
+-- Same pattern for resource_sample (ENTITY_TYPE='SAMPLE')
+-- and resource_study (ENTITY_TYPE='STUDY')
+```
+
+---
+
+## Import File Format
+
+Three optional columns added to existing data files
+(`data_resource_patient.txt`, `data_resource_sample.txt`, `data_resource_study.txt`):
+
+| Column | Description |
+|---|---|
+| `DISPLAY_NAME` | Human-readable label for the item |
+| `TYPE` | Free-text sub-classification (e.g. `H_AND_E`, `CT`, `BAM`) |
+| `GROUP_PATH` | `/`-separated path of GROUP ancestors (e.g. `Block A/H&E Panel`) |
+
+### Importer logic for `GROUP_PATH`
+
+1. Split value on `/` to get ordered path segments
+2. For each segment, upsert a GROUP node at the correct depth within the same `(RESOURCE_ID, ENTITY_ID)` scope
+3. Insert the ITEM node under the deepest GROUP
+4. Empty `GROUP_PATH` → root-level ITEM (backward compatible with old format)
+
+### Example
+
+```
+PATIENT_ID  RESOURCE_ID   URL                    DISPLAY_NAME     TYPE           GROUP_PATH
+P001        pathology     https://viewer/1       H&E              H_AND_E        Block A – Primary Tumor
+P001        pathology     https://viewer/2       IHC CD3          IHC            Block A – Primary Tumor
+P001        pathology     https://viewer/3       IHC PD-L1        IHC            Block A – Primary Tumor
+P001        pathology     https://viewer/4       H&E              H_AND_E        Block B – Metastasis
+P001        ct_scans      https://ohif/1         Instance         CT             CT 2023-01-15/Series 1: Axial T2
+P001        ct_scans      https://ohif/2         Instance         CT             CT 2023-01-15/Series 2: Coronal T1
+P001        reports       https://reports/1      Biopsy Report    PATH_REPORT    2023
+P001        raw_data      https://storage/1      Tumor BAM        BAM            WGS
+P001        publications  https://pubmed/1       TCGA Paper 2021  JOURNAL
+```
+
+---
+
+## UI Tree Structure
+
+```
+▼ Pathology                      ← resource_definition.DISPLAY_NAME
+  ▼ Block A – Primary Tumor      ← GROUP node
+      H&E            [H_AND_E]   → iframe
+      IHC CD3        [IHC]       → iframe
+      IHC PD-L1      [IHC]       → iframe
+  ▼ Block B – Metastasis         ← GROUP node
+      H&E            [H_AND_E]   → iframe
+
+▼ Radiology
+  ▼ CT 2023-01-15                ← GROUP node (depth 1)
+    ▼ Series 1: Axial T2         ← GROUP node (depth 2)
+        Instance     [CT]        → iframe
+    ▼ Series 2: Coronal T1
+        Instance     [CT]        → iframe
+
+▼ Clinical Reports
+  ▼ 2023                         ← GROUP node
+      Biopsy Report  [PATH_REPORT] → link
+
+▼ Raw Data Files
+  ▼ WGS                          ← GROUP node
+      Tumor BAM      [BAM]       → link
+      Normal BAM     [BAM]       → link
+
+▼ Publications                   ← flat, no GROUP
+    TCGA Paper 2021  [JOURNAL]   → link
+```
+
+---
+
+## Implementation Steps
+
+1. Write migration SQL (drop old tables → create `resource_node` → migrate existing rows)
+2. Bump DB schema version in `pom.xml` and `cgds.sql`
+3. Update data importer to handle `GROUP_PATH` resolution
+4. Update persistence layer (JPA/MyBatis) for `resource_node`
+5. Update REST API to return tree-structured resource responses
+6. Update frontend to render the accordion/tree UI

From bc7f1e475473bab490e932482f701e7133f398af Mon Sep 17 00:00:00 2001
From: Gaofei Zhao <zhaogaofeimail@gmail.com>
Date: Wed, 11 Mar 2026 11:42:23 -0400
Subject: [PATCH 2/4] plan: add METADATA and PRIORITY to import format; fix
 migration SQL

- Add METADATA and PRIORITY to node semantics table
- Expand import file format from 3 to 5 optional columns (add METADATA, PRIORITY)
- Add METADATA examples to data file example rows
- Fix migration SQL: use RESOURCE_ID (not rd.DISPLAY_NAME) as DISPLAY_NAME
- Expand migration SQL to cover all 3 tables (was placeholder comment)
- Fix resource_study migration to use CANCER_STUDY_ID as ENTITY_INTERNAL_ID
- Clarify GROUP node DISPLAY_NAME = path segment string in importer logic
- Update implementation step 3 to mention METADATA and PRIORITY handling

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 plan.md | 65 ++++++++++++++++++++++++++++++++++++++++-----------------
 1 file changed, 46 insertions(+), 19 deletions(-)

diff --git a/plan.md b/plan.md
index 8fe542e5ab1..fbbbec677c6 100644
--- a/plan.md
+++ b/plan.md
@@ -46,32 +46,57 @@ CREATE TABLE `resource_node` (
 | `NODE_TYPE=ITEM` | Leaf node; has URL; no children |
 | `PARENT_ID=NULL` | Root node directly under a `resource_definition` |
 | `TYPE` | Free-text domain label (e.g. `H_AND_E`, `IHC`, `CT`, `BAM`, `PDF`); no schema change needed for new domains |
+| `METADATA` | JSON blob for descriptive domain-specific data (e.g. stain info, acquisition params) |
+| `PRIORITY` | Integer for explicit display ordering; kept as a dedicated column (not in METADATA) so `ORDER BY PRIORITY` is efficient |
 
 ### Migration
 
-Existing `resource_sample`, `resource_patient`, `resource_study` rows each become a root-level `ITEM` node with `PARENT_ID=NULL`.
+Existing `resource_sample`, `resource_patient`, `resource_study` rows each become a root-level `ITEM` node with `PARENT_ID=NULL`. Migrated rows will have `TYPE=NULL` and `METADATA=NULL` since old tables have no equivalent columns — this is acceptable and expected.
+
+**Note on `DISPLAY_NAME` in migration:** old tables have no per-row display name. The resource ID is used as the node's `DISPLAY_NAME`, NOT `resource_definition.DISPLAY_NAME`, which is already used as the tree root label.
+
+**Note on `ENTITY_INTERNAL_ID` for `resource_study`:** study-scoped nodes have no patient/sample to reference. Use the study's own `CANCER_STUDY_ID` as `ENTITY_INTERNAL_ID`.
 
 ```sql
--- Example: resource_patient → resource_node
+-- resource_patient → resource_node
 INSERT INTO resource_node
   (RESOURCE_ID, CANCER_STUDY_ID, ENTITY_TYPE, ENTITY_INTERNAL_ID,
    PARENT_ID, NODE_TYPE, DISPLAY_NAME, URL)
 SELECT
-  rd.RESOURCE_ID, rd.CANCER_STUDY_ID, 'PATIENT', rp.INTERNAL_ID,
-  NULL, 'ITEM', rd.DISPLAY_NAME, rp.URL
+  rp.RESOURCE_ID, rd.CANCER_STUDY_ID, 'PATIENT', rp.INTERNAL_ID,
+  NULL, 'ITEM', rp.RESOURCE_ID, rp.URL
 FROM resource_patient rp
 JOIN resource_definition rd
   ON rp.RESOURCE_ID = rd.RESOURCE_ID;
 
--- Same pattern for resource_sample (ENTITY_TYPE='SAMPLE')
--- and resource_study (ENTITY_TYPE='STUDY')
+-- resource_sample → resource_node (ENTITY_TYPE='SAMPLE')
+INSERT INTO resource_node
+  (RESOURCE_ID, CANCER_STUDY_ID, ENTITY_TYPE, ENTITY_INTERNAL_ID,
+   PARENT_ID, NODE_TYPE, DISPLAY_NAME, URL)
+SELECT
+  rs.RESOURCE_ID, rd.CANCER_STUDY_ID, 'SAMPLE', rs.INTERNAL_ID,
+  NULL, 'ITEM', rs.RESOURCE_ID, rs.URL
+FROM resource_sample rs
+JOIN resource_definition rd
+  ON rs.RESOURCE_ID = rd.RESOURCE_ID;
+
+-- resource_study → resource_node (ENTITY_TYPE='STUDY', ENTITY_INTERNAL_ID=CANCER_STUDY_ID)
+INSERT INTO resource_node
+  (RESOURCE_ID, CANCER_STUDY_ID, ENTITY_TYPE, ENTITY_INTERNAL_ID,
+   PARENT_ID, NODE_TYPE, DISPLAY_NAME, URL)
+SELECT
+  rs.RESOURCE_ID, rd.CANCER_STUDY_ID, 'STUDY', rd.CANCER_STUDY_ID,
+  NULL, 'ITEM', rs.RESOURCE_ID, rs.URL
+FROM resource_study rs
+JOIN resource_definition rd
+  ON rs.RESOURCE_ID = rd.RESOURCE_ID;
 ```
 
 ---
 
 ## Import File Format
 
-Three optional columns added to existing data files
+Five optional columns added to existing data files
 (`data_resource_patient.txt`, `data_resource_sample.txt`, `data_resource_study.txt`):
 
 | Column | Description |
@@ -79,27 +104,29 @@ Three optional columns added to existing data files
 | `DISPLAY_NAME` | Human-readable label for the item |
 | `TYPE` | Free-text sub-classification (e.g. `H_AND_E`, `CT`, `BAM`) |
 | `GROUP_PATH` | `/`-separated path of GROUP ancestors (e.g. `Block A/H&E Panel`) |
+| `METADATA` | JSON string for arbitrary domain-specific key/value data |
+| `PRIORITY` | Integer display order (default `0`); lower values appear first |
 
 ### Importer logic for `GROUP_PATH`
 
 1. Split value on `/` to get ordered path segments
-2. For each segment, upsert a GROUP node at the correct depth within the same `(RESOURCE_ID, ENTITY_ID)` scope
+2. For each segment, upsert a GROUP node at the correct depth within the same `(RESOURCE_ID, ENTITY_ID)` scope; use the path segment string as the GROUP node's `DISPLAY_NAME`
 3. Insert the ITEM node under the deepest GROUP
 4. Empty `GROUP_PATH` → root-level ITEM (backward compatible with old format)
 
 ### Example
 
 ```
-PATIENT_ID  RESOURCE_ID   URL                    DISPLAY_NAME     TYPE           GROUP_PATH
-P001        pathology     https://viewer/1       H&E              H_AND_E        Block A – Primary Tumor
-P001        pathology     https://viewer/2       IHC CD3          IHC            Block A – Primary Tumor
-P001        pathology     https://viewer/3       IHC PD-L1        IHC            Block A – Primary Tumor
-P001        pathology     https://viewer/4       H&E              H_AND_E        Block B – Metastasis
-P001        ct_scans      https://ohif/1         Instance         CT             CT 2023-01-15/Series 1: Axial T2
-P001        ct_scans      https://ohif/2         Instance         CT             CT 2023-01-15/Series 2: Coronal T1
-P001        reports       https://reports/1      Biopsy Report    PATH_REPORT    2023
-P001        raw_data      https://storage/1      Tumor BAM        BAM            WGS
-P001        publications  https://pubmed/1       TCGA Paper 2021  JOURNAL
+PATIENT_ID  RESOURCE_ID   URL                    DISPLAY_NAME     TYPE           GROUP_PATH                         METADATA                                        PRIORITY
+P001        pathology     https://viewer/1       H&E              H_AND_E        Block A – Primary Tumor            {"stain":"hematoxylin","magnification":"20x"}    0
+P001        pathology     https://viewer/2       IHC CD3          IHC            Block A – Primary Tumor            {"antibody":"CD3","clone":"SP7"}                 1
+P001        pathology     https://viewer/3       IHC PD-L1        IHC            Block A – Primary Tumor            {"antibody":"PD-L1","clone":"22C3"}              2
+P001        pathology     https://viewer/4       H&E              H_AND_E        Block B – Metastasis               {"stain":"hematoxylin","magnification":"20x"}    0
+P001        ct_scans      https://ohif/1         Instance         CT             CT 2023-01-15/Series 1: Axial T2   {"modality":"CT","slices":120}                  0
+P001        ct_scans      https://ohif/2         Instance         CT             CT 2023-01-15/Series 2: Coronal T1 {"modality":"CT","slices":80}                   0
+P001        reports       https://reports/1      Biopsy Report    PATH_REPORT    2023                                                                               0
+P001        raw_data      https://storage/1      Tumor BAM        BAM            WGS                                                                                0
+P001        publications  https://pubmed/1       TCGA Paper 2021  JOURNAL                                                                                           0
 ```
 
 ---
@@ -141,7 +168,7 @@ P001        publications  https://pubmed/1       TCGA Paper 2021  JOURNAL
 
 1. Write migration SQL (drop old tables → create `resource_node` → migrate existing rows)
 2. Bump DB schema version in `pom.xml` and `cgds.sql`
-3. Update data importer to handle `GROUP_PATH` resolution
+3. Update data importer to handle `GROUP_PATH` resolution, `METADATA`, and `PRIORITY`
 4. Update persistence layer (JPA/MyBatis) for `resource_node`
 5. Update REST API to return tree-structured resource responses
 6. Update frontend to render the accordion/tree UI

From 8f281089fedc05bd4a3639a283b1d6d284893de1 Mon Sep 17 00:00:00 2001
From: Gaofei Zhao <zhaogaofeimail@gmail.com>
Date: Wed, 11 Mar 2026 12:00:05 -0400
Subject: [PATCH 3/4] plan: PATIENT_ID and SAMPLE_ID are optional; study
 resources omit them
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit

Entity ID columns are optional — their presence depends on resource scope:
- STUDY resources: no PATIENT_ID or SAMPLE_ID
- PATIENT resources: PATIENT_ID required, no SAMPLE_ID
- SAMPLE resources: both PATIENT_ID and SAMPLE_ID required

Entity type is resolved from resource_definition.RESOURCE_TYPE, not
column presence. Updated column table and added a study-resource example.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 plan.md | 31 +++++++++++++++++++++----------
 1 file changed, 21 insertions(+), 10 deletions(-)

diff --git a/plan.md b/plan.md
index fbbbec677c6..2ff788405dc 100644
--- a/plan.md
+++ b/plan.md
@@ -96,16 +96,21 @@ JOIN resource_definition rd
 
 ## Import File Format
 
-Five optional columns added to existing data files
-(`data_resource_patient.txt`, `data_resource_sample.txt`, `data_resource_study.txt`):
-
-| Column | Description |
-|---|---|
-| `DISPLAY_NAME` | Human-readable label for the item |
-| `TYPE` | Free-text sub-classification (e.g. `H_AND_E`, `CT`, `BAM`) |
-| `GROUP_PATH` | `/`-separated path of GROUP ancestors (e.g. `Block A/H&E Panel`) |
-| `METADATA` | JSON string for arbitrary domain-specific key/value data |
-| `PRIORITY` | Integer display order (default `0`); lower values appear first |
+All three data files share a unified column set. Required vs optional columns:
+
+| Column | Required | Description |
+|---|---|---|
+| `PATIENT_ID` | Optional | Required for patient- and sample-scoped resources; omit for study resources |
+| `SAMPLE_ID` | Optional | Required for sample-scoped resources only; omit for patient/study resources |
+| `RESOURCE_ID` | Required | References a `resource_definition` |
+| `URL` | Required | The resource URL |
+| `DISPLAY_NAME` | Optional | Human-readable label for the item |
+| `TYPE` | Optional | Free-text sub-classification (e.g. `H_AND_E`, `CT`, `BAM`) |
+| `GROUP_PATH` | Optional | `/`-separated path of GROUP ancestors (e.g. `Block A/H&E Panel`) |
+| `METADATA` | Optional | JSON string for arbitrary domain-specific key/value data |
+| `PRIORITY` | Optional | Integer display order (default `0`); lower values appear first |
+
+The importer determines the entity type from the `resource_definition.RESOURCE_TYPE` field (STUDY/PATIENT/SAMPLE) rather than from column presence.
 
 ### Importer logic for `GROUP_PATH`
 
@@ -117,6 +122,7 @@ Five optional columns added to existing data files
 ### Example
 
 ```
+# data_resource_patient.txt — PATIENT_ID required, no SAMPLE_ID
 PATIENT_ID  RESOURCE_ID   URL                    DISPLAY_NAME     TYPE           GROUP_PATH                         METADATA                                        PRIORITY
 P001        pathology     https://viewer/1       H&E              H_AND_E        Block A – Primary Tumor            {"stain":"hematoxylin","magnification":"20x"}    0
 P001        pathology     https://viewer/2       IHC CD3          IHC            Block A – Primary Tumor            {"antibody":"CD3","clone":"SP7"}                 1
@@ -127,6 +133,11 @@ P001        ct_scans      https://ohif/2         Instance         CT
 P001        reports       https://reports/1      Biopsy Report    PATH_REPORT    2023                                                                               0
 P001        raw_data      https://storage/1      Tumor BAM        BAM            WGS                                                                                0
 P001        publications  https://pubmed/1       TCGA Paper 2021  JOURNAL                                                                                           0
+
+# data_resource_study.txt — no PATIENT_ID or SAMPLE_ID
+RESOURCE_ID     URL                           DISPLAY_NAME          TYPE        GROUP_PATH   METADATA   PRIORITY
+study_sponsors  https://sponsor-info/1        Sponsor Overview      PDF                                 0
+study_protocol  https://protocol-docs/1       Protocol v2           PDF         2023                    0
 ```
 
 ---

From 371a199b00c68c322ba9f20e1cd1a9ccbb4f262b Mon Sep 17 00:00:00 2001
From: Gaofei Zhao <zhaogaofeimail@gmail.com>
Date: Wed, 11 Mar 2026 12:03:34 -0400
Subject: [PATCH 4/4] plan: use resource_definition.DISPLAY_NAME as migrated
 node DISPLAY_NAME

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
---
 plan.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/plan.md b/plan.md
index 2ff788405dc..e5d504ef8b5 100644
--- a/plan.md
+++ b/plan.md
@@ -53,7 +53,7 @@ CREATE TABLE `resource_node` (
 
 Existing `resource_sample`, `resource_patient`, `resource_study` rows each become a root-level `ITEM` node with `PARENT_ID=NULL`. Migrated rows will have `TYPE=NULL` and `METADATA=NULL` since old tables have no equivalent columns — this is acceptable and expected.
 
-**Note on `DISPLAY_NAME` in migration:** old tables have no per-row display name. The resource ID is used as the node's `DISPLAY_NAME`, NOT `resource_definition.DISPLAY_NAME`, which is already used as the tree root label.
+**Note on `DISPLAY_NAME` in migration:** old tables have no per-row display name. The `resource_definition.DISPLAY_NAME` is used as the node's `DISPLAY_NAME`.
 
 **Note on `ENTITY_INTERNAL_ID` for `resource_study`:** study-scoped nodes have no patient/sample to reference. Use the study's own `CANCER_STUDY_ID` as `ENTITY_INTERNAL_ID`.