Cmd line by J-Fabila · Pull Request #61 · DataTools4Heart/flcore

J-Fabila · 2026-02-27T19:05:53Z

load base format

…_line

Copilot

Pull request overview

This PR updates command-line workflows for dataset/model configuration, adds base-format dataset loading, and expands model saving/explainability support across several federated model clients.

Changes:

Adds CLI/config options for checkpoint saving, survival inputs, and base-format dataset loading.
Adds model checkpoint metadata saving and SHAP-style explainability hooks for several models.
Updates Docker/dependency setup and XGBoost federated aggregation logic, while removing prior test/package tooling and sample datasets.

Reviewed changes

Copilot reviewed 34 out of 41 changed files in this pull request and generated 18 comments.

Show a summary per file

File	Description
`client_cmd.py`	Adds CLI defaults/options and logging changes for client startup.
`server_cmd.py`	Updates logging and result compilation placement.
`flcore/datasets.py`	Adds `base_format`, rewrites DT4H/survival loading, and changes partition helpers.
`flcore/utils.py`	Updates config validation and metadata-derived output sizing.
`flcore/metrics.py`	Adjusts metric tensor shaping by task.
`flcore/models/xgb/server.py`	Replaces XGBoost strategy/aggregation implementation.
`flcore/models/xgb/client.py`	Refactors XGBoost client config and model saving.
`flcore/models/random_forest/client.py`	Adds periodic model/metadata saving and partition task handling.
`flcore/models/linear_models/client.py`	Adjusts evaluation and adds model/metadata saving.
`flcore/models/logistic_regression/client.py`	Adds model/metadata saving hooks.
`flcore/models/nn/client.py`	Adds periodic NN checkpoint/metadata saving.
`flcore/models/nn/FedCustomAggregator.py`	Removes debug entropy logging.
`flcore/models/cox/*`	Adds strategy client counts, save hooks, and explainability API.
`flcore/models/rsf/*`	Adds strategy client counts, save hooks, and explainability API.
`flcore/models/gbs/*`	Adds strategy client counts, save hooks, and explainability API.
`requirements.txt`	Updates Flower/PyYAML versions and adds setuptools.
`Dockerfile`	Switches to `python:3.11-slim` and revises package installation.
`tutorial.md`	Adds command-line argument reference documentation.
`pruebas.md`	Adds example command table.
`muestras`	Adds dataset metadata/sample notes.
`tox.ini`	Removes tox test/lint configuration.
`pyproject.toml`	Removes package/build metadata.
`tests/test_models.py`	Removes existing model tests.
`repeated.py`	Removes repeated experiment helper script.
`dataset/maggic/metadata.json`	Removes bundled MAGGIC metadata sample.
`dataset/kaggle_hf.csv`	Removes bundled Kaggle HF sample dataset.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

-RUN pip3 install -r /home/requirements.txt
-RUN ln -s /usr/bin/python3 /usr/bin/python
+RUN apt-get update && DEBIAN_FRONTEND=noninteractive apt-get install -y \
+    iputils-ping curl wget \


+    with open("dataset_description.json", 'r') as file:
        metadata = json.load(file)

-    data_file = Path(config['data_file'])
-    dat = pd.read_parquet(data_file)
-
+    dat = pd.read_csv("data.csv")


+    target_labels = config["target_labels"]
+    train_labels = config["train_labels"]
+
+    split_idx = int(dat_len * config["train_size"])


-    sss = get_stratifiedPartitions(n_splits,test_size, random_state)
-    splits_nested = (sss.split(X_data, y_data))
-    return splits_nested
+def split_partitions(n_splits, test_size, random_state, X_data, y_data, task):


+        self.config = config
        self.local_data = local_data
-        self.saving_path = Path(saving_path)
+        self.saving_path = config["experiment_dir"]


+        if self.round % self.config["save_every_n_rounds"] == 0:
+            self.save_model()


+        """
+        Generate SHAP values for the CoxPH model.
+        """
+        import shap


+        """
+        Generate SHAP values for the GBS model (FPBoost).
+        """
+        import shap


+        """
+        Generate SHAP values for the RandomSurvivalForest model.
+        """
+        import shap


+        label_to_int = {v: int(k) for k, v in categories.items()}
+        label_to_int.update({int(k): int(k) for k in categories})
+        label_to_int.update({k: int(k) for k in categories})
+        cat_map[col] = label_to_int
+        for col, mapa in cat_map.items():
+            dat[col] = dat[col].map(mapa)
+
+        for feat in metadata:
+            if feat["type"] == "continuous":
+                # Should we normalize?
+                pass


J-Fabila added 30 commits February 15, 2026 01:22

pruebas

d80812a

corregido

9d58702

load base nuevo

e6bc4c4

correcciones

57a7483

2 a 1 cliente para pruebas

ed658aa

correccion nuevo formato

0e363b2

cambios en train labels

5ab3791

sinteticos prueba nuevo formato

a4ccd0a

listo para tests

a57bac1

nueva data sintetica

ba63a3e

me es util

a4cbb29

prueas nuevas etiquetas

0650766

añadido el sintetico

b259106

faltaba outcomes de convertir al formato

8abee63

utils n out feats corregido

f41ca52

tutorial nuevo

0632bfb

correccion

d81d943

corrección en el iloc

1bc82f8

correccion lineales cliente multiclass

5ce7bb0

corrección para que aceptes las labels

a7439de

client stratify corregido

3d36d79

nuevo dockerfile

a8aa11a

cliente corregido linear

a30d6b5

metrics correccion segun task

e55563f

sanityc heck mejorado

22ba136

lsvc::corrección para multiclass

6103327

corrección en el sanity check

c199b64

daatsets con stratify y sin para regresion

c1db94c

client llama al dataset split correcto

cb4bb47

correccion multiclass

abda8d4

J-Fabila and others added 27 commits March 16, 2026 10:15

corrección en el save

ba5a9a1

correcciones

2a1b1b7

solved nout

b58ebb4

correcciones cmd line

c91e79c

ajuste en sanity check

999140f

dataloader nuevo

5c37263

Merge branch 'cmd_line' of github.com:DataTools4Heart/flcore into cmd…

a7757b1

…_line

read csv -> read parquet

46e593c

movido al repo correcto

bb5f28f

client cmd save every añadida

7bd1367

function save every N añadido

1807e65

correccion en contador

6d17aae

logistic reg::save model añadido

d902486

XGB client save model añadido

a319585

random forest client save model añadido

a672d3f

NN client save model añadido

5298749

rsf client save model añadido

b472e04

cox cliente save model añadido

8a71e1a

gbs cliente save model añadido

8206747

survival corrections and expleinability

ba77d33

requirements arreglado

9c4c076

robust datareader

614e22e

actualizado

1138776

removed innecesary files and dirs

1494fe2

clean nans añadido

21ada9f

correction in metadata format

864906c

logs limpios

164d726

J-Fabila requested a review from Copilot May 28, 2026 12:18

Copilot started reviewing on behalf of J-Fabila May 28, 2026 12:18 View session

Copilot AI reviewed May 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cmd line#61

Cmd line#61
J-Fabila wants to merge 64 commits into
mainfrom
cmd_line

J-Fabila commented Feb 27, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		if self.round % self.config["save_every_n_rounds"] == 0:
		self.save_model()

Conversation

J-Fabila commented Feb 27, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants