Remove the need for `to_homogeneous(dataset.train_node_ids)` for Labeled homogeneous inputs #115

kmontemayor2-sc · 2025-06-26T18:31:37Z

Headline change here is to remove the need to do:

loader = DistABLPLoader(input_nodes=to_homogeneous(dataset.train_node_ids), ...)

As demonstrated in _run_cora_supervised 1 and _run_distributed_neighbor_loader_labeled_homogeneous 2.

Also:

Breakout node input parsing logic to shared util neighborloader.resolve_node_sampler_input_from_user_input
Add NodeSamplerInput type alias to be shared between DistABLPLoader and DistNeighborLoader.

Added unit tests for new util.

kmontemayor2-sc · 2025-06-26T18:32:13Z

/unit_test

kmontemayor2-sc · 2025-06-26T18:32:18Z

/e2e_test

kmontemayor2-sc · 2025-06-26T18:32:22Z

/integration_test

github-actions · 2025-06-26T18:32:24Z

GiGL Automation

@ 18:32:23UTC : 🔄 Unit Test started.

github-actions · 2025-06-26T18:32:28Z

GiGL Automation

@ 18:32:27UTC : 🔄 E2E Test started.

@ 20:00:36UTC : ✅ Workflow completed successfully.

github-actions · 2025-06-26T18:32:32Z

GiGL Automation

@ 18:32:32UTC : 🔄 Integration Test started.

@ 19:20:47UTC : ✅ Workflow completed successfully.

kmontemayor2-sc · 2025-06-26T20:13:57Z

/unit_test

github-actions · 2025-06-26T20:14:09Z

GiGL Automation

@ 20:14:09UTC : 🔄 Unit Test started.

kmontemayor2-sc · 2025-06-26T21:52:02Z

python/gigl/distributed/utils/neighborloader.py

+    elif isinstance(input_nodes, abc.Mapping):
+        if len(input_nodes) != 1:
+            raise ValueError(
+                f"If input_nodes is provided as a mapping, it must contain exactly one key/value pair. Received: {input_nodes}. This may happen if you call Loader(node_ids=dataset.node_ids) with a heterogeneous dataset."
+            )
+        node_type, node_ids = next(iter(input_nodes.items()))
+        is_labeled_homoogeneous = node_type == DEFAULT_HOMOGENEOUS_NODE_TYPE


FYI I'm ok with not adding this change - we can just add the util if there's pushback here.

mkolodner-sc

Thanks Kyle!

python/gigl/distributed/dist_ablp_neighborloader.py

mkolodner-sc · 2025-07-01T18:15:31Z

python/gigl/distributed/utils/neighborloader.py

+
+
+@dataclass(frozen=True)
+class _ResolvedNodeSamplerInput:


I'd prefer not to add anotherNodeSamplerInput derivative if we can avoid it for the sake of reducing the complexity of our codebase -- is the value here primarily the is_labeled_homogeneous field? Is there any way we can just have resolve_node_sampler_input_from_user_input return a Tuple[NodeSamplerInput, bool] or even a Tuple[NodeType, torch.Tensor, bool], since it seems like the only place this is used in in the __init__ of the ABLPLoader?

Yeah I wanted to do this to avoid the three sized tuple.

I understand the apprehension here about the new dataclass though - do you think renaming it could help? Something like _ParsedInputs or equivalent?

mkolodner-sc

Thanks! LGTM provided comments are addressed

kmontemayor2-sc · 2025-07-01T20:15:04Z

/unit_test

github-actions · 2025-07-01T20:15:16Z

GiGL Automation

@ 20:15:15UTC : 🔄 Unit Test started.

@ 20:52:38UTC : ✅ Workflow completed successfully.

wip to allow dict inputs

64fac2d

kmonte added 2 commits June 26, 2025 20:08

with tests and docs

463f60f

run tests and comment update

d88276d

kmontemayor2-sc changed the title ~~wip to allow dict inputs~~ Remove the need for to_homogeneous(dataset.train_node_ids) for Labeled homogeneous inputs Jun 26, 2025

kmontemayor2-sc commented Jun 26, 2025

View reviewed changes

kmonte added 3 commits June 26, 2025 21:53

format

df28cfd

Merge branch 'main' into kmonte/allow-dict

f8bf0bd

update docstrings

2a59b95

mkolodner-sc reviewed Jul 1, 2025

View reviewed changes

Merge branch 'main' into kmonte/allow-dict

bf5f1c7

mkolodner-sc approved these changes Jul 1, 2025

View reviewed changes

kmonte and others added 2 commits July 1, 2025 20:14

swap to return tuple

e43a93a

Merge branch 'main' into kmonte/allow-dict

a80eea9



		@dataclass(frozen=True)
		class _ResolvedNodeSamplerInput:

Remove the need for to_homogeneous(dataset.train_node_ids) for Labeled homogeneous inputs #115

Are you sure you want to change the base?

Remove the need for to_homogeneous(dataset.train_node_ids) for Labeled homogeneous inputs #115

Uh oh!

Conversation

kmontemayor2-sc commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kmontemayor2-sc commented Jun 26, 2025

Uh oh!

kmontemayor2-sc commented Jun 26, 2025

Uh oh!

kmontemayor2-sc commented Jun 26, 2025

Uh oh!

github-actions bot commented Jun 26, 2025

GiGL Automation

Uh oh!

github-actions bot commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

github-actions bot commented Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

kmontemayor2-sc commented Jun 26, 2025

Uh oh!

github-actions bot commented Jun 26, 2025

GiGL Automation

Uh oh!

kmontemayor2-sc Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

mkolodner-sc left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mkolodner-sc Jul 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kmontemayor2-sc Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

mkolodner-sc left a comment

Choose a reason for hiding this comment

Uh oh!

kmontemayor2-sc commented Jul 1, 2025

Uh oh!

github-actions bot commented Jul 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

GiGL Automation

Uh oh!

Uh oh!

Remove the need for `to_homogeneous(dataset.train_node_ids)` for Labeled homogeneous inputs #115

Remove the need for `to_homogeneous(dataset.train_node_ids)` for Labeled homogeneous inputs #115

kmontemayor2-sc commented Jun 26, 2025 •

edited

Loading

github-actions bot commented Jun 26, 2025 •

edited

Loading

github-actions bot commented Jun 26, 2025 •

edited

Loading

mkolodner-sc Jul 1, 2025 •

edited

Loading

github-actions bot commented Jul 1, 2025 •

edited

Loading